Agent psychometrics: Task-level performance prediction in agentic coding benchmarks

Chris Ge*, Daria Kryvosheieva*, Daniel Fried, Uzay Girit, Kaivalya Hariharan
arXiv preprint

Different types of syntactic agreement recruit the same units within large language models

Daria Kryvosheieva, Andrea de Varda, Evelina Fedorenko, Greta Tuckute
ACL 2026

Efficient code embeddings from code generation models

Daria Kryvosheieva, Saba Sturua, Michael Günther, Scott Martens, Han Xiao
DL4Code @ NeurIPS 2025

Controlled evaluation of syntactic knowledge in multilingual language models

Daria Kryvosheieva and Roger Levy
LoResLM @ COLING 2025