
Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?
Key Points
- This paper rigorously investigates the effectiveness of repository-level context files, such as AGENTS.md, for coding agents on both established benchmarks and a novel dataset with developer-provided files.
- Surprisingly, the study reveals that LLM-generated context files tend to decrease agent task success rates and increase inference costs by over 20%, while human-written files yield only marginal performance gains.
- Trace analysis indicates that context files encourage broader exploration and testing, leading to the conclusion that unnecessary instructions make tasks harder, and context files should contain only minimal requirements.
This paper rigorously investigates the effectiveness of repository-level context files, such as AGENTS.md, for coding agents, a practice widely encouraged by agent developers. The authors evaluate coding agents' task completion performance in two settings: established SWE-BENCH LITE tasks from popular repositories using LLM-generated context files, and a novel benchmark, AGENTBENCH, derived from real-world issues in less popular repositories that already contain developer-committed context files.
The core methodology revolves around a comparative evaluation across three context file settings:
- NONE: No context files are provided to the agent.
- LLM: A context file is automatically generated by an LLM (using the agent's recommended initialization command and model) from the pre-patch repository state. This setting is applied to both SWE-BENCH LITE and AGENTBENCH.
- HUMAN: A developer-provided context file, present in the repository in its pre-patch state, is used. This setting is only applicable to AGENTBENCH instances, as SWE-BENCH LITE repositories do not contain such files.
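The three settings can be pictured as an evaluation grid over the two benchmarks. A minimal sketch, with illustrative names not taken from the paper's code (the HUMAN setting only exists where repositories ship their own context files):

```python
# Hypothetical sketch of the study's evaluation grid: which context-file
# settings apply to which benchmark. Names are illustrative.
SETTINGS = {
    "SWE-BENCH LITE": ["NONE", "LLM"],        # no developer-written context files exist
    "AGENTBENCH": ["NONE", "LLM", "HUMAN"],   # repos ship their own AGENTS.md/CLAUDE.md
}

def evaluation_conditions():
    """Yield each (benchmark, setting) pair evaluated in the study."""
    for benchmark, settings in SETTINGS.items():
        for setting in settings:
            yield benchmark, setting

conditions = list(evaluation_conditions())
```

This makes the asymmetry explicit: five conditions in total, with the HUMAN setting absent from SWE-BENCH LITE.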
To facilitate this evaluation, the authors construct AGENTBENCH, a new benchmark comprising 138 unique Python software engineering tasks, created from real GitHub issues across 12 recent and niche repositories, each featuring developer-written context files. The generation of AGENTBENCH instances follows a five-stage process:
- Finding Repositories: GitHub search is used to identify repositories containing AGENTS.md or CLAUDE.md at their root, filtering for Python projects with test suites and at least 400 pull requests (PRs) to ensure sufficient data for instance extraction. This process yielded 12 candidate repositories.
- Filtering Pull Requests: PRs are filtered using a combination of rule-based checks and an LLM agent (specifically, GPT-5.2 with CODEX). Only PRs that reference at least one issue, modify at least one Python file, and are assessed by the agent to introduce deterministic, testable behaviors are kept. Unlike SWE-BENCH LITE, AGENTBENCH does not require PRs to contain unit tests, accommodating the less strict practices of niche repositories.
- Environment Set-Up: For each selected PR and its corresponding repository state, an LLM agent (GPT-5.2 with CODEX) is tasked with producing a script that sets up the execution environment, runs the repository's test suite, and stores the results as a machine-readable dictionary. Instances are retained only if the resulting dictionary indicates at least one passing test.
- Task Descriptions: A third LLM agent generates a standardized and detailed task description for each instance. This description is based on the PR description, associated issues, and the original golden patch. The descriptions are structured into six sections: description, steps to reproduce, expected behavior, observed behavior, specification, and additional information, ensuring precise specifications without leaking the solution.
- Generating Unit Tests: Since most collected PRs do not modify or add unit tests, an LLM agent generates specific unit tests that pass for any implementation resolving the described task. These generated tests are verified to fail on the base repository and pass on the patched repository. The final test set for each instance combines these generated tests with a maximal set of existing repository tests that pass on the patched repository. The success rate for an instance is defined as the percentage of predicted patches for which the full final test set passes.
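The fail-to-pass verification and the per-instance success rate described above can be sketched as follows. This is an illustrative reconstruction, not the paper's code; `run_tests` is a hypothetical helper that executes a repository state's test suite and returns a pass/fail map:

```python
# Illustrative sketch of the fail-to-pass check and success-rate metric.
# run_tests is a hypothetical helper: repo state -> {test_id: passed}.
from typing import Callable, Dict, List

TestResults = Dict[str, bool]

def verify_generated_tests(run_tests: Callable[[str], TestResults],
                           base_repo: str, patched_repo: str) -> bool:
    """Keep the generated tests only if every one fails on the base
    repository and every one passes once the golden patch is applied."""
    base = run_tests(base_repo)
    patched = run_tests(patched_repo)
    return all(not passed for passed in base.values()) and all(patched.values())

def success_rate(per_patch_results: List[TestResults]) -> float:
    """Percentage of predicted patches whose full final test set passes."""
    if not per_patch_results:
        return 0.0
    solved = sum(all(results.values()) for results in per_patch_results)
    return 100.0 * solved / len(per_patch_results)
```

A patch counts as solving the instance only if every test in the combined set passes, which is why a single failing generated test marks the patch as unsuccessful.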
The evaluation utilizes four agent-LLM pairings: CLAUDE CODE with SONNET-4.5, CODEX with GPT-5.2 and with GPT-5.1 MINI, and QWEN CODE with QWEN3-30B-CODER. Performance is measured by success rate, the number of steps (agent-environment interactions), and the monetary cost of LLM inference.
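Aggregating the three reported metrics over a set of runs is straightforward; a minimal sketch with an assumed record structure (not the paper's actual data format):

```python
# Minimal sketch of aggregating the study's three metrics. The RunRecord
# fields are assumptions about what each evaluation run would record.
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class RunRecord:
    solved: bool      # did the final test set pass for the predicted patch?
    steps: int        # number of agent-environment interactions
    cost_usd: float   # monetary cost of LLM inference for this run

def summarize(runs: List[RunRecord]) -> Dict[str, float]:
    """Compute success rate (%), average steps, and average cost."""
    n = len(runs)
    return {
        "success_rate_pct": 100.0 * sum(r.solved for r in runs) / n,
        "avg_steps": sum(r.steps for r in runs) / n,
        "avg_cost_usd": sum(r.cost_usd for r in runs) / n,
    }
```

Comparing these summaries between the NONE, LLM, and HUMAN settings is exactly the comparison the study's headline numbers report.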
The study's surprising findings indicate that:
- LLM-generated context files generally reduce task success rates (by 0.5% on SWE-BENCH LITE and 2% on AGENTBENCH) while increasing inference cost by over 20% on average across models and prompts. They also increase the average number of steps required to complete a task.
- Developer-provided context files (HUMAN) marginally improve performance compared to providing no context files (an increase of 4% on average) but also increase average steps and costs by up to 19%.
- Context file overviews are not effective: the presence of context files, whether LLM-generated or human-provided, does not meaningfully reduce the number of steps an agent needs before it first interacts with the files relevant to the task.
- Context files as redundant documentation: When documentation-related files are removed from the codebase, LLM-generated context files tend to outperform developer-provided ones, suggesting LLM-generated files are largely redundant with existing documentation, while developer-written ones provide additional, potentially disruptive, information. This implies that LLM-generated context files can be beneficial where existing documentation is scarce.
- Behavioral changes: Trace analysis reveals that context files encourage broader exploration (e.g., more thorough testing, file traversal) and increase the use of repository-specific tooling. Coding agents generally respect instructions within context files (e.g., specific tools mentioned are used significantly more).
- Increased reasoning tokens: The presence of context files leads to an increase in reasoning tokens used by agents like GPT-5.2 and GPT-5.1 MINI, suggesting that the additional instructions make tasks conceptually harder for the agents.
The authors conclude that unnecessary requirements from context files make tasks harder, contradicting current agent-developer recommendations. They suggest that human-written context files should only describe minimal, essential requirements. The evaluation framework presented aims to aid in improving the helpfulness of LLM-generated context files.