Created on June 24, 2025
2025
New Preprint: LMR-BENCH: Evaluating LLM Agent’s Ability on Reproducing Language Modeling Research.