Picture for Kelvin Niu

Kelvin Niu

AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents

Add code
Feb 09, 2026
Viaarxiv icon

A Scalable Measure of Loss Landscape Curvature for Analyzing the Training Dynamics of LLMs

Add code
Jan 23, 2026
Viaarxiv icon

Scaling and Distilling Transformer Models for sEMG

Add code
Jul 29, 2025
Viaarxiv icon

AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench

Add code
Jul 03, 2025
Viaarxiv icon