Picture for Mengdi Wang

Mengdi Wang

Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models

Add code
Jun 04, 2025
Viaarxiv icon

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

Add code
May 26, 2025
Viaarxiv icon

Genome-Bench: A Scientific Reasoning Benchmark from Real-World Expert Discussions

Add code
May 26, 2025
Viaarxiv icon

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Add code
May 26, 2025
Viaarxiv icon

MMaDA: Multimodal Large Diffusion Language Models

Add code
May 21, 2025
Viaarxiv icon

Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?

Add code
May 21, 2025
Viaarxiv icon

PoseX: AI Defeats Physics Approaches on Protein-Ligand Cross Docking

Add code
May 03, 2025
Viaarxiv icon

WenyanGPT: A Large Language Model for Classical Chinese Tasks

Add code
Apr 29, 2025
Viaarxiv icon

OTC: Optimal Tool Calls via Reinforcement Learning

Add code
Apr 21, 2025
Viaarxiv icon

NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models

Add code
Apr 20, 2025
Viaarxiv icon