Picture for Hanghang Tong

Hanghang Tong

University of Illinois Urbana-Champaign

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Add code
May 11, 2026
Viaarxiv icon

Recursive Multi-Agent Systems

Add code
Apr 28, 2026
Viaarxiv icon

PAPERMIND: Benchmarking Agentic Reasoning and Critique over Scientific Papers in Multimodal LLMs

Add code
Apr 23, 2026
Viaarxiv icon

TRIMS: Trajectory-Ranked Instruction Masked Supervision for Diffusion Language Models

Add code
Apr 01, 2026
Viaarxiv icon

Prune as You Generate: Online Rollout Pruning for Faster and Better RLVR

Add code
Mar 25, 2026
Viaarxiv icon

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Add code
Mar 10, 2026
Viaarxiv icon

MC-Search: Evaluating and Enhancing Multimodal Agentic Search with Structured Long Reasoning Chains

Add code
Mar 01, 2026
Viaarxiv icon

dLLM: Simple Diffusion Language Modeling

Add code
Feb 26, 2026
Viaarxiv icon

FeDecider: An LLM-Based Framework for Federated Cross-Domain Recommendation

Add code
Feb 17, 2026
Viaarxiv icon

Graph homophily booster: Reimagining the role of discrete features in heterophilic graph learning

Add code
Feb 06, 2026
Viaarxiv icon