Haitao Mi

The End of Manual Decoding: Towards Truly End-to-End Language Models

Oct 30, 2025
Via arXiv

Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents

Oct 16, 2025
Via arXiv

CLUE: Non-parametric Verification from Experience via Hidden-State Clustering

Oct 02, 2025
Via arXiv

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning

Oct 01, 2025
Via arXiv

UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression

Sep 19, 2025
Via arXiv

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Sep 18, 2025
Via arXiv

EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving

Sep 16, 2025
Via arXiv

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models

Sep 11, 2025
Via arXiv

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Aug 27, 2025
Via arXiv

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Aug 07, 2025
Via arXiv