Yao Hu

Alibaba Group

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Jan 29, 2026

Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time Scaling

Jan 29, 2026

Guiding the Recommender: Information-Aware Auto-Bidding for Content Promotion

Jan 28, 2026

Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors

Jan 22, 2026

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Jan 07, 2026

Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models

Jan 07, 2026

EComStage: Stage-wise and Orientation-specific Benchmarking for Large Language Models in E-commerce

Jan 06, 2026

CrossVid: A Comprehensive Benchmark for Evaluating Cross-Video Reasoning in Multimodal Large Language Models

Nov 15, 2025

RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services

Nov 10, 2025

TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework

Nov 07, 2025