Yao Hu

Alibaba Group

Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors

Jan 22, 2026
Via arXiv

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Jan 07, 2026
Via arXiv

Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models

Jan 07, 2026
Via arXiv

EComStage: Stage-wise and Orientation-specific Benchmarking for Large Language Models in E-commerce

Jan 06, 2026
Via arXiv

CrossVid: A Comprehensive Benchmark for Evaluating Cross-Video Reasoning in Multimodal Large Language Models

Nov 15, 2025
Via arXiv

RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services

Nov 10, 2025
Via arXiv

TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework

Nov 07, 2025
Via arXiv

Interleaving Reasoning for Better Text-to-Image Generation

Sep 09, 2025
Via arXiv

SelfAug: Mitigating Catastrophic Forgetting in Retrieval-Augmented Generation via Distribution Self-Alignment

Sep 04, 2025
Via arXiv

Multi-Granularity Distribution Modeling for Video Watch Time Prediction via Exponential-Gaussian Mixture Network

Aug 18, 2025
Via arXiv