Picture for Han Hu

Han Hu

University of Toronto

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

Add code
Jul 29, 2025
Viaarxiv icon

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Add code
Jul 28, 2025
Viaarxiv icon

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

Add code
May 22, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

Sensing-Assisted Channel Prediction in Complex Wireless Environments: An LLM-Based Approach

Add code
May 14, 2025
Viaarxiv icon

Federated Deconfounding and Debiasing Learning for Out-of-Distribution Generalization

Add code
May 08, 2025
Viaarxiv icon

R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation

Add code
May 04, 2025
Viaarxiv icon

Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities

Add code
May 02, 2025
Viaarxiv icon

Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training

Add code
Apr 17, 2025
Viaarxiv icon

Optimal Stepsize for Diffusion Sampling

Add code
Mar 27, 2025
Viaarxiv icon