Han Hu

University of Toronto

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

May 22, 2025
Via arXiv

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

May 21, 2025
Via arXiv

Sensing-Assisted Channel Prediction in Complex Wireless Environments: An LLM-Based Approach

May 14, 2025
Via arXiv

Federated Deconfounding and Debiasing Learning for Out-of-Distribution Generalization

May 08, 2025
Via arXiv

R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation

May 04, 2025
Via arXiv

Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities

May 02, 2025
Via arXiv

Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training

Apr 17, 2025
Via arXiv

Optimal Stepsize for Diffusion Sampling

Mar 27, 2025
Via arXiv

Equivariant Image Modeling

Mar 24, 2025
Via arXiv

Tokenize Image as a Set

Mar 20, 2025
Via arXiv