Picture for Huazhong Yang

Huazhong Yang

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Add code
May 27, 2025
Viaarxiv icon

PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs

Add code
May 24, 2025
Viaarxiv icon

Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards

Add code
Feb 18, 2025
Viaarxiv icon

FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models

Add code
Dec 30, 2024
Figure 1 for FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models
Viaarxiv icon

MBQ: Modality-Balanced Quantization for Large Vision-Language Models

Add code
Dec 27, 2024
Viaarxiv icon

What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study

Add code
Dec 17, 2024
Viaarxiv icon

Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning

Add code
Sep 25, 2024
Figure 1 for Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning
Figure 2 for Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning
Figure 3 for Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning
Figure 4 for Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning
Viaarxiv icon

Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs

Add code
Jul 01, 2024
Viaarxiv icon

MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

Add code
Jun 21, 2024
Viaarxiv icon

Can LLMs Learn by Teaching? A Preliminary Study

Add code
Jun 20, 2024
Viaarxiv icon