Picture for Yu Cheng

Yu Cheng

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Add code
May 20, 2025
Viaarxiv icon

Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference

Add code
May 19, 2025
Viaarxiv icon

SMFusion: Semantic-Preserving Fusion of Multimodal Medical Images for Enhanced Clinical Diagnosis

Add code
May 18, 2025
Viaarxiv icon

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Add code
May 13, 2025
Figure 1 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Figure 2 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Figure 3 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Figure 4 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Viaarxiv icon

Visually Interpretable Subtask Reasoning for Visual Question Answering

Add code
May 12, 2025
Figure 1 for Visually Interpretable Subtask Reasoning for Visual Question Answering
Figure 2 for Visually Interpretable Subtask Reasoning for Visual Question Answering
Figure 3 for Visually Interpretable Subtask Reasoning for Visual Question Answering
Figure 4 for Visually Interpretable Subtask Reasoning for Visual Question Answering
Viaarxiv icon

Emergent Multi-View Fidelity in Autonomous UAV Swarm Sport Injury Detection

Add code
May 10, 2025
Viaarxiv icon

TileLang: A Composable Tiled Programming Model for AI Systems

Add code
Apr 24, 2025
Viaarxiv icon

Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision

Add code
Apr 22, 2025
Figure 1 for Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision
Figure 2 for Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision
Figure 3 for Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision
Figure 4 for Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision
Viaarxiv icon

Learning to Reason under Off-Policy Guidance

Add code
Apr 22, 2025
Viaarxiv icon

A Deep Learning Framework for Sequence Mining with Bidirectional LSTM and Multi-Scale Attention

Add code
Apr 21, 2025
Viaarxiv icon