Picture for Peng Gao

Peng Gao

University of Massachusetts Amherst

Spatial Preference Rewarding for MLLMs Spatial Understanding

Add code
Oct 16, 2025
Viaarxiv icon

ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning

Add code
Oct 01, 2025
Figure 1 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Figure 2 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Figure 3 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Figure 4 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Viaarxiv icon

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

Add code
Jul 23, 2025
Viaarxiv icon

Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation

Add code
Jul 17, 2025
Viaarxiv icon

Non-Overlap-Aware Egocentric Pose Estimation for Collaborative Perception in Connected Autonomy

Add code
Jun 17, 2025
Viaarxiv icon

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Add code
Jun 16, 2025
Viaarxiv icon

Distinctive Feature Codec: Adaptive Segmentation for Efficient Speech Representation

Add code
May 24, 2025
Viaarxiv icon

Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning

Add code
May 20, 2025
Viaarxiv icon

Towards Adaptive Meta-Gradient Adversarial Examples for Visual Tracking

Add code
May 13, 2025
Viaarxiv icon

Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

Add code
May 08, 2025
Figure 1 for Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding
Figure 2 for Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding
Figure 3 for Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding
Figure 4 for Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding
Viaarxiv icon