Peng Gao

University of Massachusetts Amherst

Distinctive Feature Codec: Adaptive Segmentation for Efficient Speech Representation

May 24, 2025
Via arXiv

Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning

May 20, 2025
Via arXiv

Towards Adaptive Meta-Gradient Adversarial Examples for Visual Tracking

May 13, 2025
Via arXiv

Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

May 08, 2025
Via arXiv

TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

Apr 22, 2025
Via arXiv

From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Apr 22, 2025
Via arXiv

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Apr 10, 2025
Via arXiv

OmniCaptioner: One Captioner to Rule Them All

Apr 09, 2025
Via arXiv

Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision

Apr 08, 2025
Via arXiv

Localization and Tracking for Cooperative Users in Multi-RIS-assisted Systems: Theoretical Analysis and Principles of Interpretations

Apr 07, 2025
Via arXiv