Ming Yang

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

May 27, 2025
Via arXiv

Automated Text-to-Table for Reasoning-Intensive Table QA: Pipeline Design and Benchmarking Insights

May 26, 2025
Via arXiv

Weather-Magician: Reconstruction and Rendering Framework for 4D Weather Synthesis In Real Time

May 26, 2025
Via arXiv

GMatch: Geometry-Constrained Feature Matching for RGB-D Object Pose Estimation

May 22, 2025
Via arXiv

FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization

May 14, 2025
Via arXiv

JAEGER: Dual-Level Humanoid Whole-Body Controller

May 10, 2025
Via arXiv

From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval

Apr 25, 2025
Via arXiv

Single-loop Algorithms for Stochastic Non-convex Optimization with Weakly-Convex Constraints

Apr 21, 2025
Via arXiv

Knowledge Rectification for Camouflaged Object Detection: Unlocking Insights from Low-Quality Data

Mar 28, 2025
Via arXiv

Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey

Mar 17, 2025
Via arXiv