Picture for Yansong Tang

Yansong Tang

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

Add code
Apr 27, 2026
Viaarxiv icon

VARestorer: One-Step VAR Distillation for Real-World Image Super-Resolution

Add code
Apr 23, 2026
Viaarxiv icon

TAIHRI: Task-Aware 3D Human Keypoints Localization for Close-Range Human-Robot Interaction

Add code
Apr 10, 2026
Viaarxiv icon

BiDexGrasp: Coordinated Bimanual Dexterous Grasps across Object Geometries and Sizes

Add code
Apr 08, 2026
Viaarxiv icon

Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings

Add code
Feb 14, 2026
Viaarxiv icon

ChatUMM: Robust Context Tracking for Conversational Interleaved Generation

Add code
Feb 06, 2026
Viaarxiv icon

CLAP: Contrastive Latent Action Pretraining for Learning Vision-Language-Action Models from Human Videos

Add code
Jan 07, 2026
Viaarxiv icon

DDAVS: Disentangled Audio Semantics and Delayed Bidirectional Alignment for Audio-Visual Segmentation

Add code
Dec 23, 2025
Viaarxiv icon

Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation

Add code
Dec 23, 2025
Viaarxiv icon

AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent

Add code
Dec 23, 2025
Viaarxiv icon