Yansong Tang

FADE: Frequency-Aware Diffusion Model Factorization for Video Editing

Jun 06, 2025
Via arXiv

VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning

May 24, 2025
Via arXiv

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

May 20, 2025
Via arXiv

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios

May 06, 2025
Via arXiv

InstaRevive: One-Step Image Enhancement via Dynamic Score Matching

Apr 22, 2025
Via arXiv

IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis

Mar 02, 2025
Via arXiv

KV-Edit: Training-Free Image Editing for Precise Background Preservation

Feb 25, 2025
Via arXiv

GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting

Jan 26, 2025
Via arXiv

Localization-Aware Multi-Scale Representation Learning for Repetitive Action Counting

Jan 13, 2025
Via arXiv

AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation

Dec 09, 2024
Via arXiv