Picture for Yunhai Tong

Yunhai Tong

You Can't Ignore Either: Unifying Structure and Feature Denoising for Robust Graph Learning

Add code
Aug 01, 2024
Viaarxiv icon

LLAVADI: What Matters For Multimodal Large Language Models Distillation

Add code
Jul 28, 2024
Viaarxiv icon

MotionBooth: Motion-Aware Customized Text-to-Video Generation

Add code
Jun 25, 2024
Viaarxiv icon

SEFraud: Graph-based Self-Explainable Fraud Detection via Interpretative Mask Learning

Add code
Jun 17, 2024
Viaarxiv icon

SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow

Add code
May 30, 2024
Viaarxiv icon

VG4D: Vision-Language Model Goes 4D Video Recognition

Add code
Apr 17, 2024
Viaarxiv icon

Explore In-Context Segmentation via Latent Diffusion Models

Add code
Mar 14, 2024
Viaarxiv icon

Towards Language-Driven Video Inpainting via Multimodal Large Language Models

Add code
Jan 18, 2024
Figure 1 for Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Figure 2 for Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Figure 3 for Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Figure 4 for Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Viaarxiv icon

RAP-SAM: Towards Real-Time All-Purpose Segment Anything

Add code
Jan 18, 2024
Viaarxiv icon

DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection

Add code
Oct 02, 2023
Viaarxiv icon