Picture for Yanbin Hao

Yanbin Hao

A New Multi-Domain Benchmark for Micro-Action Recognition and Detection

Add code
Jun 12, 2026
Viaarxiv icon

A Multi-Modal Framework with Cross-Subject Pseudo-Labeling and Semantic Alignment for Micro-Gesture Recognition

Add code
Jun 11, 2026
Viaarxiv icon

Motion Reinforces Appearance: RGB-Skeleton Gated Residual Fusion for Micro-Gesture Online Recognition

Add code
Jun 10, 2026
Viaarxiv icon

SoftCap: Soft-Budget Control for Diffusion Transformer Acceleration

Add code
May 26, 2026
Viaarxiv icon

Accelerating Controllable Generation via Hybrid-grained Cache

Add code
Nov 14, 2025
Viaarxiv icon

SeViCES: Unifying Semantic-Visual Evidence Consensus for Long Video Understanding

Add code
Oct 23, 2025
Viaarxiv icon

UniSVG: A Unified Dataset for Vector Graphic Understanding and Generation with Multimodal Large Language Models

Add code
Aug 11, 2025
Viaarxiv icon

SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models

Add code
Mar 10, 2025
Viaarxiv icon

Accelerating Diffusion Transformer via Gradient-Optimized Cache

Add code
Mar 07, 2025
Figure 1 for Accelerating Diffusion Transformer via Gradient-Optimized Cache
Figure 2 for Accelerating Diffusion Transformer via Gradient-Optimized Cache
Figure 3 for Accelerating Diffusion Transformer via Gradient-Optimized Cache
Figure 4 for Accelerating Diffusion Transformer via Gradient-Optimized Cache
Viaarxiv icon

Accelerating Diffusion Transformer via Error-Optimized Cache

Add code
Jan 31, 2025
Figure 1 for Accelerating Diffusion Transformer via Error-Optimized Cache
Figure 2 for Accelerating Diffusion Transformer via Error-Optimized Cache
Figure 3 for Accelerating Diffusion Transformer via Error-Optimized Cache
Figure 4 for Accelerating Diffusion Transformer via Error-Optimized Cache
Viaarxiv icon