Picture for Yiming Wu

Yiming Wu

On-Device Diffusion Transformer Policy for Efficient Robot Manipulation

Add code
Aug 01, 2025
Viaarxiv icon

Improving vision-language alignment with graph spiking hybrid Networks

Add code
Jan 31, 2025
Viaarxiv icon

MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks

Add code
Nov 29, 2024
Figure 1 for MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks
Figure 2 for MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks
Figure 3 for MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks
Figure 4 for MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks
Viaarxiv icon

Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models

Add code
Nov 27, 2024
Figure 1 for Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models
Figure 2 for Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models
Figure 3 for Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models
Figure 4 for Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models
Viaarxiv icon

Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach

Add code
Nov 03, 2024
Figure 1 for Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach
Figure 2 for Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach
Figure 3 for Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach
Figure 4 for Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach
Viaarxiv icon

Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval

Add code
May 24, 2024
Figure 1 for Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval
Figure 2 for Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval
Figure 3 for Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval
Figure 4 for Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval
Viaarxiv icon

PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis

Add code
May 24, 2024
Figure 1 for PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis
Figure 2 for PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis
Figure 3 for PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis
Figure 4 for PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis
Viaarxiv icon

SOEDiff: Efficient Distillation for Small Object Editing

Add code
May 15, 2024
Viaarxiv icon

Training-Free Unsupervised Prompt for Vision-Language Models

Add code
Apr 25, 2024
Figure 1 for Training-Free Unsupervised Prompt for Vision-Language Models
Figure 2 for Training-Free Unsupervised Prompt for Vision-Language Models
Figure 3 for Training-Free Unsupervised Prompt for Vision-Language Models
Figure 4 for Training-Free Unsupervised Prompt for Vision-Language Models
Viaarxiv icon

Progressive Target-Styled Feature Augmentation for Unsupervised Domain Adaptation on Point Clouds

Add code
Nov 27, 2023
Viaarxiv icon