Picture for Yongqi Chen

Yongqi Chen

Faster Video Diffusion with Trainable Sparse Attention

Add code
May 19, 2025
Figure 1 for Faster Video Diffusion with Trainable Sparse Attention
Figure 2 for Faster Video Diffusion with Trainable Sparse Attention
Figure 3 for Faster Video Diffusion with Trainable Sparse Attention
Figure 4 for Faster Video Diffusion with Trainable Sparse Attention
Viaarxiv icon

Fast Video Generation with Sliding Tile Attention

Add code
Feb 06, 2025
Figure 1 for Fast Video Generation with Sliding Tile Attention
Figure 2 for Fast Video Generation with Sliding Tile Attention
Figure 3 for Fast Video Generation with Sliding Tile Attention
Figure 4 for Fast Video Generation with Sliding Tile Attention
Viaarxiv icon

Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video

Add code
Jan 24, 2025
Viaarxiv icon

From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking

Add code
Jun 24, 2024
Figure 1 for From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking
Figure 2 for From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking
Figure 3 for From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking
Figure 4 for From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking
Viaarxiv icon

Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning

Add code
Mar 17, 2024
Figure 1 for Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning
Figure 2 for Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning
Figure 3 for Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning
Figure 4 for Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning
Viaarxiv icon

Customizable Perturbation Synthesis for Robust SLAM Benchmarking

Add code
Feb 12, 2024
Viaarxiv icon