Picture for Shanghang Zhang

Shanghang Zhang

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

Add code
Dec 06, 2024
Figure 1 for Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
Figure 2 for Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
Figure 3 for Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
Figure 4 for Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
Viaarxiv icon

[CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster

Add code
Dec 02, 2024
Figure 1 for [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster
Figure 2 for [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster
Figure 3 for [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster
Figure 4 for [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster
Viaarxiv icon

Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective

Add code
Nov 27, 2024
Figure 1 for Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective
Figure 2 for Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective
Figure 3 for Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective
Figure 4 for Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective
Viaarxiv icon

Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Add code
Nov 27, 2024
Figure 1 for Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Figure 2 for Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Figure 3 for Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Figure 4 for Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Viaarxiv icon

EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting

Add code
Nov 23, 2024
Figure 1 for EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting
Figure 2 for EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting
Figure 3 for EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting
Figure 4 for EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting
Viaarxiv icon

MC-LLaVA: Multi-Concept Personalized Vision-Language Model

Add code
Nov 18, 2024
Figure 1 for MC-LLaVA: Multi-Concept Personalized Vision-Language Model
Figure 2 for MC-LLaVA: Multi-Concept Personalized Vision-Language Model
Figure 3 for MC-LLaVA: Multi-Concept Personalized Vision-Language Model
Figure 4 for MC-LLaVA: Multi-Concept Personalized Vision-Language Model
Viaarxiv icon

Learning from Different Samples: A Source-free Framework for Semi-supervised Domain Adaptation

Add code
Nov 11, 2024
Viaarxiv icon

Training-free Regional Prompting for Diffusion Transformers

Add code
Nov 04, 2024
Viaarxiv icon

Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective

Add code
Oct 29, 2024
Figure 1 for Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective
Figure 2 for Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective
Figure 3 for Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective
Viaarxiv icon

Subgraph Aggregation for Out-of-Distribution Generalization on Graphs

Add code
Oct 29, 2024
Figure 1 for Subgraph Aggregation for Out-of-Distribution Generalization on Graphs
Figure 2 for Subgraph Aggregation for Out-of-Distribution Generalization on Graphs
Figure 3 for Subgraph Aggregation for Out-of-Distribution Generalization on Graphs
Figure 4 for Subgraph Aggregation for Out-of-Distribution Generalization on Graphs
Viaarxiv icon