Picture for Jun Zhu

Jun Zhu

Tsinghua University

Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition

Add code
Apr 27, 2024
Figure 1 for Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Figure 2 for Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Figure 3 for Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Figure 4 for Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Viaarxiv icon

Exploring the Transferability of Visual Prompting for Multimodal Large Language Models

Add code
Apr 17, 2024
Figure 1 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Figure 2 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Figure 3 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Figure 4 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Viaarxiv icon

SparseDM: Toward Sparse Efficient Diffusion Models

Add code
Apr 16, 2024
Viaarxiv icon

Accelerating Transformer Pre-Training with 2:4 Sparsity

Add code
Apr 02, 2024
Viaarxiv icon

FlexiDreamer: Single Image-to-3D Generation with FlexiCubes

Add code
Apr 01, 2024
Viaarxiv icon

Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches

Add code
Mar 31, 2024
Figure 1 for Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches
Figure 2 for Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches
Figure 3 for Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches
Figure 4 for Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches
Viaarxiv icon

DreamReward: Text-to-3D Generation with Human Preference

Add code
Mar 21, 2024
Figure 1 for DreamReward: Text-to-3D Generation with Human Preference
Figure 2 for DreamReward: Text-to-3D Generation with Human Preference
Figure 3 for DreamReward: Text-to-3D Generation with Human Preference
Figure 4 for DreamReward: Text-to-3D Generation with Human Preference
Viaarxiv icon

Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization

Add code
Mar 19, 2024
Viaarxiv icon

CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model

Add code
Mar 08, 2024
Viaarxiv icon

DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training

Add code
Mar 08, 2024
Figure 1 for DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Figure 2 for DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Figure 3 for DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Figure 4 for DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Viaarxiv icon