Picture for Mubarak Shah

Mubarak Shah

Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention

Add code
May 28, 2024
Figure 1 for Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
Figure 2 for Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
Figure 3 for Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
Figure 4 for Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
Viaarxiv icon

PTQ4DiT: Post-training Quantization for Diffusion Transformers

Add code
May 25, 2024
Figure 1 for PTQ4DiT: Post-training Quantization for Diffusion Transformers
Figure 2 for PTQ4DiT: Post-training Quantization for Diffusion Transformers
Figure 3 for PTQ4DiT: Post-training Quantization for Diffusion Transformers
Figure 4 for PTQ4DiT: Post-training Quantization for Diffusion Transformers
Viaarxiv icon

Text-guided 3D Human Motion Generation with Keyframe-based Parallel Skip Transformer

Add code
May 24, 2024
Figure 1 for Text-guided 3D Human Motion Generation with Keyframe-based Parallel Skip Transformer
Figure 2 for Text-guided 3D Human Motion Generation with Keyframe-based Parallel Skip Transformer
Figure 3 for Text-guided 3D Human Motion Generation with Keyframe-based Parallel Skip Transformer
Figure 4 for Text-guided 3D Human Motion Generation with Keyframe-based Parallel Skip Transformer
Viaarxiv icon

Curriculum Direct Preference Optimization for Diffusion and Consistency Models

Add code
May 22, 2024
Figure 1 for Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Figure 2 for Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Figure 3 for Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Figure 4 for Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Viaarxiv icon

SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset

Add code
May 12, 2024
Figure 1 for SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset
Figure 2 for SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset
Figure 3 for SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset
Figure 4 for SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset
Viaarxiv icon

Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data

Add code
Apr 10, 2024
Viaarxiv icon

Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models

Add code
Apr 03, 2024
Figure 1 for Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models
Figure 2 for Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models
Figure 3 for Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models
Figure 4 for Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models
Viaarxiv icon

Towards Temporally Consistent Referring Video Object Segmentation

Add code
Mar 28, 2024
Figure 1 for Towards Temporally Consistent Referring Video Object Segmentation
Figure 2 for Towards Temporally Consistent Referring Video Object Segmentation
Figure 3 for Towards Temporally Consistent Referring Video Object Segmentation
Figure 4 for Towards Temporally Consistent Referring Video Object Segmentation
Viaarxiv icon

Composed Video Retrieval via Enriched Context and Discriminative Embeddings

Add code
Mar 25, 2024
Figure 1 for Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Figure 2 for Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Figure 3 for Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Figure 4 for Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Viaarxiv icon

AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation

Add code
Mar 21, 2024
Figure 1 for AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation
Figure 2 for AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation
Figure 3 for AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation
Figure 4 for AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation
Viaarxiv icon