Shanghang Zhang

Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning

Apr 13, 2024

SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera

Apr 12, 2024

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding

Apr 11, 2024

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Apr 01, 2024

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection

Mar 25, 2024

DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing

Mar 21, 2024

DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments

Feb 29, 2024

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track

Feb 27, 2024

Building Flexible Machine Learning Models for Scientific Computing at Scale

Feb 25, 2024

Proximity QA: Unleashing the Power of Multi-Modal Large Language Models for Spatial Proximity Analysis

Jan 31, 2024