Picture for Lanqing Hong

Lanqing Hong

CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference

Add code
Jun 25, 2024
Viaarxiv icon

MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes

Add code
May 23, 2024
Figure 1 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Figure 2 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Figure 3 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Figure 4 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Viaarxiv icon

Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment

Add code
May 01, 2024
Viaarxiv icon

Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases

Add code
Apr 16, 2024
Figure 1 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Figure 2 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Figure 3 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Figure 4 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Viaarxiv icon

CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs

Add code
Mar 25, 2024
Figure 1 for CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs
Figure 2 for CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs
Figure 3 for CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs
Figure 4 for CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs
Viaarxiv icon

Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation

Add code
Mar 22, 2024
Figure 1 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 2 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 3 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 4 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Viaarxiv icon

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Add code
Mar 20, 2024
Figure 1 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Figure 2 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Figure 3 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Figure 4 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Viaarxiv icon

Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts

Add code
Feb 08, 2024
Viaarxiv icon

G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection

Add code
Feb 07, 2024
Figure 1 for G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection
Figure 2 for G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection
Figure 3 for G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection
Figure 4 for G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection
Viaarxiv icon

SERF: Fine-Grained Interactive 3D Segmentation and Editing with Radiance Fields

Add code
Dec 26, 2023
Viaarxiv icon