Picture for Lei Zhang

Lei Zhang

Sid

Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding

Add code
Jul 13, 2024
Figure 1 for Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Figure 2 for Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Figure 3 for Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Figure 4 for Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Viaarxiv icon

LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

Add code
Jul 12, 2024
Viaarxiv icon

A Text-to-Game Engine for UGC-Based Role-Playing Games

Add code
Jul 11, 2024
Figure 1 for A Text-to-Game Engine for UGC-Based Role-Playing Games
Figure 2 for A Text-to-Game Engine for UGC-Based Role-Playing Games
Figure 3 for A Text-to-Game Engine for UGC-Based Role-Playing Games
Figure 4 for A Text-to-Game Engine for UGC-Based Role-Playing Games
Viaarxiv icon

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

Add code
Jul 11, 2024
Figure 1 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Figure 2 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Figure 3 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Figure 4 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Viaarxiv icon

Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter

Add code
Jul 11, 2024
Figure 1 for Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter
Figure 2 for Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter
Figure 3 for Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter
Figure 4 for Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter
Viaarxiv icon

AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review

Add code
Jul 09, 2024
Figure 1 for AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review
Figure 2 for AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review
Figure 3 for AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review
Figure 4 for AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review
Viaarxiv icon

EMBANet: A Flexible Efffcient Multi-branch Attention Network

Add code
Jul 07, 2024
Viaarxiv icon

ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation

Add code
Jul 02, 2024
Viaarxiv icon

SymPoint Revolutionized: Boosting Panoptic Symbol Spotting with Layer Feature Enhancement

Add code
Jul 02, 2024
Viaarxiv icon

TokenPacker: Efficient Visual Projector for Multimodal LLM

Add code
Jul 02, 2024
Figure 1 for TokenPacker: Efficient Visual Projector for Multimodal LLM
Figure 2 for TokenPacker: Efficient Visual Projector for Multimodal LLM
Figure 3 for TokenPacker: Efficient Visual Projector for Multimodal LLM
Figure 4 for TokenPacker: Efficient Visual Projector for Multimodal LLM
Viaarxiv icon