Mingkui Tan

Nanyang Technological University

Dynamic Ensemble Reasoning for LLM Experts

Dec 10, 2024

Towards Long Video Understanding via Fine-detailed Video Story Generation

Dec 09, 2024

LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences

Dec 02, 2024

Enhancing Perception Capabilities of Multimodal LLMs with Training-free Fusion

Dec 02, 2024

A Cross-Scene Benchmark for Open-World Drone Active Tracking

Dec 01, 2024

Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation

Nov 19, 2024

Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs

Sep 27, 2024

CoNav: A Benchmark for Human-Centered Collaborative Navigation

Jun 04, 2024

MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling

May 22, 2024

G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images

Apr 11, 2024