Meng Chu

TraveLLaMA: Facilitating Multi-modal Large Language Models to Understand Urban Scenes and Provide Travel Assistance

Apr 23, 2025
Via arXiv

Understanding Long Videos via LLM-Powered Entity Relation Graphs

Jan 27, 2025
Via arXiv

IRIS: Interactive Responsive Intelligent Segmentation for 3D Affordance Analysis

Sep 17, 2024
Via arXiv

Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatially Relation Matching

Nov 21, 2023
Via arXiv