Picture for Minghan Li

Minghan Li

RFKG-CoT: Relation-Driven Adaptive Hop-count Selection and Few-Shot Path Guidance for Knowledge-Aware QA

Add code
Dec 17, 2025
Viaarxiv icon

SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing

Add code
Oct 29, 2025
Figure 1 for SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing
Figure 2 for SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing
Figure 3 for SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing
Figure 4 for SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing
Viaarxiv icon

TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking

Add code
Oct 08, 2025
Viaarxiv icon

Query Expansion in the Age of Pre-trained and Large Language Models: A Comprehensive Survey

Add code
Sep 09, 2025
Viaarxiv icon

A Survey of Long-Document Retrieval in the PLM and LLM Era

Add code
Sep 09, 2025
Figure 1 for A Survey of Long-Document Retrieval in the PLM and LLM Era
Figure 2 for A Survey of Long-Document Retrieval in the PLM and LLM Era
Figure 3 for A Survey of Long-Document Retrieval in the PLM and LLM Era
Figure 4 for A Survey of Long-Document Retrieval in the PLM and LLM Era
Viaarxiv icon

VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding

Add code
Jul 17, 2025
Viaarxiv icon

TrackVLA: Embodied Visual Tracking in the Wild

Add code
May 29, 2025
Figure 1 for TrackVLA: Embodied Visual Tracking in the Wild
Figure 2 for TrackVLA: Embodied Visual Tracking in the Wild
Figure 3 for TrackVLA: Embodied Visual Tracking in the Wild
Figure 4 for TrackVLA: Embodied Visual Tracking in the Wild
Viaarxiv icon

HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard

Add code
Mar 18, 2025
Figure 1 for HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard
Figure 2 for HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard
Figure 3 for HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard
Figure 4 for HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard
Viaarxiv icon

FiVE: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models

Add code
Mar 17, 2025
Viaarxiv icon

Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models

Add code
Jan 28, 2025
Figure 1 for Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models
Figure 2 for Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models
Figure 3 for Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models
Figure 4 for Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models
Viaarxiv icon