Picture for Heng Tao Shen

Heng Tao Shen

Truth in the Few: High-Value Data Selection for Efficient Multi-Modal Reasoning

Add code
Jun 05, 2025
Viaarxiv icon

InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning

Add code
May 20, 2025
Viaarxiv icon

Policy Contrastive Decoding for Robotic Foundation Models

Add code
May 19, 2025
Viaarxiv icon

Towards Generalized and Training-Free Text-Guided Semantic Manipulation

Add code
Apr 24, 2025
Viaarxiv icon

Syzygy of Thoughts: Improving LLM CoT with the Minimal Free Resolution

Add code
Apr 16, 2025
Viaarxiv icon

Exploring Kernel Transformations for Implicit Neural Representations

Add code
Apr 07, 2025
Viaarxiv icon

Attention Hijackers: Detect and Disentangle Attention Hijacking in LVLMs for Hallucination Mitigation

Add code
Mar 11, 2025
Viaarxiv icon

New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration

Add code
Feb 28, 2025
Viaarxiv icon

PSCon: Toward Conversational Product Search

Add code
Feb 19, 2025
Viaarxiv icon

Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves

Add code
Dec 16, 2024
Viaarxiv icon