Shuhang Xu

VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models

May 28, 2025
Via arXiv

PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation

May 27, 2025
Via arXiv

CoMet: Metaphor-Driven Covert Communication for Multi-Agent Language Games

May 23, 2025
Via arXiv

Probe by Gaming: A Game-based Benchmark for Assessing Conceptual Knowledge in LLMs

May 23, 2025
Via arXiv