Joyce Chai

AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies

Aug 11, 2025
Via arXiv

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Jun 23, 2025
Via arXiv

Proactive Assistant Dialogue Generation from Streaming Egocentric Videos

Jun 06, 2025
Via arXiv

Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models

May 16, 2025
Via arXiv

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Apr 22, 2025
Via arXiv

VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation

Mar 19, 2025
Via arXiv

Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors

Feb 18, 2025
Via arXiv

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Jan 23, 2025
Via arXiv

Explainable Procedural Mistake Detection

Dec 16, 2024
Via arXiv

Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use

Oct 31, 2024
Via arXiv