Picture for Joyce Chai

Joyce Chai

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Add code
Jun 23, 2025
Viaarxiv icon

Proactive Assistant Dialogue Generation from Streaming Egocentric Videos

Add code
Jun 06, 2025
Viaarxiv icon

Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models

Add code
May 16, 2025
Viaarxiv icon

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Add code
Apr 22, 2025
Viaarxiv icon

VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation

Add code
Mar 19, 2025
Viaarxiv icon

Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors

Add code
Feb 18, 2025
Viaarxiv icon

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Add code
Jan 23, 2025
Viaarxiv icon

Explainable Procedural Mistake Detection

Add code
Dec 16, 2024
Viaarxiv icon

Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use

Add code
Oct 31, 2024
Viaarxiv icon

Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Under Ambiguities

Add code
Oct 22, 2024
Viaarxiv icon