Picture for Ziqiao Ma

Ziqiao Ma

Michael Pokorny

Next-Embedding Prediction Makes Strong Vision Learners

Add code
Dec 23, 2025
Viaarxiv icon

DeliveryBench: Can Agents Earn Profit in Real World?

Add code
Dec 22, 2025
Viaarxiv icon

SimWorld-Robotics: Synthesizing Photorealistic and Dynamic Urban Environments for Multimodal Robot Navigation and Collaboration

Add code
Dec 10, 2025
Viaarxiv icon

From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens

Add code
Oct 02, 2025
Viaarxiv icon

AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies

Add code
Aug 11, 2025
Viaarxiv icon

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Add code
Jun 23, 2025
Viaarxiv icon

Can Vision Language Models Infer Human Gaze Direction? A Controlled Study

Add code
Jun 04, 2025
Viaarxiv icon

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Add code
Apr 22, 2025
Figure 1 for Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation
Figure 2 for Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation
Figure 3 for Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation
Figure 4 for Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation
Viaarxiv icon

VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation

Add code
Mar 19, 2025
Viaarxiv icon

Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors

Add code
Feb 18, 2025
Figure 1 for Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors
Figure 2 for Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors
Figure 3 for Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors
Figure 4 for Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors
Viaarxiv icon