Picture for Litao Guo

Litao Guo

HKUST

A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

Add code
Dec 16, 2025
Viaarxiv icon

PresentCoach: Dual-Agent Presentation Coaching through Exemplars and Interactive Feedback

Add code
Nov 19, 2025
Viaarxiv icon

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

Add code
Oct 10, 2025
Viaarxiv icon

Enhancing Vector Quantization with Distributional Matching: A Theoretical and Empirical Study

Add code
Jun 18, 2025
Viaarxiv icon

ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback

Add code
May 23, 2025
Viaarxiv icon

Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control

Add code
Dec 06, 2023
Figure 1 for Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Figure 2 for Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Figure 3 for Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Figure 4 for Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Viaarxiv icon

MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation

Add code
Nov 28, 2023
Figure 1 for MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation
Figure 2 for MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation
Figure 3 for MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation
Figure 4 for MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation
Viaarxiv icon