Picture for Yu-Wing Tai

Yu-Wing Tai

Tencent

SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents

Add code
Jun 05, 2025
Viaarxiv icon

Agentic 3D Scene Generation with Spatially Contextualized VLMs

Add code
May 26, 2025
Viaarxiv icon

MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning

Add code
May 26, 2025
Viaarxiv icon

ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts

Add code
May 24, 2025
Viaarxiv icon

FusionSegReID: Advancing Person Re-Identification with Multimodal Retrieval and Precise Segmentation

Add code
Mar 27, 2025
Viaarxiv icon

Multimodal Generation of Animatable 3D Human Models with AvatarForge

Add code
Mar 11, 2025
Viaarxiv icon

Dynamic Path Navigation for Motion Agents with LLM Reasoning

Add code
Mar 10, 2025
Viaarxiv icon

Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts

Add code
Mar 10, 2025
Viaarxiv icon

ReelWave: A Multi-Agent Framework Toward Professional Movie Sound Generation

Add code
Mar 10, 2025
Viaarxiv icon

WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents

Add code
Feb 21, 2025
Viaarxiv icon