Picture for Kevin Qinghong Lin

Kevin Qinghong Lin

3D-CoS: A New 3D Reconstruction Paradigm Based on VLM Code Synthesis

Add code
Jun 09, 2026
Viaarxiv icon

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

Add code
Jun 09, 2026
Viaarxiv icon

Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation?

Add code
Jun 04, 2026
Viaarxiv icon

Agents' Last Exam

Add code
Jun 03, 2026
Viaarxiv icon

Demo2Tutorial: From Human Experience to Multimodal Software Tutorials

Add code
Jun 02, 2026
Viaarxiv icon

Residual Decoder Adapter: ID-Preserving Tokenizer Adaption for Autoregressive Text Rendering

Add code
Jun 01, 2026
Viaarxiv icon

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

Add code
May 19, 2026
Viaarxiv icon

AI for Auto-Research: Roadmap & User Guide

Add code
May 18, 2026
Viaarxiv icon

Checkup2Action: A Multimodal Clinical Check-up Report Dataset for Patient-Oriented Action Card Generation

Add code
May 13, 2026
Viaarxiv icon

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Add code
Apr 24, 2026
Viaarxiv icon