Kevin Qinghong Lin

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Mar 25, 2026, via arXiv

Code2World: A GUI World Model via Renderable Code Generation

Feb 10, 2026, via arXiv

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

Jan 07, 2026, via arXiv

ShowUI-$π$: Flow-based Generative Models as GUI Dexterous Hands

Dec 31, 2025, via arXiv

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Dec 18, 2025, via arXiv

Computer-Use Agents as Judges for Generative User Interface

Nov 19, 2025, via arXiv

Grounding Computer Use Agents on Human Demonstrations

Nov 10, 2025, via arXiv

Paper2Video: Automatic Video Generation from Scientific Papers

Oct 06, 2025, via arXiv

Reinforcement Learning in Vision: A Survey

Aug 11, 2025, via arXiv

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

May 27, 2025, via arXiv