Xin Liu

The Hong Kong University of Science and Technology

Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations

Dec 25, 2025
Via arXiv

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Dec 23, 2025
Via arXiv

Step-GUI Technical Report

Dec 19, 2025
Via arXiv

Towards a Science of Scaling Agent Systems

Dec 17, 2025
Via arXiv

Optimizing the Adversarial Perturbation with a Momentum-based Adaptive Matrix

Dec 16, 2025
Via arXiv

HyperEdit: Unlocking Instruction-based Text Editing in LLMs via Hypernetworks

Dec 14, 2025
Via arXiv

WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance

Nov 17, 2025
Via arXiv

Virtual Width Networks

Nov 17, 2025
Via arXiv

C$^3$TG: Conflict-aware, Composite, and Collaborative Controlled Text Generation

Nov 16, 2025
Via arXiv

Multi-agent Self-triage System with Medical Flowcharts

Nov 16, 2025
Via arXiv