Picture for Zeyuan Chen

Zeyuan Chen

CoAct-1: Computer-using Agents with Coding as Actions

Add code
Aug 05, 2025
Viaarxiv icon

YOLO-Count: Differentiable Object Counting for Text-to-Image Generation

Add code
Aug 01, 2025
Viaarxiv icon

DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion

Add code
Jul 30, 2025
Viaarxiv icon

ClutterDexGrasp: A Sim-to-Real System for General Dexterous Grasping in Cluttered Scenes

Add code
Jun 17, 2025
Viaarxiv icon

Adaptive Visuo-Tactile Fusion with Predictive Force Attention for Dexterous Manipulation

Add code
May 20, 2025
Viaarxiv icon

X-Dancer: Expressive Music to Human Dance Video Generation

Add code
Feb 24, 2025
Viaarxiv icon

X-Dyna: Expressive Dynamic Human Image Animation

Add code
Jan 20, 2025
Viaarxiv icon

ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models

Add code
Dec 09, 2024
Figure 1 for ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models
Figure 2 for ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models
Figure 3 for ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models
Figure 4 for ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models
Viaarxiv icon

CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search

Add code
Dec 03, 2024
Figure 1 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Figure 2 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Figure 3 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Figure 4 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Viaarxiv icon

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Add code
Sep 05, 2024
Figure 1 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Figure 2 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Figure 3 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Figure 4 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Viaarxiv icon