Picture for Xin Cheng

Xin Cheng

Fellow, IEEE

Unified Synthesis of Compositional Speech and Sound from Free-Form Text Prompts

Add code
May 27, 2026
Viaarxiv icon

Beyond Trajectory-Level Attribution: Graph-Based Credit Assignment for Agentic Reinforcement Learning

Add code
May 26, 2026
Viaarxiv icon

SyncDPO: Enhancing Temporal Synchronization in Video-Audio Joint Generation via Preference Learning

Add code
May 12, 2026
Viaarxiv icon

CUTEv2: Unified and Configurable Matrix Extension for Diverse CPU Architectures with Minimal Design Overhead

Add code
Apr 13, 2026
Viaarxiv icon

Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks

Add code
Feb 26, 2026
Viaarxiv icon

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Add code
Feb 11, 2026
Viaarxiv icon

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Add code
Jan 12, 2026
Viaarxiv icon

AgentOCR: Reimagining Agent History via Optical Self-Compression

Add code
Jan 08, 2026
Viaarxiv icon

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Add code
Nov 19, 2025
Figure 1 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Figure 2 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Figure 3 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Figure 4 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Viaarxiv icon

WildSpoof Challenge Evaluation Plan

Add code
Aug 23, 2025
Viaarxiv icon