Picture for Chao Xu

Chao Xu

School of Software, Tianjin University

Language Model Based Text-to-Audio Generation: Anti-Causally Aligned Collaborative Residual Transformers

Add code
Oct 06, 2025
Viaarxiv icon

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Add code
Sep 26, 2025
Viaarxiv icon

AniME: Adaptive Multi-Agent Planning for Long Animation Generation

Add code
Aug 27, 2025
Viaarxiv icon

TAR-TVG: Enhancing VLMs with Timestamp Anchor-Constrained Reasoning for Temporal Video Grounding

Add code
Aug 11, 2025
Viaarxiv icon

BokehDiff: Neural Lens Blur with One-Step Diffusion

Add code
Jul 24, 2025
Viaarxiv icon

Ground-Effect-Aware Modeling and Control for Multicopters

Add code
Jun 24, 2025
Viaarxiv icon

I$^2$S-TFCKD: Intra-Inter Set Knowledge Distillation with Time-Frequency Calibration for Speech Enhancement

Add code
Jun 16, 2025
Viaarxiv icon

Graph Neural Network Aided Detection for the Multi-User Multi-Dimensional Index Modulated Uplink

Add code
May 27, 2025
Viaarxiv icon

PSC: Extending Context Window of Large Language Models via Phase Shift Calibration

Add code
May 18, 2025
Viaarxiv icon

InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation

Add code
Apr 15, 2025
Viaarxiv icon