Picture for Xin Li

Xin Li

College of Business, City University of Hong Kong, Hong Kong, China

AMAP Agentic Planning Technical Report

Add code
Dec 31, 2025
Viaarxiv icon

AHA: Aligning Large Audio-Language Models for Reasoning Hallucinations via Counterfactual Hard Negatives

Add code
Dec 30, 2025
Viaarxiv icon

ManchuTTS: Towards High-Quality Manchu Speech Synthesis via Flow Matching and Hierarchical Text Representation

Add code
Dec 27, 2025
Viaarxiv icon

TravelBench: A Real-World Benchmark for Multi-Turn and Tool-Augmented Travel Planning

Add code
Dec 27, 2025
Viaarxiv icon

Asynchronous Fast-Slow Vision-Language-Action Policies for Whole-Body Robotic Manipulation

Add code
Dec 23, 2025
Figure 1 for Asynchronous Fast-Slow Vision-Language-Action Policies for Whole-Body Robotic Manipulation
Figure 2 for Asynchronous Fast-Slow Vision-Language-Action Policies for Whole-Body Robotic Manipulation
Figure 3 for Asynchronous Fast-Slow Vision-Language-Action Policies for Whole-Body Robotic Manipulation
Figure 4 for Asynchronous Fast-Slow Vision-Language-Action Policies for Whole-Body Robotic Manipulation
Viaarxiv icon

The Geometry of Abstraction: Continual Learning via Recursive Quotienting

Add code
Dec 20, 2025
Viaarxiv icon

A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis

Add code
Dec 16, 2025
Viaarxiv icon

DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping

Add code
Dec 10, 2025
Viaarxiv icon

UniSER: A Foundation Model for Unified Soft Effects Removal

Add code
Nov 18, 2025
Viaarxiv icon

FIA-Edit: Frequency-Interactive Attention for Efficient and High-Fidelity Inversion-Free Text-Guided Image Editing

Add code
Nov 15, 2025
Viaarxiv icon