Picture for Tian Xu

Tian Xu

Cooperative Long Rope Skipping via Multi-Agent Reinforcement Learning

Add code
Jun 06, 2026
Viaarxiv icon

Adversarial Imitation Learning with General Function Approximation: Theoretical Analysis and Practical Algorithms

Add code
May 03, 2026
Viaarxiv icon

How Can Reinforcement Learning Achieve Expert-level Placement?

Add code
Apr 28, 2026
Viaarxiv icon

Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints

Add code
Mar 24, 2026
Viaarxiv icon

Off-Policy Value-Based Reinforcement Learning for Large Language Models

Add code
Mar 24, 2026
Viaarxiv icon

Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation

Add code
Nov 01, 2024
Figure 1 for Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Figure 2 for Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Figure 3 for Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Figure 4 for Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Viaarxiv icon

Collaborative motion planning for multi-manipulator systems through Reinforcement Learning and Dynamic Movement Primitives

Add code
Oct 01, 2024
Viaarxiv icon

Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

Add code
Aug 29, 2024
Viaarxiv icon

AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials

Add code
Dec 27, 2023
Figure 1 for AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials
Figure 2 for AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials
Figure 3 for AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials
Figure 4 for AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials
Viaarxiv icon

Policy Optimization in RLHF: The Impact of Out-of-preference Data

Add code
Dec 17, 2023
Viaarxiv icon