Picture for Tian Xu

Tian Xu

Adversarial Imitation Learning with General Function Approximation: Theoretical Analysis and Practical Algorithms

Add code
May 03, 2026
Viaarxiv icon

How Can Reinforcement Learning Achieve Expert-level Placement?

Add code
Apr 28, 2026
Viaarxiv icon

Off-Policy Value-Based Reinforcement Learning for Large Language Models

Add code
Mar 24, 2026
Viaarxiv icon

Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints

Add code
Mar 24, 2026
Viaarxiv icon

Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation

Add code
Nov 01, 2024
Figure 1 for Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Figure 2 for Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Figure 3 for Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Figure 4 for Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Viaarxiv icon

Collaborative motion planning for multi-manipulator systems through Reinforcement Learning and Dynamic Movement Primitives

Add code
Oct 01, 2024
Viaarxiv icon

Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

Add code
Aug 29, 2024
Viaarxiv icon

AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials

Add code
Dec 27, 2023
Figure 1 for AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials
Figure 2 for AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials
Figure 3 for AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials
Figure 4 for AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials
Viaarxiv icon

Policy Optimization in RLHF: The Impact of Out-of-preference Data

Add code
Dec 17, 2023
Viaarxiv icon

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Add code
Oct 17, 2023
Figure 1 for ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Figure 2 for ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Figure 3 for ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Figure 4 for ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Viaarxiv icon