Xiaoyang Tan

RoGA: Towards Generalizable Deepfake Detection through Robust Gradient Alignment

May 27, 2025
Via arXiv

Contrastive Desensitization Learning for Cross Domain Face Forgery Detection

May 27, 2025
Via arXiv

Variational OOD State Correction for Offline Reinforcement Learning

May 01, 2025
Via arXiv

Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning

Apr 03, 2025
Via arXiv

Transductive Off-policy Proximal Policy Optimization

Jun 06, 2024
Via arXiv

Highway Reinforcement Learning

May 28, 2024
Via arXiv

HiQA: A Hierarchical Contextual Augmentation RAG for Massive Documents QA

Feb 01, 2024
Via arXiv

ProxyFormer: Proxy Alignment Assisted Point Cloud Completion with Missing Part Sensitive Transformer

Feb 28, 2023
Via arXiv

Contextual Conservative Q-Learning for Offline Reinforcement Learning

Jan 16, 2023
Via arXiv

Smoothing Advantage Learning

Mar 20, 2022
Via arXiv