Picture for Thanh Nguyen-Tang

Thanh Nguyen-Tang

Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks

Add code
Jul 16, 2024
Viaarxiv icon

Offline Multitask Representation Learning for Reinforcement Learning

Add code
Mar 18, 2024
Viaarxiv icon

On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond

Add code
Jan 06, 2024
Viaarxiv icon

SigFormer: Signature Transformers for Deep Hedging

Add code
Oct 20, 2023
Viaarxiv icon

A Cosine Similarity-based Method for Out-of-Distribution Detection

Add code
Jun 23, 2023
Viaarxiv icon

VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation

Add code
Mar 04, 2023
Viaarxiv icon

On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation

Add code
Nov 23, 2022
Viaarxiv icon

Two-Stage Neural Contextual Bandits for Personalised News Recommendation

Add code
Jun 26, 2022
Figure 1 for Two-Stage Neural Contextual Bandits for Personalised News Recommendation
Figure 2 for Two-Stage Neural Contextual Bandits for Personalised News Recommendation
Figure 3 for Two-Stage Neural Contextual Bandits for Personalised News Recommendation
Figure 4 for Two-Stage Neural Contextual Bandits for Personalised News Recommendation
Viaarxiv icon

On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency

Add code
Mar 03, 2022
Figure 1 for On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency
Figure 2 for On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency
Figure 3 for On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency
Figure 4 for On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency
Viaarxiv icon

Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization

Add code
Nov 27, 2021
Figure 1 for Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization
Figure 2 for Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization
Figure 3 for Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization
Figure 4 for Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization
Viaarxiv icon