Picture for Tong Zhang

Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game

Add code
May 31, 2022
Figure 1 for Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game
Figure 2 for Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game
Figure 3 for Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game
Viaarxiv icon

MulT: An End-to-End Multitask Learning Transformer

Add code
May 17, 2022
Figure 1 for MulT: An End-to-End Multitask Learning Transformer
Figure 2 for MulT: An End-to-End Multitask Learning Transformer
Viaarxiv icon

Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions

Add code
May 13, 2022
Figure 1 for Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions
Viaarxiv icon

Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy

Add code
Apr 13, 2022
Figure 1 for Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy
Figure 2 for Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy
Figure 3 for Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy
Figure 4 for Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy
Viaarxiv icon

Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective

Add code
Apr 10, 2022
Figure 1 for Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective
Figure 2 for Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective
Figure 3 for Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective
Figure 4 for Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective
Viaarxiv icon

Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling

Add code
Mar 15, 2022
Figure 1 for Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling
Viaarxiv icon

RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering

Add code
Mar 14, 2022
Figure 1 for RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
Figure 2 for RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
Figure 3 for RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
Figure 4 for RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
Viaarxiv icon

Reconfigurable Intelligent Surface Assisted OFDM Relaying: Subcarrier Matching with Balanced SNR

Add code
Mar 03, 2022
Figure 1 for Reconfigurable Intelligent Surface Assisted OFDM Relaying: Subcarrier Matching with Balanced SNR
Figure 2 for Reconfigurable Intelligent Surface Assisted OFDM Relaying: Subcarrier Matching with Balanced SNR
Figure 3 for Reconfigurable Intelligent Surface Assisted OFDM Relaying: Subcarrier Matching with Balanced SNR
Figure 4 for Reconfigurable Intelligent Surface Assisted OFDM Relaying: Subcarrier Matching with Balanced SNR
Viaarxiv icon

Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets

Add code
Feb 15, 2022
Figure 1 for Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets
Viaarxiv icon

Achieving Minimax Rates in Pool-Based Batch Active Learning

Add code
Feb 11, 2022
Viaarxiv icon