Alert button
Picture for Vincent Liu

Vincent Liu

Alert button

Switching the Loss Reduces the Cost in Batch Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 12, 2024
Alex Ayoub, Kaiwen Wang, Vincent Liu, Samuel Robertson, James McInerney, Dawen Liang, Nathan Kallus, Csaba Szepesvári

Figure 1 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Figure 2 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Figure 3 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Viaarxiv icon

Under the Surface: Tracking the Artifactuality of LLM-Generated Data

Add code
Bookmark button
Alert button
Jan 30, 2024
Debarati Das, Karin De Langis, Anna Martin-Boyle, Jaehyung Kim, Minhwa Lee, Zae Myung Kim, Shirley Anugrah Hayati, Risako Owan, Bin Hu, Ritik Parkar, Ryan Koo, Jonginn Park, Aahan Tyagi, Libby Ferland, Sanjali Roy, Vincent Liu, Dongyeop Kang

Viaarxiv icon

When is Offline Policy Selection Sample Efficient for Reinforcement Learning?

Add code
Bookmark button
Alert button
Dec 04, 2023
Vincent Liu, Prabhat Nagarajan, Andrew Patterson, Martha White

Viaarxiv icon

Measuring and Mitigating Interference in Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 10, 2023
Vincent Liu, Han Wang, Ruo Yu Tao, Khurram Javed, Adam White, Martha White

Figure 1 for Measuring and Mitigating Interference in Reinforcement Learning
Figure 2 for Measuring and Mitigating Interference in Reinforcement Learning
Figure 3 for Measuring and Mitigating Interference in Reinforcement Learning
Figure 4 for Measuring and Mitigating Interference in Reinforcement Learning
Viaarxiv icon

Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments

Add code
Bookmark button
Alert button
Feb 23, 2023
Vincent Liu, Yash Chandak, Philip Thomas, Martha White

Figure 1 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Figure 2 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Figure 3 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Figure 4 for Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Viaarxiv icon

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving

Add code
Bookmark button
Alert button
Feb 22, 2023
Zhuohan Li, Lianmin Zheng, Yinmin Zhong, Vincent Liu, Ying Sheng, Xin Jin, Yanping Huang, Zhifeng Chen, Hao Zhang, Joseph E. Gonzalez, Ion Stoica

Figure 1 for AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
Figure 2 for AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
Figure 3 for AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
Figure 4 for AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
Viaarxiv icon

No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL

Add code
Bookmark button
Alert button
May 18, 2022
Han Wang, Archit Sakhadeo, Adam White, James Bell, Vincent Liu, Xutong Zhao, Puer Liu, Tadashi Kozuno, Alona Fyshe, Martha White

Figure 1 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Figure 2 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Figure 3 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Figure 4 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Viaarxiv icon

Investigating the Properties of Neural Network Representations in Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 30, 2022
Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas, Raksha Kumaraswamy, Vincent Liu, Adam White

Figure 1 for Investigating the Properties of Neural Network Representations in Reinforcement Learning
Figure 2 for Investigating the Properties of Neural Network Representations in Reinforcement Learning
Figure 3 for Investigating the Properties of Neural Network Representations in Reinforcement Learning
Figure 4 for Investigating the Properties of Neural Network Representations in Reinforcement Learning
Viaarxiv icon

DABS: A Domain-Agnostic Benchmark for Self-Supervised Learning

Add code
Bookmark button
Alert button
Nov 23, 2021
Alex Tamkin, Vincent Liu, Rongfei Lu, Daniel Fein, Colin Schultz, Noah Goodman

Figure 1 for DABS: A Domain-Agnostic Benchmark for Self-Supervised Learning
Figure 2 for DABS: A Domain-Agnostic Benchmark for Self-Supervised Learning
Figure 3 for DABS: A Domain-Agnostic Benchmark for Self-Supervised Learning
Figure 4 for DABS: A Domain-Agnostic Benchmark for Self-Supervised Learning
Viaarxiv icon