Alert button
Picture for Viraj Mehta

Viraj Mehta

Alert button

Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration

Add code
Bookmark button
Alert button
Dec 01, 2023
Viraj Mehta, Vikramjeet Das, Ojash Neopane, Yijia Dai, Ilija Bogunovic, Jeff Schneider, Willie Neiswanger

Viaarxiv icon

Kernelized Offline Contextual Dueling Bandits

Add code
Bookmark button
Alert button
Jul 21, 2023
Viraj Mehta, Ojash Neopane, Vikramjeet Das, Sen Lin, Jeff Schneider, Willie Neiswanger

Figure 1 for Kernelized Offline Contextual Dueling Bandits
Figure 2 for Kernelized Offline Contextual Dueling Bandits
Figure 3 for Kernelized Offline Contextual Dueling Bandits
Viaarxiv icon

Near-optimal Policy Identification in Active Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 19, 2022
Xiang Li, Viraj Mehta, Johannes Kirschner, Ian Char, Willie Neiswanger, Jeff Schneider, Andreas Krause, Ilija Bogunovic

Figure 1 for Near-optimal Policy Identification in Active Reinforcement Learning
Figure 2 for Near-optimal Policy Identification in Active Reinforcement Learning
Figure 3 for Near-optimal Policy Identification in Active Reinforcement Learning
Figure 4 for Near-optimal Policy Identification in Active Reinforcement Learning
Viaarxiv icon

Exploration via Planning for Information about the Optimal Trajectory

Add code
Bookmark button
Alert button
Oct 06, 2022
Viraj Mehta, Ian Char, Joseph Abbate, Rory Conlin, Mark D. Boyer, Stefano Ermon, Jeff Schneider, Willie Neiswanger

Figure 1 for Exploration via Planning for Information about the Optimal Trajectory
Figure 2 for Exploration via Planning for Information about the Optimal Trajectory
Figure 3 for Exploration via Planning for Information about the Optimal Trajectory
Figure 4 for Exploration via Planning for Information about the Optimal Trajectory
Viaarxiv icon

BATS: Best Action Trajectory Stitching

Add code
Bookmark button
Alert button
Apr 26, 2022
Ian Char, Viraj Mehta, Adam Villaflor, John M. Dolan, Jeff Schneider

Figure 1 for BATS: Best Action Trajectory Stitching
Figure 2 for BATS: Best Action Trajectory Stitching
Figure 3 for BATS: Best Action Trajectory Stitching
Figure 4 for BATS: Best Action Trajectory Stitching
Viaarxiv icon

Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias

Add code
Bookmark button
Alert button
Dec 13, 2021
Frederic Koehler, Viraj Mehta, Andrej Risteski, Chenghui Zhou

Figure 1 for Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias
Figure 2 for Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias
Figure 3 for Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias
Figure 4 for Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias
Viaarxiv icon

An Experimental Design Perspective on Model-Based Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 09, 2021
Viraj Mehta, Biswajit Paria, Jeff Schneider, Stefano Ermon, Willie Neiswanger

Figure 1 for An Experimental Design Perspective on Model-Based Reinforcement Learning
Figure 2 for An Experimental Design Perspective on Model-Based Reinforcement Learning
Figure 3 for An Experimental Design Perspective on Model-Based Reinforcement Learning
Figure 4 for An Experimental Design Perspective on Model-Based Reinforcement Learning
Viaarxiv icon

Representational aspects of depth and conditioning in normalizing flows

Add code
Bookmark button
Alert button
Oct 02, 2020
Frederic Koehler, Viraj Mehta, Andrej Risteski

Figure 1 for Representational aspects of depth and conditioning in normalizing flows
Figure 2 for Representational aspects of depth and conditioning in normalizing flows
Figure 3 for Representational aspects of depth and conditioning in normalizing flows
Figure 4 for Representational aspects of depth and conditioning in normalizing flows
Viaarxiv icon

Neural Dynamical Systems: Balancing Structure and Flexibility in Physical Prediction

Add code
Bookmark button
Alert button
Jun 23, 2020
Viraj Mehta, Ian Char, Willie Neiswanger, Youngseog Chung, Andrew Oakleigh Nelson, Mark D Boyer, Egemen Kolemen, Jeff Schneider

Figure 1 for Neural Dynamical Systems: Balancing Structure and Flexibility in Physical Prediction
Figure 2 for Neural Dynamical Systems: Balancing Structure and Flexibility in Physical Prediction
Figure 3 for Neural Dynamical Systems: Balancing Structure and Flexibility in Physical Prediction
Figure 4 for Neural Dynamical Systems: Balancing Structure and Flexibility in Physical Prediction
Viaarxiv icon

Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision

Add code
Bookmark button
Alert button
Jun 25, 2018
Kuan Fang, Yuke Zhu, Animesh Garg, Andrey Kurenkov, Viraj Mehta, Li Fei-Fei, Silvio Savarese

Figure 1 for Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision
Figure 2 for Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision
Figure 3 for Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision
Figure 4 for Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision
Viaarxiv icon