Alert button
Picture for Ankit Anand

Ankit Anand

Alert button

Code as Reward: Empowering Reinforcement Learning with VLMs

Add code
Bookmark button
Alert button
Feb 07, 2024
David Venuto, Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand

Viaarxiv icon

Transfer Learning for the Prediction of Entity Modifiers in Clinical Text: Application to Opioid Use Disorder Case Detection

Add code
Bookmark button
Alert button
Feb 05, 2024
Abdullateef I. Almudaifer, Whitney Covington, JaMor Hairston, Zachary Deitch, Ankit Anand, Caleb M. Carroll, Estera Crisan, William Bradford, Lauren Walter, Eaton Ellen, Sue S. Feldman, John D. Osborne

Viaarxiv icon

GeomVerse: A Systematic Evaluation of Large Models for Geometric Reasoning

Add code
Bookmark button
Alert button
Dec 19, 2023
Mehran Kazemi, Hamidreza Alvari, Ankit Anand, Jialin Wu, Xi Chen, Radu Soricut

Viaarxiv icon

Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search

Add code
Bookmark button
Alert button
Nov 06, 2023
Abbas Mehrabian, Ankit Anand, Hyunjik Kim, Nicolas Sonnerat, Matej Balog, Gheorghe Comanici, Tudor Berariu, Andrew Lee, Anian Ruoss, Anna Bulanova, Daniel Toyama, Sam Blackwell, Bernardino Romera Paredes, Petar Veličković, Laurent Orseau, Joonkyung Lee, Anurag Murty Naredla, Doina Precup, Adam Zsolt Wagner

Figure 1 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Figure 2 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Figure 3 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Figure 4 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Viaarxiv icon

AutoMix: Automatically Mixing Language Models

Add code
Bookmark button
Alert button
Oct 19, 2023
Aman Madaan, Pranjal Aggarwal, Ankit Anand, Srividya Pranavi Potharaju, Swaroop Mishra, Pei Zhou, Aditya Gupta, Dheeraj Rajagopal, Karthik Kappaganthu, Yiming Yang, Shyam Upadhyay, Mausam, Manaal Faruqui

Figure 1 for AutoMix: Automatically Mixing Language Models
Figure 2 for AutoMix: Automatically Mixing Language Models
Figure 3 for AutoMix: Automatically Mixing Language Models
Figure 4 for AutoMix: Automatically Mixing Language Models
Viaarxiv icon

Policy composition in reinforcement learning via multi-objective policy optimization

Add code
Bookmark button
Alert button
Aug 30, 2023
Shruti Mishra, Ankit Anand, Jordan Hoffmann, Nicolas Heess, Martin Riedmiller, Abbas Abdolmaleki, Doina Precup

Figure 1 for Policy composition in reinforcement learning via multi-objective policy optimization
Figure 2 for Policy composition in reinforcement learning via multi-objective policy optimization
Figure 3 for Policy composition in reinforcement learning via multi-objective policy optimization
Figure 4 for Policy composition in reinforcement learning via multi-objective policy optimization
Viaarxiv icon

Accelerating exploration and representation learning with offline pre-training

Add code
Bookmark button
Alert button
Mar 31, 2023
Bogdan Mazoure, Jake Bruce, Doina Precup, Rob Fergus, Ankit Anand

Figure 1 for Accelerating exploration and representation learning with offline pre-training
Figure 2 for Accelerating exploration and representation learning with offline pre-training
Figure 3 for Accelerating exploration and representation learning with offline pre-training
Figure 4 for Accelerating exploration and representation learning with offline pre-training
Viaarxiv icon

Proving Theorems using Incremental Learning and Hindsight Experience Replay

Add code
Bookmark button
Alert button
Dec 20, 2021
Eser Aygün, Laurent Orseau, Ankit Anand, Xavier Glorot, Vlad Firoiu, Lei M. Zhang, Doina Precup, Shibl Mourad

Figure 1 for Proving Theorems using Incremental Learning and Hindsight Experience Replay
Figure 2 for Proving Theorems using Incremental Learning and Hindsight Experience Replay
Figure 3 for Proving Theorems using Incremental Learning and Hindsight Experience Replay
Viaarxiv icon