Alert button
Picture for Andi Nika

Andi Nika

Alert button

Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

Add code
Bookmark button
Alert button
Mar 04, 2024
Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban, Georgios Tzannetos, Goran Radanović, Adish Singla

Figure 1 for Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences
Viaarxiv icon

Corruption-Robust Offline Two-Player Zero-Sum Markov Games

Add code
Bookmark button
Alert button
Mar 04, 2024
Andi Nika, Debmalya Mandal, Adish Singla, Goran Radanović

Figure 1 for Corruption-Robust Offline Two-Player Zero-Sum Markov Games
Figure 2 for Corruption-Robust Offline Two-Player Zero-Sum Markov Games
Viaarxiv icon

Corruption Robust Offline Reinforcement Learning with Human Feedback

Add code
Bookmark button
Alert button
Feb 09, 2024
Debmalya Mandal, Andi Nika, Parameswaran Kamalaruban, Adish Singla, Goran Radanović

Viaarxiv icon

Contextual Combinatorial Volatile Bandits via Gaussian Processes

Add code
Bookmark button
Alert button
Oct 05, 2021
Andi Nika, Sepehr Elahi, Cem Tekin

Figure 1 for Contextual Combinatorial Volatile Bandits via Gaussian Processes
Figure 2 for Contextual Combinatorial Volatile Bandits via Gaussian Processes
Figure 3 for Contextual Combinatorial Volatile Bandits via Gaussian Processes
Figure 4 for Contextual Combinatorial Volatile Bandits via Gaussian Processes
Viaarxiv icon

Pareto Active Learning with Gaussian Processes and Adaptive Discretization

Add code
Bookmark button
Alert button
Jun 24, 2020
Andi Nika, Kerem Bozgan, Çağın Ararat, Cem Tekin

Figure 1 for Pareto Active Learning with Gaussian Processes and Adaptive Discretization
Figure 2 for Pareto Active Learning with Gaussian Processes and Adaptive Discretization
Figure 3 for Pareto Active Learning with Gaussian Processes and Adaptive Discretization
Figure 4 for Pareto Active Learning with Gaussian Processes and Adaptive Discretization
Viaarxiv icon