Picture for Balaraman Ravindran

Balaraman Ravindran

PREFINE: Preference-Based Implicit Reward and Cost Fine-Tuning for Safety Alignment

Add code
May 20, 2026
Viaarxiv icon

How Much Online RL is Enough? Informative Rollouts for Offline Preference Optimization in RLVR

Add code
May 20, 2026
Viaarxiv icon

Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics

Add code
Feb 13, 2026
Viaarxiv icon

OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories

Add code
Feb 11, 2026
Viaarxiv icon

SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories

Add code
Nov 14, 2025
Figure 1 for SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
Figure 2 for SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
Figure 3 for SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
Figure 4 for SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
Viaarxiv icon

LExT: Towards Evaluating Trustworthiness of Natural Language Explanations

Add code
Apr 08, 2025
Figure 1 for LExT: Towards Evaluating Trustworthiness of Natural Language Explanations
Figure 2 for LExT: Towards Evaluating Trustworthiness of Natural Language Explanations
Figure 3 for LExT: Towards Evaluating Trustworthiness of Natural Language Explanations
Figure 4 for LExT: Towards Evaluating Trustworthiness of Natural Language Explanations
Viaarxiv icon

International AI Safety Report

Add code
Jan 29, 2025
Figure 1 for International AI Safety Report
Figure 2 for International AI Safety Report
Figure 3 for International AI Safety Report
Figure 4 for International AI Safety Report
Viaarxiv icon

QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities

Add code
Nov 30, 2024
Figure 1 for QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities
Figure 2 for QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities
Figure 3 for QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities
Figure 4 for QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities
Viaarxiv icon

Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation

Add code
May 07, 2024
Figure 1 for Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation
Figure 2 for Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation
Figure 3 for Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation
Figure 4 for Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation
Viaarxiv icon

InSaAF: Incorporating Safety through Accuracy and Fairness | Are LLMs ready for the Indian Legal Domain?

Add code
Feb 21, 2024
Figure 1 for InSaAF: Incorporating Safety through Accuracy and Fairness | Are LLMs ready for the Indian Legal Domain?
Figure 2 for InSaAF: Incorporating Safety through Accuracy and Fairness | Are LLMs ready for the Indian Legal Domain?
Figure 3 for InSaAF: Incorporating Safety through Accuracy and Fairness | Are LLMs ready for the Indian Legal Domain?
Figure 4 for InSaAF: Incorporating Safety through Accuracy and Fairness | Are LLMs ready for the Indian Legal Domain?
Viaarxiv icon