Picture for Souradip Chakraborty

Souradip Chakraborty

SAIL: Self-Improving Efficient Online Alignment of Large Language Models

Add code
Jun 21, 2024
Figure 1 for SAIL: Self-Improving Efficient Online Alignment of Large Language Models
Figure 2 for SAIL: Self-Improving Efficient Online Alignment of Large Language Models
Figure 3 for SAIL: Self-Improving Efficient Online Alignment of Large Language Models
Figure 4 for SAIL: Self-Improving Efficient Online Alignment of Large Language Models
Viaarxiv icon

Is poisoning a real threat to LLM alignment? Maybe more so than you think

Add code
Jun 17, 2024
Viaarxiv icon

DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

Add code
Jun 16, 2024
Viaarxiv icon

Transfer Q Star: Principled Decoding for LLM Alignment

Add code
May 30, 2024
Viaarxiv icon

On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities

Add code
Feb 24, 2024
Viaarxiv icon

Provably Sample Efficient RLHF via Active Preference Optimization

Add code
Feb 16, 2024
Viaarxiv icon

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

Add code
Feb 14, 2024
Figure 1 for MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences
Figure 2 for MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences
Figure 3 for MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences
Figure 4 for MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences
Viaarxiv icon

Beyond Text: Improving LLM's Decision Making for Robot Navigation via Vocal Cues

Add code
Feb 05, 2024
Figure 1 for Beyond Text: Improving LLM's Decision Making for Robot Navigation via Vocal Cues
Figure 2 for Beyond Text: Improving LLM's Decision Making for Robot Navigation via Vocal Cues
Figure 3 for Beyond Text: Improving LLM's Decision Making for Robot Navigation via Vocal Cues
Figure 4 for Beyond Text: Improving LLM's Decision Making for Robot Navigation via Vocal Cues
Viaarxiv icon

REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback

Add code
Dec 22, 2023
Viaarxiv icon

Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey

Add code
Oct 23, 2023
Figure 1 for Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey
Figure 2 for Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey
Viaarxiv icon