Picture for Dinesh Manocha

Dinesh Manocha

Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning

Add code
Mar 13, 2024
Figure 1 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Figure 2 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Figure 3 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Figure 4 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Viaarxiv icon

On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities

Add code
Feb 24, 2024
Figure 1 for On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities
Figure 2 for On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities
Figure 3 for On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities
Figure 4 for On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities
Viaarxiv icon

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

Add code
Feb 14, 2024
Viaarxiv icon

A Closer Look at the Limitations of Instruction Tuning

Add code
Feb 03, 2024
Viaarxiv icon

REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback

Add code
Dec 22, 2023
Figure 1 for REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback
Figure 2 for REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback
Figure 3 for REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback
Figure 4 for REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback
Viaarxiv icon

FusDom: Combining In-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning

Add code
Dec 20, 2023
Figure 1 for FusDom: Combining In-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning
Figure 2 for FusDom: Combining In-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning
Figure 3 for FusDom: Combining In-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning
Figure 4 for FusDom: Combining In-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning
Viaarxiv icon

Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition

Add code
Dec 20, 2023
Figure 1 for Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition
Figure 2 for Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition
Figure 3 for Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition
Figure 4 for Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition
Viaarxiv icon

APoLLo: Unified Adapter and Prompt Learning for Vision Language Models

Add code
Dec 04, 2023
Viaarxiv icon

AV-RIR: Audio-Visual Room Impulse Response Estimation

Add code
Nov 30, 2023
Figure 1 for AV-RIR: Audio-Visual Room Impulse Response Estimation
Figure 2 for AV-RIR: Audio-Visual Room Impulse Response Estimation
Figure 3 for AV-RIR: Audio-Visual Room Impulse Response Estimation
Figure 4 for AV-RIR: Audio-Visual Room Impulse Response Estimation
Viaarxiv icon

AerialBooth: Mutual Information Guidance for Text Controlled Aerial View Synthesis from a Single Image

Add code
Nov 27, 2023
Figure 1 for AerialBooth: Mutual Information Guidance for Text Controlled Aerial View Synthesis from a Single Image
Figure 2 for AerialBooth: Mutual Information Guidance for Text Controlled Aerial View Synthesis from a Single Image
Figure 3 for AerialBooth: Mutual Information Guidance for Text Controlled Aerial View Synthesis from a Single Image
Figure 4 for AerialBooth: Mutual Information Guidance for Text Controlled Aerial View Synthesis from a Single Image
Viaarxiv icon