Buddhika Laknath Semage

The Unintended Trade-off of AI Alignment: Balancing Hallucination Mitigation and Safety in LLMs

Oct 09, 2025

Improving Multilingual Language Models by Aligning Representations through Steering

May 19, 2025

Zero-shot Sim2Real Adaptation Across Environments

Feb 08, 2023

Uncertainty Aware System Identification with Universal Policies

Feb 11, 2022

Fast Model-based Policy Search for Universal Policy Networks

Feb 11, 2022

Intuitive Physics Guided Exploration for Sample Efficient Sim2real Transfer

Apr 18, 2021