Mark Dras

Microsoft Research Institute, Macquarie University

When the Model Said 'No Comment', We Knew Helpfulness Was Dead, Honesty Was Alive, and Safety Was Terrified

Feb 07, 2026
Via arXiv

CogMem: A Cognitive Memory Architecture for Sustained Multi-Turn Reasoning in Large Language Models

Dec 16, 2025
Via arXiv

Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA

Nov 13, 2025
Via arXiv

We Think, Therefore We Align LLMs to Helpful, Harmless and Honest Before They Go Wrong

Sep 26, 2025
Via arXiv

Too Helpful, Too Harmless, Too Honest or Just Right?

Sep 10, 2025
Via arXiv

Steering Towards Fairness: Mitigating Political Bias in LLMs

Aug 12, 2025
Via arXiv

Seeing the Threat: Vulnerabilities in Vision-Language Models to Adversarial Attack

May 28, 2025
Via arXiv

A Survey on Progress in LLM Alignment from the Perspective of Reward Design

May 05, 2025
Via arXiv

Bi-directional Model Cascading with Proxy Confidence

Apr 27, 2025
Via arXiv

Myanmar XNLI: Building a Dataset and Exploring Low-resource Approaches to Natural Language Inference with Myanmar

Apr 13, 2025
Via arXiv