Picture for Mark Dras

Mark Dras

Microsoft Research Institute, Macquarie University

Beyond Theoretical Bounds: Empirical Privacy Loss Calibration for Text Rewriting Under Local Differential Privacy

Add code
Mar 24, 2026
Viaarxiv icon

Facial Movement Dynamics Reveal Workload During Complex Multitasking

Add code
Mar 18, 2026
Viaarxiv icon

When the Model Said 'No Comment', We Knew Helpfulness Was Dead, Honesty Was Alive, and Safety Was Terrified

Add code
Feb 07, 2026
Viaarxiv icon

CogMem: A Cognitive Memory Architecture for Sustained Multi-Turn Reasoning in Large Language Models

Add code
Dec 16, 2025
Figure 1 for CogMem: A Cognitive Memory Architecture for Sustained Multi-Turn Reasoning in Large Language Models
Figure 2 for CogMem: A Cognitive Memory Architecture for Sustained Multi-Turn Reasoning in Large Language Models
Figure 3 for CogMem: A Cognitive Memory Architecture for Sustained Multi-Turn Reasoning in Large Language Models
Viaarxiv icon

Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA

Add code
Nov 13, 2025
Figure 1 for Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA
Viaarxiv icon

We Think, Therefore We Align LLMs to Helpful, Harmless and Honest Before They Go Wrong

Add code
Sep 26, 2025
Figure 1 for We Think, Therefore We Align LLMs to Helpful, Harmless and Honest Before They Go Wrong
Figure 2 for We Think, Therefore We Align LLMs to Helpful, Harmless and Honest Before They Go Wrong
Figure 3 for We Think, Therefore We Align LLMs to Helpful, Harmless and Honest Before They Go Wrong
Figure 4 for We Think, Therefore We Align LLMs to Helpful, Harmless and Honest Before They Go Wrong
Viaarxiv icon

Too Helpful, Too Harmless, Too Honest or Just Right?

Add code
Sep 10, 2025
Viaarxiv icon

Steering Towards Fairness: Mitigating Political Bias in LLMs

Add code
Aug 12, 2025
Viaarxiv icon

Seeing the Threat: Vulnerabilities in Vision-Language Models to Adversarial Attack

Add code
May 28, 2025
Viaarxiv icon

A Survey on Progress in LLM Alignment from the Perspective of Reward Design

Add code
May 05, 2025
Viaarxiv icon