Rada Mihalcea

How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks

Apr 24, 2026

SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models

Apr 21, 2026

When Do Language Models Endorse Limitations on Human Rights Principles?

Mar 04, 2026

Belief-Sim: Towards Belief-Driven Simulation of Demographic Misinformation Susceptibility

Mar 03, 2026

Copyright Detective: A Forensic System to Evidence LLMs Flickering Copyright Leakage Risks

Feb 05, 2026

SocialHarmBench: Revealing LLM Vulnerabilities to Socially Harmful Requests

Oct 06, 2025

ISCA: A Framework for Interview-Style Conversational Agents

Aug 20, 2025

Revisiting LLM Value Probing Strategies: Are They Robust and Expressive?

Jul 17, 2025

Democratic or Authoritarian? Probing a New Dimension of Political Biases in Large Language Models

Jun 15, 2025

CliniDial: A Naturally Occurring Multimodal Dialogue Dataset for Team Reflection in Action During Clinical Operation

Jun 15, 2025