Picture for Marzyeh Ghassemi

Marzyeh Ghassemi

University of Toronto, Vector Institute

Explainable AI as a Double-Edged Sword in Dermatology: The Impact on Clinicians versus The Public

Add code
Dec 14, 2025
Viaarxiv icon

SpaceVLM: Sub-Space Modeling of Negation in Vision-Language Models

Add code
Nov 15, 2025
Viaarxiv icon

Train Long, Think Short: Curriculum Learning for Efficient Reasoning

Add code
Aug 12, 2025
Viaarxiv icon

Tiered Agentic Oversight: A Hierarchical Multi-Agent System for AI Safety in Healthcare

Add code
Jun 14, 2025
Viaarxiv icon

When Style Breaks Safety: Defending Language Models Against Superficial Style Alignment

Add code
Jun 09, 2025
Viaarxiv icon

KScope: A Framework for Characterizing the Knowledge Status of Language Models

Add code
Jun 09, 2025
Viaarxiv icon

MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering

Add code
May 29, 2025
Viaarxiv icon

MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations

Add code
Apr 10, 2025
Viaarxiv icon

What's in a Query: Polarity-Aware Distribution-Based Fair Ranking

Add code
Feb 17, 2025
Figure 1 for What's in a Query: Polarity-Aware Distribution-Based Fair Ranking
Figure 2 for What's in a Query: Polarity-Aware Distribution-Based Fair Ranking
Figure 3 for What's in a Query: Polarity-Aware Distribution-Based Fair Ranking
Figure 4 for What's in a Query: Polarity-Aware Distribution-Based Fair Ranking
Viaarxiv icon

Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions

Add code
Feb 06, 2025
Figure 1 for Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
Figure 2 for Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
Figure 3 for Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
Figure 4 for Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
Viaarxiv icon