Minlie Huang

MAGI: Multi-Agent Guided Interview for Psychiatric Assessment

Apr 25, 2025
Via arXiv

Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues

Apr 24, 2025
Via arXiv

VPO: Aligning Text-to-Video Generation Models with Prompt Optimization

Mar 26, 2025
Via arXiv

StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error

Mar 13, 2025
Via arXiv

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Feb 24, 2025
Via arXiv

LongSafety: Evaluating Long-Context Safety of Large Language Models

Feb 24, 2025
Via arXiv

HPSS: Heuristic Prompting Strategy Search for LLM Evaluators

Feb 18, 2025
Via arXiv

DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing

Feb 17, 2025
Via arXiv

Human Decision-making is Susceptible to AI-driven Manipulation

Feb 11, 2025
Via arXiv

MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science

Jan 18, 2025
Via arXiv