Picture for Yuchen Ma

Yuchen Ma

SpecAlign: Efficient Specification-Grounded Alignment of Large Language Models via Synthetic Data

Add code
Jun 17, 2026
Viaarxiv icon

UXBench: Measuring the Actionability of LLM-Generated UX Critiques

Add code
Jun 15, 2026
Viaarxiv icon

Causal methods for LLM development and evaluation

Add code
May 25, 2026
Viaarxiv icon

AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills

Add code
May 13, 2026
Viaarxiv icon

Emergent Social Intelligence Risks in Generative Multi-Agent Systems

Add code
Mar 29, 2026
Viaarxiv icon

ProbeLLM: Automating Principled Diagnosis of LLM Failures

Add code
Feb 13, 2026
Viaarxiv icon

Synthetic Interaction Data for Scalable Personalization in Large Language Models

Add code
Feb 12, 2026
Viaarxiv icon

Targeted Synthetic Control Method

Add code
Feb 04, 2026
Viaarxiv icon

UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

Add code
Jan 08, 2026
Viaarxiv icon

LLM-Driven Treatment Effect Estimation Under Inference Time Text Confounding

Add code
Jul 03, 2025
Viaarxiv icon