Paul Pu Liang

What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains

Aug 10, 2025
Via arXiv

Sotopia-RL: Reward Design for Social Intelligence

Aug 05, 2025
Via arXiv

Decide less, communicate more: On the construct validity of end-to-end fact-checking in medicine

Jun 25, 2025
Via arXiv

PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts

Jun 06, 2025
Via arXiv

Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation

May 26, 2025
Via arXiv

REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing

May 24, 2025
Via arXiv

RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding

May 20, 2025
Via arXiv

POET: Supporting Prompting Creativity and Personalization with Automated Expansion of Text-to-Image Generation

Apr 18, 2025
Via arXiv

TAMP: Token-Adaptive Layerwise Pruning in Multimodal Large Language Models

Apr 15, 2025
Via arXiv

Alice: Proactive Learning with Teacher's Demonstrations for Weak-to-Strong Generalization

Apr 09, 2025
Via arXiv