Picture for Xin Eric Wang

Xin Eric Wang

"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models

Add code
Jul 17, 2025
Viaarxiv icon

Agents of Change: Self-Evolving LLM Agents for Strategic Planning

Add code
Jun 05, 2025
Viaarxiv icon

SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning

Add code
May 22, 2025
Viaarxiv icon

GRIT: Teaching MLLMs to Think with Images

Add code
May 21, 2025
Viaarxiv icon

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Add code
May 21, 2025
Viaarxiv icon

Constructing a 3D Town from a Single Image

Add code
May 21, 2025
Viaarxiv icon

A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models

Add code
May 04, 2025
Viaarxiv icon

Self-Resource Allocation in Multi-Agent LLM Systems

Add code
Apr 02, 2025
Viaarxiv icon

Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents

Add code
Apr 01, 2025
Viaarxiv icon

Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

Add code
Feb 22, 2025
Viaarxiv icon