Picture for Yuwei Niu

Yuwei Niu

Echo-Memory: A Controlled Study of Memory in Action World Models

Add code
Jun 08, 2026
Viaarxiv icon

ChronoPhyBench: Do MLLMs Truly Understand the World or Merely Exploit Language Priors?

Add code
Jun 06, 2026
Viaarxiv icon

From Pixels to Words -- Towards Native One-Vision Models at Scale

Add code
May 27, 2026
Viaarxiv icon

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Add code
May 12, 2026
Viaarxiv icon

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Add code
Apr 26, 2026
Viaarxiv icon

iFSQ: Improving FSQ for Image Generation with 1 Line of Code

Add code
Jan 27, 2026
Viaarxiv icon

Memory in the Age of AI Agents

Add code
Dec 15, 2025
Viaarxiv icon

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

Add code
Oct 14, 2025
Viaarxiv icon

LangBridge: Interpreting Image as a Combination of Language Embeddings

Add code
Mar 26, 2025
Figure 1 for LangBridge: Interpreting Image as a Combination of Language Embeddings
Figure 2 for LangBridge: Interpreting Image as a Combination of Language Embeddings
Figure 3 for LangBridge: Interpreting Image as a Combination of Language Embeddings
Figure 4 for LangBridge: Interpreting Image as a Combination of Language Embeddings
Viaarxiv icon

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

Add code
Mar 10, 2025
Viaarxiv icon