Picture for Yifan Yang

Yifan Yang

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Add code
Apr 06, 2026
Viaarxiv icon

Earth Embeddings Reveal Diverse Urban Signals from Space

Add code
Apr 03, 2026
Viaarxiv icon

OmniSch: A Multimodal PCB Schematic Benchmark For Structured Diagram Visual Reasoning

Add code
Mar 31, 2026
Viaarxiv icon

CREST: Constraint-Release Execution for Multi-Robot Warehouse Shelf Rearrangement

Add code
Mar 27, 2026
Viaarxiv icon

BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation

Add code
Mar 26, 2026
Viaarxiv icon

CLEAR: Context-Aware Learning with End-to-End Mask-Free Inference for Adaptive Video Subtitle Removal

Add code
Mar 23, 2026
Viaarxiv icon

Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vision Models

Add code
Mar 21, 2026
Viaarxiv icon

Em-Garde: A Propose-Match Framework for Proactive Streaming Video Understanding

Add code
Mar 19, 2026
Viaarxiv icon

DamageArbiter: A CLIP-Enhanced Multimodal Arbitration Framework for Hurricane Damage Assessment from Street-View Imagery

Add code
Mar 16, 2026
Viaarxiv icon

SLICE: Semantic Latent Injection via Compartmentalized Embedding for Image Watermarking

Add code
Mar 13, 2026
Viaarxiv icon