Yifan Yang

OpenRCA 2.0: From Outcome Labels to Causal Process Supervision

Jun 25, 2026
Via arXiv

IAPO: Input Attribution-Aware Policy Optimization for Tool Use in Small Multimodal Agents

Jun 10, 2026
Via arXiv

Precision-Aware Illumination-Disentangled Vision Transformer for Spacecraft 6D Pose Estimation

Jun 10, 2026
Via arXiv

A Comprehensive Ecosystem for Open-Domain Customized Video Generation

Jun 10, 2026
Via arXiv

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Jun 08, 2026
Via arXiv

Latent Spatial Memory for Video World Models

Jun 08, 2026
Via arXiv

GIScholarBench: Benchmarking LLM Overconfidence in GIS Research

Jun 06, 2026
Via arXiv

Beyond Matching: Category-Guided Latent Intent Reasoning for Generative Retrieval in E-Commerce

Jun 05, 2026
Via arXiv

MMAE: A Massive Multitask Audio Editing Benchmark

Jun 05, 2026
Via arXiv

UAT: Unified Audio-Text Diffusion for Audio Generation, Editing, and Captioning

Jun 03, 2026
Via arXiv