Picture for Yujia Yang

Yujia Yang

Spatio-Temporal Fusion Model for Standard View Classification of Echocardiographic Videos

Add code
Jun 16, 2026
Viaarxiv icon

When Seeing Is Not Believing -- A Benchmark for Search-Grounded Video Misinformation Detection

Add code
Jun 02, 2026
Viaarxiv icon

Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models

Add code
Mar 16, 2026
Viaarxiv icon

Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization

Add code
Feb 10, 2026
Viaarxiv icon

PaperX: A Unified Framework for Multimodal Academic Presentation Generation with Scholar DAG

Add code
Feb 05, 2026
Viaarxiv icon

ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search

Add code
Jan 30, 2026
Viaarxiv icon

RPO:Reinforcement Fine-Tuning with Partial Reasoning Optimization

Add code
Jan 27, 2026
Viaarxiv icon

VDE Bench: Evaluating The Capability of Image Editing Models to Modify Visual Documents

Add code
Jan 27, 2026
Viaarxiv icon

Beyond BEV: Optimizing Point-Level Tokens for Collaborative Perception

Add code
Aug 27, 2025
Viaarxiv icon

CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model

Add code
Sep 12, 2024
Figure 1 for CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model
Figure 2 for CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model
Figure 3 for CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model
Figure 4 for CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model
Viaarxiv icon