Picture for Yong Wang

Yong Wang

FASA: Frequency-aware Sparse Attention

Add code
Feb 03, 2026
Viaarxiv icon

Entropy-Guided Data-Efficient Training for Multimodal Reasoning Reward Models

Add code
Feb 02, 2026
Viaarxiv icon

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Add code
Jan 29, 2026
Viaarxiv icon

Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

Add code
Jan 29, 2026
Viaarxiv icon

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Add code
Jan 28, 2026
Viaarxiv icon

Shedding the Facades, Connecting the Domains: Detecting Shifting Multimodal Hate Video with Test-Time Adaptation

Add code
Jan 28, 2026
Viaarxiv icon

A Pragmatic VLA Foundation Model

Add code
Jan 26, 2026
Viaarxiv icon

Athanor: Authoring Action Modification-based Interactions on Static Visualizations via Natural Language

Add code
Jan 25, 2026
Viaarxiv icon

A Graph Prompt Fine-Tuning Method for WSN Spatio-Temporal Correlation Anomaly Detection

Add code
Jan 19, 2026
Viaarxiv icon

Nip Rumors in the Bud: Retrieval-Guided Topic-Level Adaptation for Test-Time Fake News Video Detection

Add code
Jan 17, 2026
Viaarxiv icon