Picture for Jiahao Wang

Jiahao Wang

From Pixels to Words -- Towards Native One-Vision Models at Scale

Add code
May 27, 2026
Viaarxiv icon

Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving

Add code
May 21, 2026
Viaarxiv icon

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Add code
May 12, 2026
Viaarxiv icon

GeoDecider: A Coarse-to-Fine Agentic Workflow for Explainable Lithology Classification

Add code
May 05, 2026
Viaarxiv icon

GeoMind: An Agentic Workflow for Lithology Classification with Reasoned Tool Invocation

Add code
Apr 23, 2026
Viaarxiv icon

Long-SCOPE: Fully Sparse Long-Range Cooperative 3D Perception

Add code
Apr 10, 2026
Viaarxiv icon

AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References

Add code
Mar 26, 2026
Viaarxiv icon

DepthArb: Training-Free Depth-Arbitrated Generation for Occlusion-Robust Image Synthesis

Add code
Mar 25, 2026
Viaarxiv icon

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

Add code
Mar 24, 2026
Viaarxiv icon

ACPO: Counteracting Likelihood Displacement in Vision-Language Alignment with Asymmetric Constraints

Add code
Mar 23, 2026
Viaarxiv icon