Jiahao Wang

SceneCrafter: Controllable Multi-View Driving Scene Editing

Jun 24, 2025
Via arXiv

OmniGen2: Exploration to Advanced Multimodal Generation

Jun 23, 2025
Via arXiv

Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs

Jun 12, 2025
Via arXiv

Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting

May 30, 2025
Via arXiv

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Apr 21, 2025
Via arXiv

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Apr 15, 2025
Via arXiv

C-FAITH: A Chinese Fine-Grained Benchmark for Automated Hallucination Evaluation

Apr 14, 2025
Via arXiv

Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation

Mar 31, 2025
Via arXiv

Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark

Mar 10, 2025
Via arXiv

DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability

Mar 09, 2025
Via arXiv