Picture for Haotian Wang

Haotian Wang

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Add code
May 27, 2025
Viaarxiv icon

GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution

Add code
May 27, 2025
Viaarxiv icon

Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Add code
May 20, 2025
Viaarxiv icon

AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale

Add code
May 13, 2025
Viaarxiv icon

Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study

Add code
May 04, 2025
Viaarxiv icon

DeepSTA: A Spatial-Temporal Attention Network for Logistics Delivery Timely Rate Prediction in Anomaly Conditions

Add code
May 01, 2025
Viaarxiv icon

Learning to Estimate Package Delivery Time in Mixed Imbalanced Delivery and Pickup Logistics Services

Add code
May 01, 2025
Viaarxiv icon

DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training

Add code
Apr 24, 2025
Viaarxiv icon

Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability

Add code
Apr 13, 2025
Viaarxiv icon

Dual Boost-Driven Graph-Level Clustering Network

Add code
Apr 08, 2025
Viaarxiv icon