Picture for Wentao Zhang

Wentao Zhang

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

Add code
Apr 14, 2025
Viaarxiv icon

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Add code
Apr 14, 2025
Viaarxiv icon

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Add code
Apr 14, 2025
Viaarxiv icon

MaintainCoder: Maintainable Code Generation Under Dynamic Requirements

Add code
Mar 31, 2025
Viaarxiv icon

RARE: Retrieval-Augmented Reasoning Modeling

Add code
Mar 30, 2025
Viaarxiv icon

Concept-as-Tree: Synthetic Data is All You Need for VLM Personalization

Add code
Mar 17, 2025
Viaarxiv icon

WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes

Add code
Mar 17, 2025
Viaarxiv icon

MathClean: A Benchmark for Synthetic Mathematical Data Cleaning

Add code
Feb 26, 2025
Viaarxiv icon

MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification

Add code
Feb 19, 2025
Viaarxiv icon

HopRAG: Multi-Hop Reasoning for Logic-Aware Retrieval-Augmented Generation

Add code
Feb 18, 2025
Viaarxiv icon