Picture for Zhaoxiang Zhang

Zhaoxiang Zhang

NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

Add code
Jan 01, 2026
Viaarxiv icon

Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements

Add code
Dec 31, 2025
Viaarxiv icon

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Add code
Dec 24, 2025
Viaarxiv icon

VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?

Add code
Dec 23, 2025
Figure 1 for VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
Figure 2 for VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
Figure 3 for VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
Figure 4 for VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
Viaarxiv icon

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Add code
Dec 14, 2025
Viaarxiv icon

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Add code
Nov 13, 2025
Figure 1 for MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
Figure 2 for MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
Figure 3 for MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
Figure 4 for MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
Viaarxiv icon

SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models

Add code
Nov 07, 2025
Viaarxiv icon

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Add code
Oct 22, 2025
Viaarxiv icon

MCOP: Multi-UAV Collaborative Occupancy Prediction

Add code
Oct 14, 2025
Viaarxiv icon

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving

Add code
Oct 14, 2025
Figure 1 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Figure 2 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Figure 3 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Figure 4 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Viaarxiv icon