Zhang Zhang

A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Apr 21, 2025
Via arXiv

RoboOcc: Enhancing the Geometric and Semantic Scene Understanding for Robots

Apr 20, 2025
Via arXiv

Open-Medical-R1: How to Choose Data for RLVR Training at Medicine Domain

Apr 16, 2025
Via arXiv

MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

Apr 07, 2025
Via arXiv

Q-MambaIR: Accurate Quantized Mamba for Efficient Image Restoration

Mar 27, 2025
Via arXiv

Aligning Multimodal LLM with Human Preference: A Survey

Mar 18, 2025
Via arXiv

HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer

Mar 13, 2025
Via arXiv

HumanoidPano: Hybrid Spherical Panoramic-LiDAR Cross-Modal Perception for Humanoid Robots

Mar 13, 2025
Via arXiv

Conformal Uncertainty Indicator for Continual Test-Time Adaptation

Feb 05, 2025
Via arXiv

TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting

Dec 30, 2024
Via arXiv