Zhang Zhang

SportsGPT: An LLM-driven Framework for Interpretable Sports Motion Assessment and Training Guidance

Dec 19, 2025

HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction

Nov 13, 2025

BaseReward: A Strong Baseline for Multimodal Reward Model

Sep 19, 2025

Humanoid Occupancy: Enabling A Generalized Multimodal Occupancy Perception System on Humanoid Robots

Jul 27, 2025

Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner

May 16, 2025

PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model

May 08, 2025

Occupancy World Model for Robots

May 07, 2025

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

May 05, 2025

Large Language Models are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks

Apr 28, 2025

A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Apr 21, 2025