Huanyu Zhang

BaseReward: A Strong Baseline for Multimodal Reward Model

Sep 19, 2025

11Plus-Bench: Demystifying Multimodal LLM Spatial Reasoning with Cognitive-Inspired Analysis

Aug 27, 2025

Memory-Efficient Differentially Private Training with Gradient Random Projection

Jun 18, 2025

A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Apr 21, 2025

Imagine while Reasoning in Space: Multimodal Visualization-of-Thought

Jan 13, 2025

TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting

Dec 30, 2024

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Aug 23, 2024

Statler: State-Maintaining Language Models for Embodied Reasoning

Jul 03, 2023

DP-HyPO: An Adaptive Private Hyperparameter Optimization Framework

Jun 09, 2023

Federated Linear Contextual Bandits with User-level Differential Privacy

Jun 09, 2023