Picture for Hongming Zhang

Hongming Zhang

Shammie

ACPO: Counteracting Likelihood Displacement in Vision-Language Alignment with Asymmetric Constraints

Add code
Mar 23, 2026
Viaarxiv icon

Principled Fast and Meta Knowledge Learners for Continual Reinforcement Learning

Add code
Mar 01, 2026
Viaarxiv icon

Understanding and Enhancing Mamba-Transformer Hybrids for Memory Recall and Language Modeling

Add code
Oct 30, 2025
Viaarxiv icon

Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents

Add code
Oct 16, 2025
Viaarxiv icon

UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression

Add code
Sep 19, 2025
Viaarxiv icon

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Add code
Sep 09, 2025
Viaarxiv icon

Self-Guided Function Calling in Large Language Models via Stepwise Experience Recall

Add code
Aug 21, 2025
Viaarxiv icon

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Add code
Aug 07, 2025
Figure 1 for R-Zero: Self-Evolving Reasoning LLM from Zero Data
Figure 2 for R-Zero: Self-Evolving Reasoning LLM from Zero Data
Figure 3 for R-Zero: Self-Evolving Reasoning LLM from Zero Data
Figure 4 for R-Zero: Self-Evolving Reasoning LLM from Zero Data
Viaarxiv icon

UAVScenes: A Multi-Modal Dataset for UAVs

Add code
Jul 30, 2025
Figure 1 for UAVScenes: A Multi-Modal Dataset for UAVs
Figure 2 for UAVScenes: A Multi-Modal Dataset for UAVs
Figure 3 for UAVScenes: A Multi-Modal Dataset for UAVs
Figure 4 for UAVScenes: A Multi-Modal Dataset for UAVs
Viaarxiv icon

VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models

Add code
May 28, 2025
Viaarxiv icon