Picture for Weiran Huang

Weiran Huang

Targeted Exploration via Unified Entropy Control for Reinforcement Learning

Add code
Apr 16, 2026
Viaarxiv icon

Camyla: Scaling Autonomous Research in Medical Image Segmentation

Add code
Apr 12, 2026
Viaarxiv icon

IDER: IDempotent Experience Replay for Reliable Continual Learning

Add code
Mar 03, 2026
Viaarxiv icon

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

Add code
Feb 16, 2026
Viaarxiv icon

Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning

Add code
Jun 11, 2025
Viaarxiv icon

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Add code
May 28, 2025
Viaarxiv icon

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Add code
May 28, 2025
Viaarxiv icon

Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models?

Add code
Mar 08, 2025
Viaarxiv icon

Information-Theoretic Perspectives on Optimizers

Add code
Feb 28, 2025
Figure 1 for Information-Theoretic Perspectives on Optimizers
Figure 2 for Information-Theoretic Perspectives on Optimizers
Figure 3 for Information-Theoretic Perspectives on Optimizers
Figure 4 for Information-Theoretic Perspectives on Optimizers
Viaarxiv icon

AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning

Add code
Nov 18, 2024
Figure 1 for AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning
Figure 2 for AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning
Figure 3 for AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning
Figure 4 for AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning
Viaarxiv icon