Picture for Tong Zheng

Tong Zheng

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

Add code
Jun 02, 2026
Viaarxiv icon

G-Zero: Self-Play for Open-Ended Generation from Zero Data

Add code
May 11, 2026
Viaarxiv icon

DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification

Add code
May 10, 2026
Viaarxiv icon

Reinforcing Multimodal Reasoning Against Visual Degradation

Add code
May 10, 2026
Viaarxiv icon

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Add code
Feb 03, 2026
Viaarxiv icon

Prepare Reasoning Language Models for Multi-Agent Debate with Self-Debate Reinforcement Learning

Add code
Jan 29, 2026
Viaarxiv icon

RelayLLM: Efficient Reasoning via Collaborative Decoding

Add code
Jan 08, 2026
Viaarxiv icon

ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation

Add code
Nov 14, 2025
Viaarxiv icon

Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

Add code
Nov 10, 2025
Figure 1 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Figure 2 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Figure 3 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Figure 4 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Viaarxiv icon

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning

Add code
Oct 01, 2025
Figure 1 for VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning
Figure 2 for VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning
Figure 3 for VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning
Figure 4 for VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning
Viaarxiv icon