Picture for Mao Zheng

Mao Zheng

CodeDelegator: Mitigating Context Pollution via Role Separation in Code-as-Action Agents

Add code
Jan 21, 2026
Viaarxiv icon

PodBench: A Comprehensive Benchmark for Instruction-Aware Audio-Oriented Podcast Script Generation

Add code
Jan 21, 2026
Viaarxiv icon

HY-MT1.5 Technical Report

Add code
Dec 30, 2025
Viaarxiv icon

Hunyuan-MT Technical Report

Add code
Sep 05, 2025
Viaarxiv icon

TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

Add code
May 27, 2025
Viaarxiv icon

Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning

Add code
May 27, 2025
Figure 1 for Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning
Figure 2 for Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning
Figure 3 for Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning
Figure 4 for Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning
Viaarxiv icon

SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation

Add code
May 22, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models

Add code
Mar 21, 2025
Viaarxiv icon

GRP: Goal-Reversed Prompting for Zero-Shot Evaluation with LLMs

Add code
Mar 08, 2025
Viaarxiv icon