Mao Zheng

TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

May 27, 2025

Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning

May 27, 2025

SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation

May 22, 2025

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

May 21, 2025

FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models

Mar 21, 2025

GRP: Goal-Reversed Prompting for Zero-Shot Evaluation with LLMs

Mar 08, 2025

A Survey of Query Optimization in Large Language Models

Dec 23, 2024

MiMoTable: A Multi-scale Spreadsheet Benchmark with Meta Operations for Table Reasoning

Dec 16, 2024

SS-Bench: A Benchmark for Social Story Generation and Evaluation

Jun 22, 2024

Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!

Jun 17, 2024