Dawei Zhu

MiMo-Audio: Audio Language Models are Few-Shot Learners

Dec 29, 2025

DocLens: A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding

Nov 14, 2025

PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks

Oct 14, 2025

MultiJustice: A Chinese Dataset for Multi-Party, Multi-Charge Legal Prediction

Jul 09, 2025

A Survey on Latent Reasoning

Jul 08, 2025

PLD: A Choice-Theoretic List-Wise Knowledge Distillation

Jun 14, 2025

Language models can learn implicit multi-hop reasoning, but only if they have lots of training data

May 23, 2025

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

May 12, 2025

Same evaluation, more tokens: On the effect of input length for machine translation evaluation using Large Language Models

May 03, 2025

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision

Feb 28, 2025