Dawei Zhu

MultiJustice: A Chinese Dataset for Multi-Party, Multi-Charge Legal Prediction

Jul 09, 2025
Via arXiv

A Survey on Latent Reasoning

Jul 08, 2025
Via arXiv

PLD: A Choice-Theoretic List-Wise Knowledge Distillation

Jun 14, 2025
Via arXiv

Language models can learn implicit multi-hop reasoning, but only if they have lots of training data

May 23, 2025
Via arXiv

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

May 12, 2025
Via arXiv

Same evaluation, more tokens: On the effect of input length for machine translation evaluation using Large Language Models

May 03, 2025
Via arXiv

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision

Feb 28, 2025
Via arXiv

LongAttn: Selecting Long-context Training Data via Token-level Attention

Feb 24, 2025
Via arXiv

MMTEB: Massive Multilingual Text Embedding Benchmark

Feb 19, 2025
Via arXiv

AFRIDOC-MT: Document-level MT Corpus for African Languages

Jan 10, 2025
Via arXiv