Picture for Mozhi Zhang

Mozhi Zhang

Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training

Add code
Jun 12, 2025
Viaarxiv icon

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Add code
May 26, 2025
Viaarxiv icon

MiniMax-01: Scaling Foundation Models with Lightning Attention

Add code
Jan 14, 2025
Viaarxiv icon

LongSafetyBench: Long-Context LLMs Struggle with Safety Issues

Add code
Nov 11, 2024
Figure 1 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Figure 2 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Figure 3 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Figure 4 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Viaarxiv icon

MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time

Add code
Oct 18, 2024
Figure 1 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Figure 2 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Figure 3 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Figure 4 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Viaarxiv icon

Calibrating the Confidence of Large Language Models by Eliciting Fidelity

Add code
Apr 03, 2024
Figure 1 for Calibrating the Confidence of Large Language Models by Eliciting Fidelity
Figure 2 for Calibrating the Confidence of Large Language Models by Eliciting Fidelity
Figure 3 for Calibrating the Confidence of Large Language Models by Eliciting Fidelity
Figure 4 for Calibrating the Confidence of Large Language Models by Eliciting Fidelity
Viaarxiv icon

Labeled Interactive Topic Models

Add code
Nov 15, 2023
Viaarxiv icon

Evaluating Hallucinations in Chinese Large Language Models

Add code
Oct 05, 2023
Figure 1 for Evaluating Hallucinations in Chinese Large Language Models
Figure 2 for Evaluating Hallucinations in Chinese Large Language Models
Figure 3 for Evaluating Hallucinations in Chinese Large Language Models
Figure 4 for Evaluating Hallucinations in Chinese Large Language Models
Viaarxiv icon

PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search

Add code
May 20, 2023
Viaarxiv icon

A Dataset and Baselines for Multilingual Reply Suggestion

Add code
Jun 03, 2021
Figure 1 for A Dataset and Baselines for Multilingual Reply Suggestion
Figure 2 for A Dataset and Baselines for Multilingual Reply Suggestion
Figure 3 for A Dataset and Baselines for Multilingual Reply Suggestion
Figure 4 for A Dataset and Baselines for Multilingual Reply Suggestion
Viaarxiv icon