Picture for Fei Mi

Fei Mi

and Other Contributors

ClusterUCB: Efficient Gradient-Based Data Selection for Targeted Fine-Tuning of LLMs

Add code
Jun 12, 2025
Viaarxiv icon

Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition

Add code
May 29, 2025
Viaarxiv icon

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

Add code
May 28, 2025
Viaarxiv icon

Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning

Add code
May 28, 2025
Viaarxiv icon

How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study

Add code
May 21, 2025
Viaarxiv icon

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Add code
Apr 10, 2025
Viaarxiv icon

DAST: Difficulty-Aware Self-Training on Large Language Models

Add code
Mar 12, 2025
Viaarxiv icon

UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models

Add code
Dec 16, 2024
Viaarxiv icon

CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference

Add code
Jun 25, 2024
Viaarxiv icon

Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation

Add code
Jun 12, 2024
Figure 1 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Figure 2 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Figure 3 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Figure 4 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Viaarxiv icon