Picture for Fei Mi

Fei Mi

and Other Contributors

How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study

Add code
May 21, 2025
Viaarxiv icon

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Add code
Apr 10, 2025
Figure 1 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Figure 2 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Figure 3 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Figure 4 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Viaarxiv icon

DAST: Difficulty-Aware Self-Training on Large Language Models

Add code
Mar 12, 2025
Viaarxiv icon

UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models

Add code
Dec 16, 2024
Figure 1 for UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
Figure 2 for UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
Figure 3 for UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
Figure 4 for UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
Viaarxiv icon

CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference

Add code
Jun 25, 2024
Viaarxiv icon

Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation

Add code
Jun 12, 2024
Figure 1 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Figure 2 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Figure 3 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Figure 4 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Viaarxiv icon

Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment

Add code
May 01, 2024
Figure 1 for Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
Figure 2 for Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
Figure 3 for Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
Figure 4 for Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
Viaarxiv icon

Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models

Add code
Mar 05, 2024
Figure 1 for Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models
Figure 2 for Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models
Figure 3 for Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models
Figure 4 for Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models
Viaarxiv icon

UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval

Add code
Feb 28, 2024
Figure 1 for UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval
Figure 2 for UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval
Figure 3 for UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval
Figure 4 for UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval
Viaarxiv icon

YODA: Teacher-Student Progressive Learning for Language Models

Add code
Jan 28, 2024
Figure 1 for YODA: Teacher-Student Progressive Learning for Language Models
Figure 2 for YODA: Teacher-Student Progressive Learning for Language Models
Figure 3 for YODA: Teacher-Student Progressive Learning for Language Models
Figure 4 for YODA: Teacher-Student Progressive Learning for Language Models
Viaarxiv icon