Picture for Yehui Tang

Yehui Tang

and Other Contributors

Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition

Add code
May 29, 2025
Viaarxiv icon

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

Add code
May 28, 2025
Viaarxiv icon

SlimLLM: Accurate Structured Pruning for Large Language Models

Add code
May 28, 2025
Viaarxiv icon

Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs

Add code
May 26, 2025
Viaarxiv icon

Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs

Add code
May 07, 2025
Viaarxiv icon

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Add code
Apr 10, 2025
Viaarxiv icon

Saliency-driven Dynamic Token Pruning for Large Language Models

Add code
Apr 09, 2025
Viaarxiv icon

SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs

Add code
Mar 20, 2025
Viaarxiv icon

Mixture of Lookup Experts

Add code
Mar 20, 2025
Viaarxiv icon

Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping

Add code
Mar 10, 2025
Viaarxiv icon