Picture for Xiaojun Meng

Xiaojun Meng

and Other Contributors

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

Add code
May 28, 2025
Viaarxiv icon

Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs

Add code
May 26, 2025
Viaarxiv icon

Dynamic Sampling that Adapts: Iterative DPO for Self-Aware Mathematical Reasoning

Add code
May 22, 2025
Viaarxiv icon

Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs

Add code
May 07, 2025
Viaarxiv icon

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Add code
Apr 10, 2025
Viaarxiv icon

Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding

Add code
Dec 23, 2024
Viaarxiv icon

VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format

Add code
Nov 27, 2024
Viaarxiv icon

Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering

Add code
Sep 11, 2024
Viaarxiv icon

End-to-End Video Question Answering with Frame Scoring Mechanisms and Adaptive Sampling

Add code
Jul 23, 2024
Viaarxiv icon

HawkEye: Training Video-Text LLMs for Grounding Text in Videos

Add code
Mar 15, 2024
Viaarxiv icon