Picture for Jiansheng Wei

Jiansheng Wei

The GaoYao Benchmark: A Comprehensive Framework for Evaluating Multilingual and Multicultural Abilities of Large Language Models

Add code
Apr 22, 2026
Viaarxiv icon

DT2IT-MRM: Debiased Preference Construction and Iterative Training for Multimodal Reward Modeling

Add code
Apr 21, 2026
Viaarxiv icon

MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis

Add code
Apr 13, 2026
Viaarxiv icon

PACE: Defying the Scaling Hypothesis of Exploration in Iterative Alignment for Mathematical Reasoning

Add code
Feb 05, 2026
Viaarxiv icon

Dynamic Sampling that Adapts: Iterative DPO for Self-Aware Mathematical Reasoning

Add code
May 22, 2025
Viaarxiv icon

Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs

Add code
May 07, 2025
Figure 1 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Figure 2 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Figure 3 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Figure 4 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Viaarxiv icon

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Add code
Apr 10, 2025
Figure 1 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Figure 2 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Figure 3 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Figure 4 for Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
Viaarxiv icon

VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format

Add code
Nov 27, 2024
Figure 1 for VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format
Figure 2 for VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format
Figure 3 for VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format
Figure 4 for VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format
Viaarxiv icon

Visually Guided Generative Text-Layout Pre-training for Document Intelligence

Add code
Mar 27, 2024
Figure 1 for Visually Guided Generative Text-Layout Pre-training for Document Intelligence
Figure 2 for Visually Guided Generative Text-Layout Pre-training for Document Intelligence
Figure 3 for Visually Guided Generative Text-Layout Pre-training for Document Intelligence
Figure 4 for Visually Guided Generative Text-Layout Pre-training for Document Intelligence
Viaarxiv icon

PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing

Add code
Mar 20, 2023
Figure 1 for PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Figure 2 for PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Figure 3 for PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Figure 4 for PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Viaarxiv icon