Rui Men

SAGE: Scalable Automated Robustness Augmentation for LLM Knowledge Evaluation

May 12, 2026
Via arXiv

Qwen-Image-2.0 Technical Report

May 11, 2026
Via arXiv

Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers

Feb 04, 2026
Via arXiv

A Unified View of Attention and Residual Sinks: Outlier-Driven Rescaling is Essential for Transformer Training

Jan 30, 2026
Via arXiv

Group Sequence Policy Optimization

Jul 24, 2025
Via arXiv

MTR-Bench: A Comprehensive Benchmark for Multi-Turn Reasoning Evaluation

May 26, 2025
Via arXiv

Qwen3 Technical Report

May 14, 2025
Via arXiv

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

May 10, 2025
Via arXiv

HellaSwag-Pro: A Large-Scale Bilingual Benchmark for Evaluating the Robustness of LLMs in Commonsense Reasoning

Feb 17, 2025
Via arXiv

Qwen2.5-1M Technical Report

Jan 26, 2025
Via arXiv