Picture for :

:

additional authors not shown

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Add code
Jul 02, 2025
Viaarxiv icon

Essential-Web v1.0: 24T tokens of organized web data

Add code
Jun 17, 2025
Viaarxiv icon

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Add code
Jun 16, 2025
Viaarxiv icon

Magistral

Add code
Jun 12, 2025
Viaarxiv icon

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Add code
May 12, 2025
Viaarxiv icon

Practical Efficiency of Muon for Pretraining

Add code
May 04, 2025
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Viaarxiv icon

Rethinking Reflection in Pre-Training

Add code
Apr 05, 2025
Viaarxiv icon

Wan: Open and Advanced Large-Scale Video Generative Models

Add code
Mar 26, 2025
Viaarxiv icon

Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control

Add code
Mar 18, 2025
Viaarxiv icon