Daniel Lo

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Apr 14, 2026
Via arXiv

NVIDIA Nemotron 3: Efficient and Open Intelligence

Dec 24, 2025
Via arXiv

Dynamic Stashing Quantization for Efficient Transformer Training

Mar 09, 2023
Via arXiv