Darko Stosic

NVIDIA Nemotron 3: Efficient and Open Intelligence

Dec 24, 2025

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Dec 23, 2025

Distance Metric Learning through Minimization of the Free Energy

Jun 10, 2021

Search Spaces for Neural Model Training

May 27, 2021

Accelerating Sparse Deep Neural Networks

Apr 16, 2021