Picture for Oluwatobi Olabiyi

Oluwatobi Olabiyi

NVIDIA Nemotron 3: Efficient and Open Intelligence

Add code
Dec 24, 2025
Viaarxiv icon

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Dec 23, 2025
Viaarxiv icon

NVIDIA Nemotron Nano V2 VL

Add code
Nov 07, 2025
Viaarxiv icon

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Add code
Aug 21, 2025
Figure 1 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 2 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 3 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 4 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Viaarxiv icon

Llama-Nemotron: Efficient Reasoning Models

Add code
May 02, 2025
Figure 1 for Llama-Nemotron: Efficient Reasoning Models
Figure 2 for Llama-Nemotron: Efficient Reasoning Models
Figure 3 for Llama-Nemotron: Efficient Reasoning Models
Figure 4 for Llama-Nemotron: Efficient Reasoning Models
Viaarxiv icon

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Add code
Apr 15, 2025
Viaarxiv icon

DLGNet: A Transformer-based Model for Dialogue Response Generation

Add code
Sep 04, 2019
Figure 1 for DLGNet: A Transformer-based Model for Dialogue Response Generation
Figure 2 for DLGNet: A Transformer-based Model for Dialogue Response Generation
Figure 3 for DLGNet: A Transformer-based Model for Dialogue Response Generation
Figure 4 for DLGNet: A Transformer-based Model for Dialogue Response Generation
Viaarxiv icon

Adversarial Bootstrapping for Dialogue Model Training

Add code
Sep 04, 2019
Figure 1 for Adversarial Bootstrapping for Dialogue Model Training
Figure 2 for Adversarial Bootstrapping for Dialogue Model Training
Figure 3 for Adversarial Bootstrapping for Dialogue Model Training
Figure 4 for Adversarial Bootstrapping for Dialogue Model Training
Viaarxiv icon

An Adversarial Learning Framework For A Persona-Based Multi-Turn Dialogue Model

Add code
Apr 29, 2019
Figure 1 for An Adversarial Learning Framework For A Persona-Based Multi-Turn Dialogue Model
Figure 2 for An Adversarial Learning Framework For A Persona-Based Multi-Turn Dialogue Model
Figure 3 for An Adversarial Learning Framework For A Persona-Based Multi-Turn Dialogue Model
Figure 4 for An Adversarial Learning Framework For A Persona-Based Multi-Turn Dialogue Model
Viaarxiv icon

Multi-turn Dialogue Response Generation in an Adversarial Learning Framework

Add code
Sep 19, 2018
Figure 1 for Multi-turn Dialogue Response Generation in an Adversarial Learning Framework
Figure 2 for Multi-turn Dialogue Response Generation in an Adversarial Learning Framework
Figure 3 for Multi-turn Dialogue Response Generation in an Adversarial Learning Framework
Figure 4 for Multi-turn Dialogue Response Generation in an Adversarial Learning Framework
Viaarxiv icon