Picture for Keshav Santhanam

Keshav Santhanam

NVIDIA Nemotron 3: Efficient and Open Intelligence

Add code
Dec 24, 2025
Viaarxiv icon

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Dec 23, 2025
Viaarxiv icon

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Add code
Aug 21, 2025
Figure 1 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 2 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 3 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Figure 4 for NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Viaarxiv icon

ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring

Add code
Apr 21, 2025
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Figure 1 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 2 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 3 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 4 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Viaarxiv icon

ALTO: An Efficient Network Orchestrator for Compound AI Systems

Add code
Mar 07, 2024
Viaarxiv icon

DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

Add code
Oct 05, 2023
Figure 1 for DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Figure 2 for DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Figure 3 for DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Figure 4 for DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Viaarxiv icon

Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs

Add code
May 03, 2023
Viaarxiv icon

UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers

Add code
Mar 01, 2023
Figure 1 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Figure 2 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Figure 3 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Figure 4 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Viaarxiv icon

Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP

Add code
Dec 28, 2022
Figure 1 for Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Figure 2 for Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Figure 3 for Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Viaarxiv icon