Picture for Faisal Ladhak

Faisal Ladhak

NVIDIA Nemotron 3: Efficient and Open Intelligence

Add code
Dec 24, 2025
Viaarxiv icon

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Dec 23, 2025
Viaarxiv icon

SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling

Add code
Apr 11, 2025
Viaarxiv icon

L0-Reasoning Bench: Evaluating Procedural Correctness in Language Models via Simple Program Execution

Add code
Mar 28, 2025
Viaarxiv icon

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Add code
Dec 19, 2024
Figure 1 for Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Figure 2 for Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Figure 3 for Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Figure 4 for Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Viaarxiv icon

Incorporating Human Explanations for Robust Hate Speech Detection

Add code
Nov 09, 2024
Figure 1 for Incorporating Human Explanations for Robust Hate Speech Detection
Viaarxiv icon

STORYSUMM: Evaluating Faithfulness in Story Summarization

Add code
Jul 09, 2024
Figure 1 for STORYSUMM: Evaluating Faithfulness in Story Summarization
Figure 2 for STORYSUMM: Evaluating Faithfulness in Story Summarization
Figure 3 for STORYSUMM: Evaluating Faithfulness in Story Summarization
Figure 4 for STORYSUMM: Evaluating Faithfulness in Story Summarization
Viaarxiv icon

Aligning Large Language Models via Fine-grained Supervision

Add code
Jun 04, 2024
Figure 1 for Aligning Large Language Models via Fine-grained Supervision
Figure 2 for Aligning Large Language Models via Fine-grained Supervision
Figure 3 for Aligning Large Language Models via Fine-grained Supervision
Figure 4 for Aligning Large Language Models via Fine-grained Supervision
Viaarxiv icon

Proving Test Set Contamination in Black Box Language Models

Add code
Oct 26, 2023
Viaarxiv icon

From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

Add code
Sep 08, 2023
Figure 1 for From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Figure 2 for From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Figure 3 for From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Figure 4 for From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Viaarxiv icon