Bryan Catanzaro

Effective Large Language Model Debugging with Best-first Tree Search

Jul 26, 2024
Via arXiv

Compact Language Models via Pruning and Knowledge Distillation

Jul 19, 2024
Via arXiv

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Jul 19, 2024
Via arXiv

Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models

Jul 09, 2024
Via arXiv

Data, Data Everywhere: A Guide for Pretraining Dataset Construction

Jul 08, 2024
Via arXiv

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

Jul 02, 2024
Via arXiv

Improving Text-To-Audio Models with Synthetic Captions

Jun 18, 2024
Via arXiv

Nemotron-4 340B Technical Report

Jun 17, 2024
Via arXiv

CircuitVAE: Efficient and Scalable Latent Circuit Optimization

Jun 13, 2024
Via arXiv

An Empirical Study of Mamba-based Language Models

Jun 12, 2024
Via arXiv