Bryan Catanzaro

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Jul 19, 2024

Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models

Jul 09, 2024

Data, Data Everywhere: A Guide for Pretraining Dataset Construction

Jul 08, 2024

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

Jul 02, 2024

Improving Text-To-Audio Models with Synthetic Captions

Jun 18, 2024

Nemotron-4 340B Technical Report

Jun 17, 2024

CircuitVAE: Efficient and Scalable Latent Circuit Optimization

Jun 13, 2024

An Empirical Study of Mamba-based Language Models

Jun 12, 2024

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

May 27, 2024

Audio Dialogues: Dialogues dataset for audio and music understanding

Apr 11, 2024