Picture for Mohammad Shoeybi

Mohammad Shoeybi

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Add code
Jul 19, 2024
Viaarxiv icon

Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models

Add code
Jul 09, 2024
Figure 1 for Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models
Figure 2 for Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models
Figure 3 for Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models
Figure 4 for Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models
Viaarxiv icon

Data, Data Everywhere: A Guide for Pretraining Dataset Construction

Add code
Jul 08, 2024
Viaarxiv icon

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

Add code
Jul 02, 2024
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

An Empirical Study of Mamba-based Language Models

Add code
Jun 12, 2024
Viaarxiv icon

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

Add code
May 27, 2024
Figure 1 for NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Figure 2 for NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Figure 3 for NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Figure 4 for NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Viaarxiv icon

Nemotron-4 15B Technical Report

Add code
Feb 27, 2024
Figure 1 for Nemotron-4 15B Technical Report
Figure 2 for Nemotron-4 15B Technical Report
Figure 3 for Nemotron-4 15B Technical Report
Figure 4 for Nemotron-4 15B Technical Report
Viaarxiv icon

ODIN: Disentangled Reward Mitigates Hacking in RLHF

Add code
Feb 11, 2024
Viaarxiv icon

ChatQA: Building GPT-4 Level Conversational QA Models

Add code
Jan 23, 2024
Figure 1 for ChatQA: Building GPT-4 Level Conversational QA Models
Figure 2 for ChatQA: Building GPT-4 Level Conversational QA Models
Figure 3 for ChatQA: Building GPT-4 Level Conversational QA Models
Figure 4 for ChatQA: Building GPT-4 Level Conversational QA Models
Viaarxiv icon