Picture for Oleksii Kuchaiev

Oleksii Kuchaiev

GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning

Add code
Jul 05, 2024
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

HelpSteer2: Open-source dataset for training top-performing reward models

Add code
Jun 12, 2024
Viaarxiv icon

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Add code
May 02, 2024
Viaarxiv icon

Nemotron-4 15B Technical Report

Add code
Feb 27, 2024
Figure 1 for Nemotron-4 15B Technical Report
Figure 2 for Nemotron-4 15B Technical Report
Figure 3 for Nemotron-4 15B Technical Report
Figure 4 for Nemotron-4 15B Technical Report
Viaarxiv icon

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

Add code
Nov 16, 2023
Figure 1 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 2 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 3 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 4 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Viaarxiv icon

Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying

Add code
Nov 16, 2023
Figure 1 for Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying
Figure 2 for Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying
Figure 3 for Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying
Figure 4 for Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying
Viaarxiv icon

SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF

Add code
Oct 09, 2023
Figure 1 for SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Figure 2 for SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Figure 3 for SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Figure 4 for SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Viaarxiv icon

Leveraging Synthetic Targets for Machine Translation

Add code
May 07, 2023
Figure 1 for Leveraging Synthetic Targets for Machine Translation
Figure 2 for Leveraging Synthetic Targets for Machine Translation
Figure 3 for Leveraging Synthetic Targets for Machine Translation
Figure 4 for Leveraging Synthetic Targets for Machine Translation
Viaarxiv icon

Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study

Add code
Apr 13, 2023
Figure 1 for Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study
Figure 2 for Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study
Figure 3 for Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study
Figure 4 for Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study
Viaarxiv icon