Picture for Piotr Stanczyk

Piotr Stanczyk

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models

Add code
Jun 23, 2023
Figure 1 for GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models
Figure 2 for GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models
Figure 3 for GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models
Figure 4 for GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models
Viaarxiv icon

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback

Add code
May 31, 2023
Figure 1 for Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
Figure 2 for Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
Figure 3 for Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
Figure 4 for Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
Viaarxiv icon

RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning

Add code
Nov 04, 2021
Figure 1 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Figure 2 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Figure 3 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Figure 4 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Viaarxiv icon

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference

Add code
Oct 15, 2019
Figure 1 for SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference
Figure 2 for SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference
Figure 3 for SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference
Figure 4 for SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference
Viaarxiv icon