Marc Marone

AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees

Apr 12, 2024

Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data

Apr 05, 2024

Dated Data: Tracing Knowledge Cutoffs in Large Language Models

Mar 19, 2024

StarCoder 2 and The Stack v2: The Next Generation

Feb 29, 2024

"According to ..." Prompting Language Models Improves Quoting from Pre-Training Data

May 22, 2023

StarCoder: may the source be with you!

May 09, 2023

Data Portraits: Recording Foundation Model Training Data

Mar 06, 2023

Pretrained Models for Multilingual Federated Learning

Jun 06, 2022

Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction

Sep 14, 2021

Character Eyes: Seeing Language through Character-Level Taggers

Mar 12, 2019