Picture for Franck Dernoncourt

Franck Dernoncourt

Adobe Research

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Add code
Sep 17, 2023
Figure 1 for CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Figure 2 for CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Viaarxiv icon

PDFTriage: Question Answering over Long, Structured Documents

Add code
Sep 16, 2023
Figure 1 for PDFTriage: Question Answering over Long, Structured Documents
Figure 2 for PDFTriage: Question Answering over Long, Structured Documents
Figure 3 for PDFTriage: Question Answering over Long, Structured Documents
Figure 4 for PDFTriage: Question Answering over Long, Structured Documents
Viaarxiv icon

Multilingual Sentence-Level Semantic Search using Meta-Distillation Learning

Add code
Sep 15, 2023
Figure 1 for Multilingual Sentence-Level Semantic Search using Meta-Distillation Learning
Figure 2 for Multilingual Sentence-Level Semantic Search using Meta-Distillation Learning
Figure 3 for Multilingual Sentence-Level Semantic Search using Meta-Distillation Learning
Figure 4 for Multilingual Sentence-Level Semantic Search using Meta-Distillation Learning
Viaarxiv icon

Bias and Fairness in Large Language Models: A Survey

Add code
Sep 02, 2023
Viaarxiv icon

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Add code
Aug 02, 2023
Figure 1 for Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Figure 2 for Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Figure 3 for Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Figure 4 for Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Viaarxiv icon

Boosting Punctuation Restoration with Data Generation and Reinforcement Learning

Add code
Jul 24, 2023
Viaarxiv icon

Learning Navigational Visual Representations with Semantic Map Supervision

Add code
Jul 23, 2023
Viaarxiv icon

Fairness-Aware Graph Neural Networks: A Survey

Add code
Jul 08, 2023
Viaarxiv icon

Efficient Spoken Language Recognition via Multilabel Classification

Add code
Jun 02, 2023
Figure 1 for Efficient Spoken Language Recognition via Multilabel Classification
Figure 2 for Efficient Spoken Language Recognition via Multilabel Classification
Figure 3 for Efficient Spoken Language Recognition via Multilabel Classification
Figure 4 for Efficient Spoken Language Recognition via Multilabel Classification
Viaarxiv icon

MeetingBank: A Benchmark Dataset for Meeting Summarization

Add code
May 27, 2023
Viaarxiv icon