Picture for Donald Metzler

Donald Metzler

LAIT: Efficient Multi-Segment Encoding in Transformers with Layer-Adjustable Interaction

Add code
May 31, 2023
Figure 1 for LAIT: Efficient Multi-Segment Encoding in Transformers with Layer-Adjustable Interaction
Figure 2 for LAIT: Efficient Multi-Segment Encoding in Transformers with Layer-Adjustable Interaction
Figure 3 for LAIT: Efficient Multi-Segment Encoding in Transformers with Layer-Adjustable Interaction
Figure 4 for LAIT: Efficient Multi-Segment Encoding in Transformers with Layer-Adjustable Interaction
Viaarxiv icon

How Does Generative Retrieval Scale to Millions of Passages?

Add code
May 19, 2023
Figure 1 for How Does Generative Retrieval Scale to Millions of Passages?
Figure 2 for How Does Generative Retrieval Scale to Millions of Passages?
Figure 3 for How Does Generative Retrieval Scale to Millions of Passages?
Figure 4 for How Does Generative Retrieval Scale to Millions of Passages?
Viaarxiv icon

DSI++: Updating Transformer Memory with New Documents

Add code
Dec 19, 2022
Figure 1 for DSI++: Updating Transformer Memory with New Documents
Figure 2 for DSI++: Updating Transformer Memory with New Documents
Figure 3 for DSI++: Updating Transformer Memory with New Documents
Figure 4 for DSI++: Updating Transformer Memory with New Documents
Viaarxiv icon

Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification

Add code
Dec 16, 2022
Figure 1 for Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification
Figure 2 for Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification
Figure 3 for Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification
Figure 4 for Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification
Viaarxiv icon

Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models

Add code
Dec 15, 2022
Figure 1 for Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models
Figure 2 for Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models
Figure 3 for Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models
Figure 4 for Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models
Viaarxiv icon

Transcending Scaling Laws with 0.1% Extra Compute

Add code
Oct 20, 2022
Figure 1 for Transcending Scaling Laws with 0.1% Extra Compute
Figure 2 for Transcending Scaling Laws with 0.1% Extra Compute
Figure 3 for Transcending Scaling Laws with 0.1% Extra Compute
Figure 4 for Transcending Scaling Laws with 0.1% Extra Compute
Viaarxiv icon

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?

Add code
Jul 21, 2022
Figure 1 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Figure 2 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Figure 3 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Figure 4 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Viaarxiv icon

Confident Adaptive Language Modeling

Add code
Jul 14, 2022
Figure 1 for Confident Adaptive Language Modeling
Figure 2 for Confident Adaptive Language Modeling
Figure 3 for Confident Adaptive Language Modeling
Figure 4 for Confident Adaptive Language Modeling
Viaarxiv icon

Emergent Abilities of Large Language Models

Add code
Jun 15, 2022
Figure 1 for Emergent Abilities of Large Language Models
Figure 2 for Emergent Abilities of Large Language Models
Figure 3 for Emergent Abilities of Large Language Models
Figure 4 for Emergent Abilities of Large Language Models
Viaarxiv icon

Unifying Language Learning Paradigms

Add code
May 10, 2022
Figure 1 for Unifying Language Learning Paradigms
Figure 2 for Unifying Language Learning Paradigms
Figure 3 for Unifying Language Learning Paradigms
Figure 4 for Unifying Language Learning Paradigms
Viaarxiv icon