Alert button
Picture for Donald Metzler

Donald Metzler

Alert button

Rethinking Search: Making Experts out of Dilettantes

Add code
Bookmark button
Alert button
May 05, 2021
Donald Metzler, Yi Tay, Dara Bahri, Marc Najork

Figure 1 for Rethinking Search: Making Experts out of Dilettantes
Figure 2 for Rethinking Search: Making Experts out of Dilettantes
Figure 3 for Rethinking Search: Making Experts out of Dilettantes
Viaarxiv icon

OmniNet: Omnidirectional Representations from Transformers

Add code
Bookmark button
Alert button
Mar 01, 2021
Yi Tay, Mostafa Dehghani, Vamsi Aribandi, Jai Gupta, Philip Pham, Zhen Qin, Dara Bahri, Da-Cheng Juan, Donald Metzler

Figure 1 for OmniNet: Omnidirectional Representations from Transformers
Figure 2 for OmniNet: Omnidirectional Representations from Transformers
Figure 3 for OmniNet: Omnidirectional Representations from Transformers
Figure 4 for OmniNet: Omnidirectional Representations from Transformers
Viaarxiv icon

Label Smoothed Embedding Hypothesis for Out-of-Distribution Detection

Add code
Bookmark button
Alert button
Feb 09, 2021
Dara Bahri, Heinrich Jiang, Yi Tay, Donald Metzler

Figure 1 for Label Smoothed Embedding Hypothesis for Out-of-Distribution Detection
Figure 2 for Label Smoothed Embedding Hypothesis for Out-of-Distribution Detection
Figure 3 for Label Smoothed Embedding Hypothesis for Out-of-Distribution Detection
Figure 4 for Label Smoothed Embedding Hypothesis for Out-of-Distribution Detection
Viaarxiv icon

StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling

Add code
Bookmark button
Alert button
Dec 15, 2020
Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler, Aaron Courville

Figure 1 for StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling
Figure 2 for StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling
Figure 3 for StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling
Figure 4 for StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling
Viaarxiv icon

Long Range Arena: A Benchmark for Efficient Transformers

Add code
Bookmark button
Alert button
Nov 08, 2020
Yi Tay, Mostafa Dehghani, Samira Abnar, Yikang Shen, Dara Bahri, Philip Pham, Jinfeng Rao, Liu Yang, Sebastian Ruder, Donald Metzler

Figure 1 for Long Range Arena: A Benchmark for Efficient Transformers
Figure 2 for Long Range Arena: A Benchmark for Efficient Transformers
Figure 3 for Long Range Arena: A Benchmark for Efficient Transformers
Figure 4 for Long Range Arena: A Benchmark for Efficient Transformers
Viaarxiv icon

Surprise: Result List Truncation via Extreme Value Theory

Add code
Bookmark button
Alert button
Oct 19, 2020
Dara Bahri, Che Zheng, Yi Tay, Donald Metzler, Andrew Tomkins

Figure 1 for Surprise: Result List Truncation via Extreme Value Theory
Figure 2 for Surprise: Result List Truncation via Extreme Value Theory
Figure 3 for Surprise: Result List Truncation via Extreme Value Theory
Figure 4 for Surprise: Result List Truncation via Extreme Value Theory
Viaarxiv icon

Efficient Transformers: A Survey

Add code
Bookmark button
Alert button
Sep 16, 2020
Yi Tay, Mostafa Dehghani, Dara Bahri, Donald Metzler

Figure 1 for Efficient Transformers: A Survey
Figure 2 for Efficient Transformers: A Survey
Figure 3 for Efficient Transformers: A Survey
Figure 4 for Efficient Transformers: A Survey
Viaarxiv icon

Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study

Add code
Bookmark button
Alert button
Aug 17, 2020
Dara Bahri, Yi Tay, Che Zheng, Donald Metzler, Cliff Brunk, Andrew Tomkins

Figure 1 for Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Figure 2 for Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Figure 3 for Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Figure 4 for Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Viaarxiv icon

HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections

Add code
Bookmark button
Alert button
Jul 12, 2020
Yi Tay, Zhe Zhao, Dara Bahri, Donald Metzler, Da-Cheng Juan

Figure 1 for HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections
Figure 2 for HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections
Figure 3 for HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections
Figure 4 for HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections
Viaarxiv icon