Alert button
Picture for Shijie Wu

Shijie Wu

Alert button

MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies

Add code
Bookmark button
Alert button
May 26, 2023
Shiyue Zhang, Shijie Wu, Ozan Irsoy, Steven Lu, Mohit Bansal, Mark Dredze, David Rosenberg

Figure 1 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 2 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 3 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 4 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Viaarxiv icon

Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning

Add code
Bookmark button
Alert button
May 25, 2023
Genta Indra Winata, Lingjue Xie, Karthik Radhakrishnan, Shijie Wu, Xisen Jin, Pengxiang Cheng, Mayank Kulkarni, Daniel Preotiuc-Pietro

Figure 1 for Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning
Figure 2 for Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning
Figure 3 for Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning
Figure 4 for Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning
Viaarxiv icon

BloombergGPT: A Large Language Model for Finance

Add code
Bookmark button
Alert button
Mar 30, 2023
Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, Mark Dredze, Sebastian Gehrmann, Prabhanjan Kambadur, David Rosenberg, Gideon Mann

Figure 1 for BloombergGPT: A Large Language Model for Finance
Figure 2 for BloombergGPT: A Large Language Model for Finance
Figure 3 for BloombergGPT: A Large Language Model for Finance
Figure 4 for BloombergGPT: A Large Language Model for Finance
Viaarxiv icon

BoundaryFace: A mining framework with noise label self-correction for Face Recognition

Add code
Bookmark button
Alert button
Oct 10, 2022
Shijie Wu, Xun Gong

Figure 1 for BoundaryFace: A mining framework with noise label self-correction for Face Recognition
Figure 2 for BoundaryFace: A mining framework with noise label self-correction for Face Recognition
Figure 3 for BoundaryFace: A mining framework with noise label self-correction for Face Recognition
Figure 4 for BoundaryFace: A mining framework with noise label self-correction for Face Recognition
Viaarxiv icon

How Do Multilingual Encoders Learn Cross-lingual Representation?

Add code
Bookmark button
Alert button
Jul 12, 2022
Shijie Wu

Figure 1 for How Do Multilingual Encoders Learn Cross-lingual Representation?
Figure 2 for How Do Multilingual Encoders Learn Cross-lingual Representation?
Figure 3 for How Do Multilingual Encoders Learn Cross-lingual Representation?
Figure 4 for How Do Multilingual Encoders Learn Cross-lingual Representation?
Viaarxiv icon

Zero-shot Cross-lingual Transfer is Under-specified Optimization

Add code
Bookmark button
Alert button
Jul 12, 2022
Shijie Wu, Benjamin Van Durme, Mark Dredze

Figure 1 for Zero-shot Cross-lingual Transfer is Under-specified Optimization
Figure 2 for Zero-shot Cross-lingual Transfer is Under-specified Optimization
Figure 3 for Zero-shot Cross-lingual Transfer is Under-specified Optimization
Figure 4 for Zero-shot Cross-lingual Transfer is Under-specified Optimization
Viaarxiv icon

Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction

Add code
Bookmark button
Alert button
Sep 14, 2021
Mahsa Yarmohammadi, Shijie Wu, Marc Marone, Haoran Xu, Seth Ebner, Guanghui Qin, Yunmo Chen, Jialiang Guo, Craig Harman, Kenton Murray, Aaron Steven White, Mark Dredze, Benjamin Van Durme

Figure 1 for Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
Figure 2 for Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
Figure 3 for Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
Figure 4 for Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
Viaarxiv icon

Differentiable Generative Phonology

Add code
Bookmark button
Alert button
Feb 12, 2021
Shijie Wu, Edoardo Maria Ponti, Ryan Cotterell

Figure 1 for Differentiable Generative Phonology
Figure 2 for Differentiable Generative Phonology
Figure 3 for Differentiable Generative Phonology
Figure 4 for Differentiable Generative Phonology
Viaarxiv icon

Do Explicit Alignments Robustly Improve Multilingual Encoders?

Add code
Bookmark button
Alert button
Oct 06, 2020
Shijie Wu, Mark Dredze

Figure 1 for Do Explicit Alignments Robustly Improve Multilingual Encoders?
Figure 2 for Do Explicit Alignments Robustly Improve Multilingual Encoders?
Figure 3 for Do Explicit Alignments Robustly Improve Multilingual Encoders?
Figure 4 for Do Explicit Alignments Robustly Improve Multilingual Encoders?
Viaarxiv icon