Picture for Ozan Irsoy

Ozan Irsoy

MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies

Add code
May 26, 2023
Figure 1 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 2 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 3 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 4 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Viaarxiv icon

BloombergGPT: A Large Language Model for Finance

Add code
Mar 30, 2023
Figure 1 for BloombergGPT: A Large Language Model for Finance
Figure 2 for BloombergGPT: A Large Language Model for Finance
Figure 3 for BloombergGPT: A Large Language Model for Finance
Figure 4 for BloombergGPT: A Large Language Model for Finance
Viaarxiv icon

Collective Entity Disambiguation with Structured Gradient Tree Boosting

Add code
Apr 24, 2018
Figure 1 for Collective Entity Disambiguation with Structured Gradient Tree Boosting
Figure 2 for Collective Entity Disambiguation with Structured Gradient Tree Boosting
Figure 3 for Collective Entity Disambiguation with Structured Gradient Tree Boosting
Figure 4 for Collective Entity Disambiguation with Structured Gradient Tree Boosting
Viaarxiv icon

Ask Me Anything: Dynamic Memory Networks for Natural Language Processing

Add code
Mar 05, 2016
Figure 1 for Ask Me Anything: Dynamic Memory Networks for Natural Language Processing
Figure 2 for Ask Me Anything: Dynamic Memory Networks for Natural Language Processing
Viaarxiv icon