Alert button
Picture for Ozan Irsoy

Ozan Irsoy

Alert button

MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies

Add code
Bookmark button
Alert button
May 26, 2023
Shiyue Zhang, Shijie Wu, Ozan Irsoy, Steven Lu, Mohit Bansal, Mark Dredze, David Rosenberg

Figure 1 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 2 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 3 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 4 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Viaarxiv icon

BloombergGPT: A Large Language Model for Finance

Add code
Bookmark button
Alert button
Mar 30, 2023
Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, Mark Dredze, Sebastian Gehrmann, Prabhanjan Kambadur, David Rosenberg, Gideon Mann

Figure 1 for BloombergGPT: A Large Language Model for Finance
Figure 2 for BloombergGPT: A Large Language Model for Finance
Figure 3 for BloombergGPT: A Large Language Model for Finance
Figure 4 for BloombergGPT: A Large Language Model for Finance
Viaarxiv icon

Collective Entity Disambiguation with Structured Gradient Tree Boosting

Add code
Bookmark button
Alert button
Apr 24, 2018
Yi Yang, Ozan Irsoy, Kazi Shefaet Rahman

Figure 1 for Collective Entity Disambiguation with Structured Gradient Tree Boosting
Figure 2 for Collective Entity Disambiguation with Structured Gradient Tree Boosting
Figure 3 for Collective Entity Disambiguation with Structured Gradient Tree Boosting
Figure 4 for Collective Entity Disambiguation with Structured Gradient Tree Boosting
Viaarxiv icon

Ask Me Anything: Dynamic Memory Networks for Natural Language Processing

Add code
Bookmark button
Alert button
Mar 05, 2016
Ankit Kumar, Ozan Irsoy, Peter Ondruska, Mohit Iyyer, James Bradbury, Ishaan Gulrajani, Victor Zhong, Romain Paulus, Richard Socher

Figure 1 for Ask Me Anything: Dynamic Memory Networks for Natural Language Processing
Figure 2 for Ask Me Anything: Dynamic Memory Networks for Natural Language Processing
Viaarxiv icon