Alert button
Picture for Irina Rish

Irina Rish

Alert button

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Mar 13, 2024
Adam Ibrahim, Benjamin Thérien, Kshitij Gupta, Mats L. Richter, Quentin Anthony, Timothée Lesort, Eugene Belilovsky, Irina Rish

Viaarxiv icon

Unsupervised Concept Discovery Mitigates Spurious Correlations

Feb 20, 2024
Md Rifat Arefin, Yan Zhang, Aristide Baratin, Francesco Locatello, Irina Rish, Dianbo Liu, Kenji Kawaguchi

Viaarxiv icon

Towards Machines that Trust: AI Agents Learn to Trust in the Trust Game

Dec 20, 2023
Ardavan S. Nobandegani, Irina Rish, Thomas R. Shultz

Viaarxiv icon

Lag-Llama: Towards Foundation Models for Time Series Forecasting

Oct 12, 2023
Kashif Rasul, Arjun Ashok, Andrew Robert Williams, Arian Khorasani, George Adamopoulos, Rishika Bhagwatkar, Marin Biloš, Hena Ghonia, Nadhir Vincent Hassen, Anderson Schneider, Sahil Garg, Alexandre Drouin, Nicolas Chapados, Yuriy Nevmyvaka, Irina Rish

Figure 1 for Lag-Llama: Towards Foundation Models for Time Series Forecasting
Figure 2 for Lag-Llama: Towards Foundation Models for Time Series Forecasting
Figure 3 for Lag-Llama: Towards Foundation Models for Time Series Forecasting
Figure 4 for Lag-Llama: Towards Foundation Models for Time Series Forecasting
Viaarxiv icon

LORD: Low Rank Decomposition Of Monolingual Code LLMs For One-Shot Compression

Sep 25, 2023
Ayush Kaushal, Tejas Vaidhya, Irina Rish

Viaarxiv icon

Amplifying Pathological Detection in EEG Signaling Pathways through Cross-Dataset Transfer Learning

Sep 19, 2023
Mohammad-Javad Darvishi-Bayazi, Mohammad Sajjad Ghaemi, Timothee Lesort, Md Rifat Arefin, Jocelyn Faubert, Irina Rish

Figure 1 for Amplifying Pathological Detection in EEG Signaling Pathways through Cross-Dataset Transfer Learning
Figure 2 for Amplifying Pathological Detection in EEG Signaling Pathways through Cross-Dataset Transfer Learning
Figure 3 for Amplifying Pathological Detection in EEG Signaling Pathways through Cross-Dataset Transfer Learning
Figure 4 for Amplifying Pathological Detection in EEG Signaling Pathways through Cross-Dataset Transfer Learning
Viaarxiv icon

Continual Pre-Training of Large Language Models: How to (re)warm your model?

Aug 08, 2023
Kshitij Gupta, Benjamin Thérien, Adam Ibrahim, Mats L. Richter, Quentin Anthony, Eugene Belilovsky, Irina Rish, Timothée Lesort

Figure 1 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Figure 2 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Figure 3 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Figure 4 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Viaarxiv icon

GOKU-UI: Ubiquitous Inference through Attention and Multiple Shooting for Continuous-time Generative Models

Jul 11, 2023
Germán Abrevaya, Mahta Ramezanian-Panahi, Jean-Christophe Gagnon-Audet, Irina Rish, Pablo Polosecki, Silvina Ponce Dawson, Guillermo Cecchi, Guillaume Dumas

Figure 1 for GOKU-UI: Ubiquitous Inference through Attention and Multiple Shooting for Continuous-time Generative Models
Figure 2 for GOKU-UI: Ubiquitous Inference through Attention and Multiple Shooting for Continuous-time Generative Models
Figure 3 for GOKU-UI: Ubiquitous Inference through Attention and Multiple Shooting for Continuous-time Generative Models
Figure 4 for GOKU-UI: Ubiquitous Inference through Attention and Multiple Shooting for Continuous-time Generative Models
Viaarxiv icon

Maximum State Entropy Exploration using Predecessor and Successor Representations

Jun 26, 2023
Arnav Kumar Jain, Lucas Lehnert, Irina Rish, Glen Berseth

Figure 1 for Maximum State Entropy Exploration using Predecessor and Successor Representations
Figure 2 for Maximum State Entropy Exploration using Predecessor and Successor Representations
Figure 3 for Maximum State Entropy Exploration using Predecessor and Successor Representations
Figure 4 for Maximum State Entropy Exploration using Predecessor and Successor Representations
Viaarxiv icon

Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok

Jun 23, 2023
Pascal Jr. Tikeng Notsawo, Hattie Zhou, Mohammad Pezeshki, Irina Rish, Guillaume Dumas

Figure 1 for Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok
Figure 2 for Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok
Figure 3 for Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok
Figure 4 for Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok
Viaarxiv icon