Picture for Irina Rish

Irina Rish

Training Dynamics Underlying Language Model Scaling Laws: Loss Deceleration and Zero-Sum Learning

Add code
Jun 05, 2025
Viaarxiv icon

MuLoCo: Muon is a practical inner optimizer for DiLoCo

Add code
May 29, 2025
Viaarxiv icon

Handling Delay in Real-Time Reinforcement Learning

Add code
Mar 30, 2025
Viaarxiv icon

Beyond Cosine Decay: On the effectiveness of Infinite Learning Rate Schedule for Continual Pre-training

Add code
Mar 06, 2025
Viaarxiv icon

Continual Pre-training of MoEs: How robust is your router?

Add code
Mar 06, 2025
Viaarxiv icon

Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark

Add code
Jan 16, 2025
Figure 1 for Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark
Figure 2 for Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark
Figure 3 for Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark
Figure 4 for Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark
Viaarxiv icon

Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference

Add code
Dec 18, 2024
Figure 1 for Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Figure 2 for Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Figure 3 for Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Figure 4 for Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Viaarxiv icon

RedPajama: an Open Dataset for Training Large Language Models

Add code
Nov 19, 2024
Viaarxiv icon

Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching

Add code
Nov 11, 2024
Figure 1 for Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
Figure 2 for Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
Figure 3 for Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
Figure 4 for Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
Viaarxiv icon

Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning

Add code
Nov 04, 2024
Figure 1 for Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Figure 2 for Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Figure 3 for Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Figure 4 for Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Viaarxiv icon