Maor Ivgi

From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty

Jul 08, 2024

DataComp-LM: In search of the next generation of training sets for language models

Jun 18, 2024

In-Context Learning with Long-Context Models: An In-Depth Exploration

Apr 30, 2024

Accelerated Parameter-Free Stochastic Optimization

Mar 31, 2024

ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding

May 23, 2023

DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule

Feb 08, 2023

Efficient Long-Text Understanding with Short-Text Models

Aug 01, 2022

Scaling Laws Under the Microscope: Predicting Transformer Performance from Small Scale Experiments

Feb 13, 2022

SCROLLS: Standardized CompaRison Over Long Language Sequences

Jan 10, 2022

Beyond Importance Scores: Interpreting Tabular ML by Visualizing Feature Semantics

Nov 30, 2021