Picture for Amitis Shidani

Amitis Shidani

Revisiting the Scaling Properties of Downstream Metrics in Large Language Model Training

Add code
Dec 09, 2025
Viaarxiv icon

Distillation Scaling Laws

Add code
Feb 12, 2025
Viaarxiv icon

Theory, Analysis, and Best Practices for Sigmoid Self-Attention

Add code
Sep 06, 2024
Figure 1 for Theory, Analysis, and Best Practices for Sigmoid Self-Attention
Figure 2 for Theory, Analysis, and Best Practices for Sigmoid Self-Attention
Figure 3 for Theory, Analysis, and Best Practices for Sigmoid Self-Attention
Figure 4 for Theory, Analysis, and Best Practices for Sigmoid Self-Attention
Viaarxiv icon

Poly-View Contrastive Learning

Add code
Mar 08, 2024
Figure 1 for Poly-View Contrastive Learning
Figure 2 for Poly-View Contrastive Learning
Figure 3 for Poly-View Contrastive Learning
Figure 4 for Poly-View Contrastive Learning
Viaarxiv icon

Optimal Regret Bounds for Collaborative Learning in Bandits

Add code
Dec 15, 2023
Viaarxiv icon

Ranking in Contextual Multi-Armed Bandits

Add code
Jun 30, 2022
Figure 1 for Ranking in Contextual Multi-Armed Bandits
Figure 2 for Ranking in Contextual Multi-Armed Bandits
Figure 3 for Ranking in Contextual Multi-Armed Bandits
Figure 4 for Ranking in Contextual Multi-Armed Bandits
Viaarxiv icon

Chained Generalisation Bounds

Add code
Mar 02, 2022
Figure 1 for Chained Generalisation Bounds
Figure 2 for Chained Generalisation Bounds
Viaarxiv icon