Picture for Sotiris Anagnostidis

Sotiris Anagnostidis

ETH Zurich

A Language Model's Guide Through Latent Space

Add code
Feb 22, 2024
Figure 1 for A Language Model's Guide Through Latent Space
Figure 2 for A Language Model's Guide Through Latent Space
Figure 3 for A Language Model's Guide Through Latent Space
Figure 4 for A Language Model's Guide Through Latent Space
Viaarxiv icon

Towards Meta-Pruning via Optimal Transport

Add code
Feb 13, 2024
Figure 1 for Towards Meta-Pruning via Optimal Transport
Figure 2 for Towards Meta-Pruning via Optimal Transport
Figure 3 for Towards Meta-Pruning via Optimal Transport
Figure 4 for Towards Meta-Pruning via Optimal Transport
Viaarxiv icon

Harnessing Synthetic Datasets: The Role of Shape Bias in Deep Neural Network Generalization

Add code
Nov 10, 2023
Figure 1 for Harnessing Synthetic Datasets: The Role of Shape Bias in Deep Neural Network Generalization
Figure 2 for Harnessing Synthetic Datasets: The Role of Shape Bias in Deep Neural Network Generalization
Figure 3 for Harnessing Synthetic Datasets: The Role of Shape Bias in Deep Neural Network Generalization
Figure 4 for Harnessing Synthetic Datasets: The Role of Shape Bias in Deep Neural Network Generalization
Viaarxiv icon

Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies

Add code
Nov 06, 2023
Figure 1 for Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies
Figure 2 for Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies
Figure 3 for Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies
Figure 4 for Navigating Scaling Laws: Accelerating Vision Transformer's Training via Adaptive Strategies
Viaarxiv icon

Transformer Fusion with Optimal Transport

Add code
Oct 15, 2023
Figure 1 for Transformer Fusion with Optimal Transport
Figure 2 for Transformer Fusion with Optimal Transport
Figure 3 for Transformer Fusion with Optimal Transport
Figure 4 for Transformer Fusion with Optimal Transport
Viaarxiv icon

Scaling MLPs: A Tale of Inductive Bias

Add code
Jun 23, 2023
Figure 1 for Scaling MLPs: A Tale of Inductive Bias
Figure 2 for Scaling MLPs: A Tale of Inductive Bias
Figure 3 for Scaling MLPs: A Tale of Inductive Bias
Figure 4 for Scaling MLPs: A Tale of Inductive Bias
Viaarxiv icon

Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers

Add code
May 25, 2023
Figure 1 for Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers
Figure 2 for Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers
Figure 3 for Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers
Figure 4 for Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers
Viaarxiv icon

OpenAssistant Conversations -- Democratizing Large Language Model Alignment

Add code
Apr 14, 2023
Figure 1 for OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Figure 2 for OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Figure 3 for OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Figure 4 for OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Viaarxiv icon

Random Teachers are Good Teachers

Add code
Feb 23, 2023
Figure 1 for Random Teachers are Good Teachers
Figure 2 for Random Teachers are Good Teachers
Figure 3 for Random Teachers are Good Teachers
Figure 4 for Random Teachers are Good Teachers
Viaarxiv icon

Cosmology from Galaxy Redshift Surveys with PointNet

Add code
Nov 22, 2022
Figure 1 for Cosmology from Galaxy Redshift Surveys with PointNet
Figure 2 for Cosmology from Galaxy Redshift Surveys with PointNet
Figure 3 for Cosmology from Galaxy Redshift Surveys with PointNet
Figure 4 for Cosmology from Galaxy Redshift Surveys with PointNet
Viaarxiv icon