Hrayr Harutyunyan

Continuous Chain of Thought Enables Parallel Exploration and Reasoning

May 29, 2025

Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA

Oct 28, 2024

A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs

Oct 24, 2024

Mimetic Initialization Helps State Space Models Learn to Recall

Oct 14, 2024

In-context Learning in Presence of Spurious Correlations

Oct 04, 2024

On information captured by neural networks: connections with memorization and generalization

Jun 28, 2023

Identifying and Disentangling Spurious Features in Pretrained Image Representations

Jun 22, 2023

A Meta-Learning Approach to Predicting Performance and Data Requirements

Mar 02, 2023

Supervision Complexity and its Role in Knowledge Distillation

Jan 28, 2023

Formal limitations of sample-wise information-theoretic generalization bounds

May 13, 2022