Picture for Anna Rumshisky

Anna Rumshisky

Beyond Perplexity: A Geometric and Spectral Study of Low-Rank Pre-Training

Add code
May 13, 2026
Viaarxiv icon

Adversarial Arena: Crowdsourcing Data Generation through Interactive Competition

Add code
Apr 20, 2026
Viaarxiv icon

Diverse, not Short: A Length-Controlled Self-Learning Framework for Improving Response Diversity of Language Models

Add code
May 22, 2025
Figure 1 for Diverse, not Short: A Length-Controlled Self-Learning Framework for Improving Response Diversity of Language Models
Figure 2 for Diverse, not Short: A Length-Controlled Self-Learning Framework for Improving Response Diversity of Language Models
Figure 3 for Diverse, not Short: A Length-Controlled Self-Learning Framework for Improving Response Diversity of Language Models
Figure 4 for Diverse, not Short: A Length-Controlled Self-Learning Framework for Improving Response Diversity of Language Models
Viaarxiv icon

MergeME: Model Merging Techniques for Homogeneous and Heterogeneous MoEs

Add code
Feb 04, 2025
Figure 1 for MergeME: Model Merging Techniques for Homogeneous and Heterogeneous MoEs
Figure 2 for MergeME: Model Merging Techniques for Homogeneous and Heterogeneous MoEs
Figure 3 for MergeME: Model Merging Techniques for Homogeneous and Heterogeneous MoEs
Figure 4 for MergeME: Model Merging Techniques for Homogeneous and Heterogeneous MoEs
Viaarxiv icon

Emergent Abilities in Reduced-Scale Generative Language Models

Add code
Apr 02, 2024
Figure 1 for Emergent Abilities in Reduced-Scale Generative Language Models
Figure 2 for Emergent Abilities in Reduced-Scale Generative Language Models
Figure 3 for Emergent Abilities in Reduced-Scale Generative Language Models
Figure 4 for Emergent Abilities in Reduced-Scale Generative Language Models
Viaarxiv icon

Deconstructing In-Context Learning: Understanding Prompts via Corruption

Add code
Apr 02, 2024
Figure 1 for Deconstructing In-Context Learning: Understanding Prompts via Corruption
Figure 2 for Deconstructing In-Context Learning: Understanding Prompts via Corruption
Figure 3 for Deconstructing In-Context Learning: Understanding Prompts via Corruption
Figure 4 for Deconstructing In-Context Learning: Understanding Prompts via Corruption
Viaarxiv icon

Prompt Perturbation Consistency Learning for Robust Language Models

Add code
Feb 24, 2024
Figure 1 for Prompt Perturbation Consistency Learning for Robust Language Models
Figure 2 for Prompt Perturbation Consistency Learning for Robust Language Models
Figure 3 for Prompt Perturbation Consistency Learning for Robust Language Models
Figure 4 for Prompt Perturbation Consistency Learning for Robust Language Models
Viaarxiv icon

Let's Reinforce Step by Step

Add code
Nov 10, 2023
Figure 1 for Let's Reinforce Step by Step
Figure 2 for Let's Reinforce Step by Step
Viaarxiv icon

Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

Add code
Jul 13, 2023
Figure 1 for Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
Figure 2 for Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
Figure 3 for Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
Figure 4 for Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
Viaarxiv icon

Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models

Add code
Jun 14, 2023
Figure 1 for Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models
Figure 2 for Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models
Figure 3 for Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models
Figure 4 for Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models
Viaarxiv icon