Abhishek Panigrahi

AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models

Apr 30, 2025

On the Power of Context-Enhanced Learning in LLMs

Mar 03, 2025

Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?

Jan 05, 2025

Progressive distillation induces an implicit curriculum

Oct 07, 2024

Representing Rule-based Chatbots with Transformers

Jul 15, 2024

Efficient Stagewise Pretraining via Progressive Subnetworks

Feb 08, 2024

Trainable Transformer in Transformer

Jul 03, 2023

Do Transformers Parse while Predicting the Masked Word?

Mar 14, 2023

Task-Specific Skill Localization in Fine-tuned Language Models

Feb 13, 2023

On the SDEs and Scaling Rules for Adaptive Gradient Algorithms

May 20, 2022