Picture for Rachit Bansal

Rachit Bansal

RL Excursions during Pre-Training: Re-examining Policy Optimization for LLM training

Add code
Jun 02, 2026
Viaarxiv icon

Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention

Add code
May 28, 2026
Viaarxiv icon

Interleaved Head Attention

Add code
Feb 24, 2026
Viaarxiv icon

Let's (not) just put things in Context: Test-Time Training for Long-Context LLMs

Add code
Dec 15, 2025
Viaarxiv icon

LLM Augmented LLMs: Expanding Capabilities through Composition

Add code
Jan 04, 2024
Figure 1 for LLM Augmented LLMs: Expanding Capabilities through Composition
Figure 2 for LLM Augmented LLMs: Expanding Capabilities through Composition
Figure 3 for LLM Augmented LLMs: Expanding Capabilities through Composition
Figure 4 for LLM Augmented LLMs: Expanding Capabilities through Composition
Viaarxiv icon

Measures of Information Reflect Memorization Patterns

Add code
Oct 19, 2022
Figure 1 for Measures of Information Reflect Memorization Patterns
Figure 2 for Measures of Information Reflect Memorization Patterns
Figure 3 for Measures of Information Reflect Memorization Patterns
Figure 4 for Measures of Information Reflect Memorization Patterns
Viaarxiv icon

LM-CORE: Language Models with Contextually Relevant External Knowledge

Add code
Aug 12, 2022
Figure 1 for LM-CORE: Language Models with Contextually Relevant External Knowledge
Figure 2 for LM-CORE: Language Models with Contextually Relevant External Knowledge
Figure 3 for LM-CORE: Language Models with Contextually Relevant External Knowledge
Figure 4 for LM-CORE: Language Models with Contextually Relevant External Knowledge
Viaarxiv icon

CoSe-Co: Text Conditioned Generative CommonSense Contextualizer

Add code
Jun 17, 2022
Figure 1 for CoSe-Co: Text Conditioned Generative CommonSense Contextualizer
Figure 2 for CoSe-Co: Text Conditioned Generative CommonSense Contextualizer
Figure 3 for CoSe-Co: Text Conditioned Generative CommonSense Contextualizer
Figure 4 for CoSe-Co: Text Conditioned Generative CommonSense Contextualizer
Viaarxiv icon

Linear Connectivity Reveals Generalization Strategies

Add code
May 24, 2022
Figure 1 for Linear Connectivity Reveals Generalization Strategies
Figure 2 for Linear Connectivity Reveals Generalization Strategies
Figure 3 for Linear Connectivity Reveals Generalization Strategies
Figure 4 for Linear Connectivity Reveals Generalization Strategies
Viaarxiv icon

How Low is Too Low? A Computational Perspective on Extremely Low-Resource Languages

Add code
May 30, 2021
Figure 1 for How Low is Too Low? A Computational Perspective on Extremely Low-Resource Languages
Figure 2 for How Low is Too Low? A Computational Perspective on Extremely Low-Resource Languages
Figure 3 for How Low is Too Low? A Computational Perspective on Extremely Low-Resource Languages
Figure 4 for How Low is Too Low? A Computational Perspective on Extremely Low-Resource Languages
Viaarxiv icon