Picture for Prateek Jain

Prateek Jain

Spark Transformer: Reactivating Sparsity in FFN and Attention

Add code
Jun 07, 2025
Viaarxiv icon

Faster Approx. Top-K: Harnessing the Full Power of Two Stages

Add code
Jun 05, 2025
Viaarxiv icon

Matryoshka Model Learning for Improved Elastic Student Models

Add code
May 29, 2025
Viaarxiv icon

Efficient Single-Pass Training for Multi-Turn Reasoning

Add code
Apr 25, 2025
Viaarxiv icon

Interleaved Gibbs Diffusion for Constrained Generation

Add code
Feb 19, 2025
Viaarxiv icon

Matryoshka Quantization

Add code
Feb 10, 2025
Viaarxiv icon

Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?

Add code
Dec 04, 2024
Figure 1 for Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?
Figure 2 for Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?
Figure 3 for Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?
Figure 4 for Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?
Viaarxiv icon

Time-Reversal Provides Unsupervised Feedback to LLMs

Add code
Dec 03, 2024
Figure 1 for Time-Reversal Provides Unsupervised Feedback to LLMs
Figure 2 for Time-Reversal Provides Unsupervised Feedback to LLMs
Figure 3 for Time-Reversal Provides Unsupervised Feedback to LLMs
Figure 4 for Time-Reversal Provides Unsupervised Feedback to LLMs
Viaarxiv icon

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Add code
Jul 29, 2024
Figure 1 for Mixture of Nested Experts: Adaptive Processing of Visual Tokens
Figure 2 for Mixture of Nested Experts: Adaptive Processing of Visual Tokens
Figure 3 for Mixture of Nested Experts: Adaptive Processing of Visual Tokens
Figure 4 for Mixture of Nested Experts: Adaptive Processing of Visual Tokens
Viaarxiv icon

LookupViT: Compressing visual information to a limited number of tokens

Add code
Jul 17, 2024
Figure 1 for LookupViT: Compressing visual information to a limited number of tokens
Figure 2 for LookupViT: Compressing visual information to a limited number of tokens
Figure 3 for LookupViT: Compressing visual information to a limited number of tokens
Figure 4 for LookupViT: Compressing visual information to a limited number of tokens
Viaarxiv icon