Prateek Jain

AGGRNet: Selective Feature Extraction and Aggregation for Enhanced Medical Image Classification

Nov 15, 2025

Spark Transformer: Reactivating Sparsity in FFN and Attention

Jun 07, 2025

Faster Approx. Top-K: Harnessing the Full Power of Two Stages

Jun 05, 2025

Matryoshka Model Learning for Improved Elastic Student Models

May 29, 2025

Efficient Single-Pass Training for Multi-Turn Reasoning

Apr 25, 2025

Interleaved Gibbs Diffusion for Constrained Generation

Feb 19, 2025

Matryoshka Quantization

Feb 10, 2025

Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?

Dec 04, 2024

Time-Reversal Provides Unsupervised Feedback to LLMs

Dec 03, 2024

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Jul 29, 2024