Picture for Sanjiv Kumar

Sanjiv Kumar

Google Research

No more hard prompts: SoftSRV prompting for synthetic data generation

Add code
Oct 23, 2024
Figure 1 for No more hard prompts: SoftSRV prompting for synthetic data generation
Figure 2 for No more hard prompts: SoftSRV prompting for synthetic data generation
Figure 3 for No more hard prompts: SoftSRV prompting for synthetic data generation
Figure 4 for No more hard prompts: SoftSRV prompting for synthetic data generation
Viaarxiv icon

Mimetic Initialization Helps State Space Models Learn to Recall

Add code
Oct 14, 2024
Figure 1 for Mimetic Initialization Helps State Space Models Learn to Recall
Figure 2 for Mimetic Initialization Helps State Space Models Learn to Recall
Figure 3 for Mimetic Initialization Helps State Space Models Learn to Recall
Figure 4 for Mimetic Initialization Helps State Space Models Learn to Recall
Viaarxiv icon

Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?

Add code
Oct 10, 2024
Figure 1 for Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?
Figure 2 for Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?
Figure 3 for Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?
Figure 4 for Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?
Viaarxiv icon

On the Inductive Bias of Stacking Towards Improving Reasoning

Add code
Sep 27, 2024
Viaarxiv icon

Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines

Add code
Jul 22, 2024
Viaarxiv icon

Efficient Document Ranking with Learnable Late Interactions

Add code
Jun 25, 2024
Figure 1 for Efficient Document Ranking with Learnable Late Interactions
Figure 2 for Efficient Document Ranking with Learnable Late Interactions
Figure 3 for Efficient Document Ranking with Learnable Late Interactions
Figure 4 for Efficient Document Ranking with Learnable Late Interactions
Viaarxiv icon

Landscape-Aware Growing: The Power of a Little LAG

Add code
Jun 04, 2024
Figure 1 for Landscape-Aware Growing: The Power of a Little LAG
Figure 2 for Landscape-Aware Growing: The Power of a Little LAG
Figure 3 for Landscape-Aware Growing: The Power of a Little LAG
Figure 4 for Landscape-Aware Growing: The Power of a Little LAG
Viaarxiv icon

Faster Cascades via Speculative Decoding

Add code
May 29, 2024
Viaarxiv icon

Language Model Cascades: Token-level uncertainty and beyond

Add code
Apr 15, 2024
Viaarxiv icon

Towards Fast Inference: Exploring and Improving Blockwise Parallel Drafts

Add code
Apr 14, 2024
Figure 1 for Towards Fast Inference: Exploring and Improving Blockwise Parallel Drafts
Figure 2 for Towards Fast Inference: Exploring and Improving Blockwise Parallel Drafts
Figure 3 for Towards Fast Inference: Exploring and Improving Blockwise Parallel Drafts
Figure 4 for Towards Fast Inference: Exploring and Improving Blockwise Parallel Drafts
Viaarxiv icon