Shashank Rajput

Maestro: Uncovering Low-Rank Structures via Trainable Decomposition

Aug 28, 2023
Via arXiv

Recommender Systems with Generative Retrieval

May 08, 2023
Via arXiv

The Expressive Power of Tuning Only the Norm Layers

Feb 15, 2023
Via arXiv

Looped Transformers as Programmable Computers

Jan 30, 2023
Via arXiv

LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks

Jun 15, 2022
Via arXiv

Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment

May 23, 2022
Via arXiv

Finding Everything within Random Binary Networks

Oct 22, 2021
Via arXiv

Minibatch vs Local SGD with Shuffling: Tight Convergence Bounds and Beyond

Oct 20, 2021
Via arXiv

An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks

Jun 14, 2021
Via arXiv

Permutation-Based SGD: Is Random Optimal?

Feb 19, 2021
Via arXiv