Picture for Prateek Yadav

Prateek Yadav

A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning

Add code
Aug 13, 2024
Viaarxiv icon

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Add code
Jun 26, 2024
Figure 1 for BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Figure 2 for BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Figure 3 for BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Figure 4 for BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Viaarxiv icon

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Add code
Mar 30, 2024
Viaarxiv icon

ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization

Add code
Nov 22, 2023
Figure 1 for ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization
Figure 2 for ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization
Figure 3 for ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization
Figure 4 for ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization
Viaarxiv icon

D2 Pruning: Message Passing for Balancing Diversity and Difficulty in Data Pruning

Add code
Oct 11, 2023
Figure 1 for D2 Pruning: Message Passing for Balancing Diversity and Difficulty in Data Pruning
Figure 2 for D2 Pruning: Message Passing for Balancing Diversity and Difficulty in Data Pruning
Figure 3 for D2 Pruning: Message Passing for Balancing Diversity and Difficulty in Data Pruning
Figure 4 for D2 Pruning: Message Passing for Balancing Diversity and Difficulty in Data Pruning
Viaarxiv icon

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy

Add code
Oct 02, 2023
Figure 1 for Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
Figure 2 for Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
Figure 3 for Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
Figure 4 for Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
Viaarxiv icon

Exploring Continual Learning for Code Generation Models

Add code
Jul 05, 2023
Figure 1 for Exploring Continual Learning for Code Generation Models
Figure 2 for Exploring Continual Learning for Code Generation Models
Figure 3 for Exploring Continual Learning for Code Generation Models
Figure 4 for Exploring Continual Learning for Code Generation Models
Viaarxiv icon

Resolving Interference When Merging Models

Add code
Jun 02, 2023
Figure 1 for Resolving Interference When Merging Models
Figure 2 for Resolving Interference When Merging Models
Figure 3 for Resolving Interference When Merging Models
Figure 4 for Resolving Interference When Merging Models
Viaarxiv icon

Self-Chained Image-Language Model for Video Localization and Question Answering

Add code
May 11, 2023
Figure 1 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 2 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 3 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 4 for Self-Chained Image-Language Model for Video Localization and Question Answering
Viaarxiv icon

Exclusive Supermask Subnetwork Training for Continual Learning

Add code
Oct 18, 2022
Figure 1 for Exclusive Supermask Subnetwork Training for Continual Learning
Figure 2 for Exclusive Supermask Subnetwork Training for Continual Learning
Figure 3 for Exclusive Supermask Subnetwork Training for Continual Learning
Figure 4 for Exclusive Supermask Subnetwork Training for Continual Learning
Viaarxiv icon