Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

AdaMix: Mixture-of-Adapter for Parameter-efficient Tuning of Large Language Models



Yaqing Wang , Subhabrata Mukherjee , Xiaodong Liu , Jing Gao , Ahmed Hassan Awadallah , Jianfeng Gao


   Access Paper or Ask Questions

Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners



Shashank Gupta , Subhabrata Mukherjee , Krishan Subudhi , Eduardo Gonzalez , Damien Jose , Ahmed H. Awadallah , Jianfeng Gao


   Access Paper or Ask Questions

LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models



Mojan Javaheripi , Shital Shah , Subhabrata Mukherjee , Tomasz L. Religa , Caio C. T. Mendes , Gustavo H. de Rosa , Sebastien Bubeck , Farinaz Koushanfar , Debadeepta Dey


   Access Paper or Ask Questions

AutoDistil: Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models



Dongkuan Xu , Subhabrata Mukherjee , Xiaodong Liu , Debadeepta Dey , Wenhui Wang , Xiang Zhang , Ahmed Hassan Awadallah , Jianfeng Gao

* 13 pages, 4 figures, 10 tables 

   Access Paper or Ask Questions

CLUES: Few-Shot Learning Evaluation in Natural Language Understanding



Subhabrata Mukherjee , Xiaodong Liu , Guoqing Zheng , Saghar Hosseini , Hao Cheng , Greg Yang , Christopher Meek , Ahmed Hassan Awadallah , Jianfeng Gao

* NeurIPS 2021 Datasets and Benchmarks Track 

   Access Paper or Ask Questions

What do Compressed Large Language Models Forget? Robustness Challenges in Model Compression



Mengnan Du , Subhabrata Mukherjee , Yu Cheng , Milad Shokouhi , Xia Hu , Ahmed Hassan Awadallah


   Access Paper or Ask Questions

LiST: Lite Self-training Makes Efficient Few-shot Learners



Yaqing Wang , Subhabrata Mukherjee , Xiaodong Liu , Jing Gao , Ahmed Hassan Awadallah , Jianfeng Gao


   Access Paper or Ask Questions

Self-training with Few-shot Rationalization: Teacher Explanations Aid Student in Few-shot NLU



Meghana Moorthy Bhat , Alessandro Sordoni , Subhabrata Mukherjee

* To Appear in EMNLP 2021 

   Access Paper or Ask Questions

Fairness via Representation Neutralization



Mengnan Du , Subhabrata Mukherjee , Guanchu Wang , Ruixiang Tang , Ahmed Hassan Awadallah , Xia Hu


   Access Paper or Ask Questions

XtremeDistilTransformers: Task Transfer for Task-agnostic Distillation



Subhabrata Mukherjee , Ahmed Hassan Awadallah , Jianfeng Gao

* Code and checkpoints released (links in draft) 

   Access Paper or Ask Questions

1
2
3
4
>>