Rameswar Panda

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models

Apr 08, 2024
Bowen Pan, Yikang Shen, Haokun Liu, Mayank Mishra, Gaoyuan Zhang, Aude Oliva, Colin Raffel, Rameswar Panda

Via arXiv

Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization

Apr 04, 2024
Aniruddha Nrusimha, Mayank Mishra, Naigang Wang, Dan Alistarh, Rameswar Panda, Yoon Kim

Via arXiv

Scattered Mixture-of-Experts Implementation

Mar 13, 2024
Shawn Tan, Yikang Shen, Rameswar Panda, Aaron Courville

Via arXiv

API Pack: A Massive Multilingual Dataset for API Call Generation

Feb 16, 2024
Zhen Guo, Adriana Meza Soria, Wei Sun, Yikang Shen, Rameswar Panda

Via arXiv

Data Engineering for Scaling Language Models to 128K Context

Feb 15, 2024
Yao Fu, Rameswar Panda, Xinyao Niu, Xiang Yue, Hannaneh Hajishirzi, Yoon Kim, Hao Peng

Via arXiv

Diversity Measurement and Subset Selection for Instruction Tuning Datasets

Feb 04, 2024
Peiqi Wang, Yikang Shen, Zhen Guo, Matthew Stallone, Yoon Kim, Polina Golland, Rameswar Panda

Via arXiv

Gated Linear Attention Transformers with Hardware-Efficient Training

Dec 24, 2023
Songlin Yang, Bailin Wang, Yikang Shen, Rameswar Panda, Yoon Kim

Via arXiv

Learning Human Action Recognition Representations Without Real Humans

Nov 10, 2023
Howard Zhong, Samarth Mishra, Donghyun Kim, SouYoung Jin, Rameswar Panda, Hilde Kuehne, Leonid Karlinsky, Venkatesh Saligrama, Aude Oliva, Rogerio Feris

Via arXiv

LangNav: Language as a Perceptual Representation for Navigation

Oct 11, 2023
Bowen Pan, Rameswar Panda, SouYoung Jin, Rogerio Feris, Aude Oliva, Phillip Isola, Yoon Kim

Via arXiv

Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models

Jun 01, 2023
Sivan Doveh, Assaf Arbelle, Sivan Harary, Roei Herzig, Donghyun Kim, Paola Cascante-Bonilla, Amit Alfassy, Rameswar Panda, Raja Giryes, Rogerio Feris, Shimon Ullman, Leonid Karlinsky

Via arXiv