Alert button
Picture for Ganesh Jawahar

Ganesh Jawahar

Alert button

LLM Performance Predictors are good initializers for Architecture Search

Add code
Bookmark button
Alert button
Oct 25, 2023
Ganesh Jawahar, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Dujian Ding

Viaarxiv icon

Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts

Add code
Bookmark button
Alert button
Jun 08, 2023
Ganesh Jawahar, Haichuan Yang, Yunyang Xiong, Zechun Liu, Dilin Wang, Fei Sun, Meng Li, Aasish Pappu, Barlas Oguz, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Raghuraman Krishnamoorthi, Vikas Chandra

Figure 1 for Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
Figure 2 for Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
Figure 3 for Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
Figure 4 for Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
Viaarxiv icon

Orca: Progressive Learning from Complex Explanation Traces of GPT-4

Add code
Bookmark button
Alert button
Jun 05, 2023
Subhabrata Mukherjee, Arindam Mitra, Ganesh Jawahar, Sahaj Agarwal, Hamid Palangi, Ahmed Awadallah

Figure 1 for Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Figure 2 for Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Figure 3 for Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Figure 4 for Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Viaarxiv icon

AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers

Add code
Bookmark button
Alert button
Oct 14, 2022
Ganesh Jawahar, Subhabrata Mukherjee, Xiaodong Liu, Young Jin Kim, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Ahmed Hassan Awadallah, Sebastien Bubeck, Jianfeng Gao

Figure 1 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Figure 2 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Figure 3 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Figure 4 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Viaarxiv icon

Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints

Add code
Bookmark button
Alert button
Oct 06, 2022
Ganesh Jawahar, Subhabrata Mukherjee, Debadeepta Dey, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Caio Cesar Teodoro Mendes, Gustavo Henrique de Rosa, Shital Shah

Figure 1 for Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints
Figure 2 for Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints
Figure 3 for Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints
Figure 4 for Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints
Viaarxiv icon

Automatic Detection of Entity-Manipulated Text using Factual Knowledge

Add code
Bookmark button
Alert button
Mar 19, 2022
Ganesh Jawahar, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan

Figure 1 for Automatic Detection of Entity-Manipulated Text using Factual Knowledge
Figure 2 for Automatic Detection of Entity-Manipulated Text using Factual Knowledge
Figure 3 for Automatic Detection of Entity-Manipulated Text using Factual Knowledge
Figure 4 for Automatic Detection of Entity-Manipulated Text using Factual Knowledge
Viaarxiv icon

InfoDCL: A Distantly Supervised Contrastive Learning Framework for Social Meaning

Add code
Bookmark button
Alert button
Mar 15, 2022
Chiyu Zhang, Muhammad Abdul-Mageed, Ganesh Jawahar

Figure 1 for InfoDCL: A Distantly Supervised Contrastive Learning Framework for Social Meaning
Figure 2 for InfoDCL: A Distantly Supervised Contrastive Learning Framework for Social Meaning
Figure 3 for InfoDCL: A Distantly Supervised Contrastive Learning Framework for Social Meaning
Figure 4 for InfoDCL: A Distantly Supervised Contrastive Learning Framework for Social Meaning
Viaarxiv icon

Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora

Add code
Bookmark button
Alert button
Dec 28, 2021
Hila Gonen, Ganesh Jawahar, Djamé Seddah, Yoav Goldberg

Figure 1 for Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora
Figure 2 for Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora
Figure 3 for Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora
Figure 4 for Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora
Viaarxiv icon

Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing

Add code
Bookmark button
Alert button
May 18, 2021
Ganesh Jawahar, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan

Figure 1 for Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Figure 2 for Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Figure 3 for Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Figure 4 for Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Viaarxiv icon

Automatic Detection of Machine Generated Text: A Critical Survey

Add code
Bookmark button
Alert button
Nov 02, 2020
Ganesh Jawahar, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan

Figure 1 for Automatic Detection of Machine Generated Text: A Critical Survey
Figure 2 for Automatic Detection of Machine Generated Text: A Critical Survey
Figure 3 for Automatic Detection of Machine Generated Text: A Critical Survey
Viaarxiv icon