Alert button
Picture for Mitesh M. Khapra

Mitesh M. Khapra

Alert button

IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

Add code
Bookmark button
Alert button
Mar 11, 2024
Mohammed Safi Ur Rahman Khan, Priyam Mehta, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad G, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, Mitesh M. Khapra

Figure 1 for IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
Figure 2 for IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
Figure 3 for IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
Figure 4 for IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
Viaarxiv icon

Airavata: Introducing Hindi Instruction-tuned LLM

Add code
Bookmark button
Alert button
Jan 26, 2024
Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, Aswanth Kumar M, Mohammed Safi Ur Rahman Khan, Diptesh Kanojia, Ratish Puduppully, Mitesh M. Khapra, Raj Dabre, Rudra Murthy, Anoop Kunchukuttan

Viaarxiv icon

An Empirical Analysis of In-context Learning Abilities of LLMs for MT

Add code
Bookmark button
Alert button
Jan 22, 2024
Pranjal A. Chitale, Jay Gala, Varun Gumma, Mitesh M. Khapra, Raj Dabre

Viaarxiv icon

IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages

Add code
Bookmark button
Alert button
May 25, 2023
AI4Bharat, Jay Gala, Pranjal A. Chitale, Raghavan AK, Sumanth Doddapaneni, Varun Gumma, Aswanth Kumar, Janki Nawale, Anupama Sujatha, Ratish Puduppully, Vivek Raghavan, Pratyush Kumar, Mitesh M. Khapra, Raj Dabre, Anoop Kunchukuttan

Figure 1 for IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages
Figure 2 for IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages
Figure 3 for IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages
Figure 4 for IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages
Viaarxiv icon

Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages

Add code
Bookmark button
Alert button
May 25, 2023
Yash Madhani, Mitesh M. Khapra, Anoop Kunchukuttan

Figure 1 for Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages
Figure 2 for Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages
Figure 3 for Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages
Figure 4 for Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages
Viaarxiv icon

Svarah: Evaluating English ASR Systems on Indian Accents

Add code
Bookmark button
Alert button
May 25, 2023
Tahir Javed, Sakshi Joshi, Vignesh Nagarajan, Sai Sundaresan, Janki Nawale, Abhigyan Raman, Kaushal Bhogale, Pratyush Kumar, Mitesh M. Khapra

Figure 1 for Svarah: Evaluating English ASR Systems on Indian Accents
Figure 2 for Svarah: Evaluating English ASR Systems on Indian Accents
Figure 3 for Svarah: Evaluating English ASR Systems on Indian Accents
Figure 4 for Svarah: Evaluating English ASR Systems on Indian Accents
Viaarxiv icon

Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR

Add code
Bookmark button
Alert button
May 24, 2023
Kaushal Santosh Bhogale, Sai Sundaresan, Abhigyan Raman, Tahir Javed, Mitesh M. Khapra, Pratyush Kumar

Figure 1 for Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR
Figure 2 for Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR
Figure 3 for Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR
Figure 4 for Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR
Viaarxiv icon

A Comprehensive Analysis of Adapter Efficiency

Add code
Bookmark button
Alert button
May 12, 2023
Nandini Mundra, Sumanth Doddapaneni, Raj Dabre, Anoop Kunchukuttan, Ratish Puduppully, Mitesh M. Khapra

Figure 1 for A Comprehensive Analysis of Adapter Efficiency
Figure 2 for A Comprehensive Analysis of Adapter Efficiency
Figure 3 for A Comprehensive Analysis of Adapter Efficiency
Figure 4 for A Comprehensive Analysis of Adapter Efficiency
Viaarxiv icon

IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages

Add code
Bookmark button
Alert button
Dec 20, 2022
Ananya B. Sai, Vignesh Nagarajan, Tanay Dixit, Raj Dabre, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra

Figure 1 for IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages
Figure 2 for IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages
Figure 3 for IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages
Figure 4 for IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages
Viaarxiv icon

Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages

Add code
Bookmark button
Alert button
Dec 20, 2022
Arnav Mhaske, Harshit Kedia, Sumanth Doddapaneni, Mitesh M. Khapra, Pratyush Kumar, Rudra Murthy V, Anoop Kunchukuttan

Figure 1 for Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages
Figure 2 for Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages
Figure 3 for Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages
Figure 4 for Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages
Viaarxiv icon