Abstract: Evaluating ASR systems for Indian languages is challenging due to spelling variations, flexibility in suffix splitting, and non-standard spellings in code-mixed words. Traditional Word Error Rate (WER) often paints a bleaker picture of system performance than what human users perceive. Aligning evaluation more closely with real-world performance requires capturing permissible orthographic variations, which is extremely challenging for under-resourced Indian languages. Leveraging recent advances in LLMs, we propose a framework for creating benchmarks that capture such permissible variations. Through extensive experiments, we demonstrate that the resulting metric, OIWER, by accounting for orthographic variations, reduces pessimistic error rates (an average improvement of 6.3 points), narrows inflated model gaps (e.g., the Gemini-Canary performance difference drops from 18.1 to 11.5 points), and aligns more closely with human perception than prior methods such as WER-SN (by 4.9 points).
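For reference, the baseline metric being improved here, standard WER, is the word-level Levenshtein distance between reference and hypothesis divided by the reference length. Below is a minimal sketch; the `variants` table is a hypothetical stand-in for the permissible orthographic variations a metric like OIWER would credit, and is illustrative only, not the paper's algorithm:

```python
def wer(reference, hypothesis, variants=None):
    """Word Error Rate: word-level edit distance (substitutions +
    insertions + deletions) over the number of reference words.

    `variants` (hypothetical, not from the paper) maps a reference word
    to spellings that should count as correct matches."""
    variants = variants or {}
    ref, hyp = reference.split(), hypothesis.split()

    def match(r, h):
        return h == r or h in variants.get(r, ())

    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if match(ref[i - 1], hyp[j - 1]) else 1
            dp[i][j] = min(dp[i - 1][j] + 1,          # deletion
                           dp[i][j - 1] + 1,          # insertion
                           dp[i - 1][j - 1] + cost)   # substitution/match
    return dp[len(ref)][len(hyp)] / len(ref)
```

With an empty `variants` table this reduces to plain WER; populating it lowers the score whenever the hypothesis uses an accepted alternative spelling, which is the intuition behind variation-aware scoring.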




Abstract: The aim of this paper is to recommend a method for automated classification of fish species. High-accuracy fish classification is needed for a greater understanding of fish behavior in ichthyology and by marine biologists. Concerned institutions also need to maintain a ledger of the number of fish per species and to mark endangered species in large and small water bodies. The majority of available methods focus on classifying fish outside of water, because underwater classification poses challenges such as background noise, image distortion, the presence of other objects in images, poor image quality and occlusion. This method uses a novel technique based on Convolutional Neural Networks, deep learning and image processing to achieve an accuracy of 96.29%. It delivers considerable improvements in discrimination accuracy over previously proposed methods.
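The abstract names the technique but not the architecture; a minimal sketch of a CNN image classifier of this general kind is shown below, where the input size (64×64 RGB) and the number of species (`NUM_SPECIES = 8`) are assumptions for illustration, not the paper's values:

```python
import tensorflow as tf

# Assumed values for illustration only; the paper does not state them.
NUM_SPECIES = 8

# A small convolutional classifier: stacked conv/pool feature extractors
# followed by a dense head with one softmax output per species.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(64, 64, 3)),
    tf.keras.layers.Conv2D(32, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(NUM_SPECIES, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```

In practice the image-processing steps the abstract mentions (denoising, handling distortion and occlusion) would precede such a model as a preprocessing stage.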
Abstract: The aim of this research is to experiment with, iterate on, and recommend a system that successfully recognizes American Sign Language (ASL). This is a challenging as well as interesting problem whose solution would bring social and technological advances alike. In this paper, we propose a real-time ASL recognizer on a mobile platform, for greater accessibility and ease of use. The technique implemented is transfer learning: new hand-gesture data for the ASL alphabet is modelled on various pre-trained high-end models, and the best model is optimized to run on a mobile platform, taking its limitations into account during optimization. The data used consists of 27,455 images of 24 ASL alphabet letters. The optimized model, when run in a memory-efficient mobile application, provides a recognition accuracy of 95.03% with an average recognition time of 2.42 seconds. This method delivers considerable improvements in discrimination accuracy and recognition time over previous research.
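The transfer-learning recipe described above can be sketched as follows. The backbone choice (MobileNetV2), input size, and TFLite export are assumptions illustrating the general pattern, not the paper's exact pipeline; a real run would also load pre-trained weights (e.g. `weights="imagenet"`), which this offline sketch omits:

```python
import tensorflow as tf

# Assumed backbone and input size; the paper compared several
# pre-trained models and picked the best for mobile deployment.
base = tf.keras.applications.MobileNetV2(
    input_shape=(96, 96, 3), include_top=False, weights=None, pooling="avg")
base.trainable = False  # freeze pre-trained features; train only the new head

model = tf.keras.Sequential([
    base,
    tf.keras.layers.Dense(24, activation="softmax"),  # 24 ASL letters
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# Optimizing for a mobile platform would typically mean exporting a
# quantized TFLite model, e.g.:
converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_model = converter.convert()
```

Freezing the backbone and training only the small classification head is what makes transfer learning feasible on a dataset of this size (27,455 images), and the TFLite export is one standard route to the memory-efficient mobile deployment the abstract describes.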