Abstract: Large language models work well for technical problem solving in English but perform poorly when the same questions are asked in Bangla. A straightforward workaround is to translate Bangla questions into English first and then use these models. However, existing Bangla-English translation systems struggle with technical terms: they often mistranslate specialized vocabulary, which changes the meaning of the problem and leads to wrong answers. We present BanglaSTEM, a dataset of 5,000 carefully selected Bangla-English sentence pairs from STEM fields including computer science, mathematics, physics, chemistry, and biology. We generated over 12,000 translations using language models and then had human evaluators select the highest-quality pairs that preserve technical terminology correctly. We train a T5-based translation model on BanglaSTEM and evaluate it on two downstream tasks: code generation and math problem solving. Our results show significant improvements in translation accuracy for technical content, making it easier for Bangla speakers to use English-focused language models effectively. Both the BanglaSTEM dataset and the trained translation model are publicly released at https://huggingface.co/reyazul/BanglaSTEM-T5.
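A minimal usage sketch, not the authors' released code: it shows how the publicly released checkpoint at the URL above could be loaded with the Hugging Face transformers library to translate a Bangla question into English before passing it to an English-focused model. The repo id is taken from the release URL; the generation settings (beam size, token limit) are illustrative assumptions.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

MODEL_ID = "reyazul/BanglaSTEM-T5"  # repo id taken from the release URL above

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

def translate(bangla_text: str) -> str:
    # Tokenize the Bangla input and generate an English translation.
    inputs = tokenizer(bangla_text, return_tensors="pt", truncation=True)
    outputs = model.generate(**inputs, max_new_tokens=128, num_beams=4)  # assumed settings
    return tokenizer.decode(outputs[0], skip_special_tokens=True)

# The English output would then be forwarded to an English-focused LLM
# for code generation or math problem solving.
```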




Abstract: In this paper, we devise a mechanism for adding multi-modal information to an existing pipeline for continuous sign language recognition and translation. In our approach, we incorporate optical flow information alongside RGB images to enrich the features with movement-related cues. This work studies the feasibility of such modality inclusion using a cross-modal encoder. The plugin we use is very lightweight and does not need a separate feature extractor for the new modality, keeping the pipeline end-to-end. We apply the changes to both sign language recognition and translation, improving the results in each case. We evaluate performance on the RWTH-PHOENIX-2014 dataset for sign language recognition and the RWTH-PHOENIX-2014T dataset for translation. On the recognition task, our approach reduces the WER by 0.9, and on the translation task it increases most of the BLEU scores by approximately 0.6 on the test set.
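A rough sketch of one plausible form such a lightweight cross-modal plugin could take, not the paper's actual module: per-frame RGB features attend to optical-flow features through a single cross-attention layer with a residual connection, so no separate flow feature extractor has to be trained. Feature dimensions, head count, and the attention-based design are assumptions for illustration.

```python
import torch
import torch.nn as nn

class CrossModalFusion(nn.Module):
    """Hypothetical plugin: RGB frame features attend to optical-flow features.

    Both inputs are assumed to be per-frame feature sequences of shape
    (batch, time, dim); flow-conditioned context is added to the RGB stream
    via one cross-attention layer plus a residual connection.
    """

    def __init__(self, dim: int = 512, num_heads: int = 8):
        super().__init__()
        self.cross_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, rgb_feats: torch.Tensor, flow_feats: torch.Tensor) -> torch.Tensor:
        # RGB features act as queries; flow features provide keys and values.
        fused, _ = self.cross_attn(rgb_feats, flow_feats, flow_feats)
        # Residual connection keeps the original RGB pathway intact.
        return self.norm(rgb_feats + fused)

# Example: fuse 100 frames of 512-d RGB and flow features for a batch of 2.
rgb = torch.randn(2, 100, 512)
flow = torch.randn(2, 100, 512)
print(CrossModalFusion()(rgb, flow).shape)  # torch.Size([2, 100, 512])
```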