Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mukhlis Amien

Comprehensive Evaluation of Large Language Models on Software Engineering Tasks: A Multi-Task Benchmark

Feb 06, 2026

Go Frendi Gunawan, Mukhlis Amien

Abstract:Large Language Models (LLMs) have demonstrated remarkable capabilities in software engineering, yet comprehensive benchmarks covering diverse SE activities remain limited. We present a multi-task evaluation of 11 state-of-the-art LLMs across five representative software engineering tasks: bug fixing, feature development, code refactoring, technical copywriting, and research synthesis. Our automated verification framework measures both output quality and completion efficiency. Key findings reveal that (1) models achieving identical perfect scores exhibit 22x variation in completion time, 49x variation in tool efficiency, and 53x variation in estimated cost; (2) tool usage frequency shows no correlation with success (r = 0.077, p = 0.575) - one model used 917 tool calls while another solved the same task with 3 calls; (3) we identify two distinct inefficiency patterns: loop inefficiency and inference inefficiency; and (4) coding tasks achieve 100 percent success while research tasks present greater challenges (90.9 percent). We release all experimental data, verification scripts, and analysis code for full reproducibility.

* 10 pages, 7 figures. Under review. Code and data will be fully released

Via

Access Paper or Ask Questions

Sejarah dan Perkembangan Teknik Natural Language Processing (NLP) Bahasa Indonesia: Tinjauan tentang sejarah, perkembangan teknologi, dan aplikasi NLP dalam bahasa Indonesia

Mar 28, 2023

Mukhlis Amien

Figure 1 for Sejarah dan Perkembangan Teknik Natural Language Processing (NLP) Bahasa Indonesia: Tinjauan tentang sejarah, perkembangan teknologi, dan aplikasi NLP dalam bahasa Indonesia

Figure 2 for Sejarah dan Perkembangan Teknik Natural Language Processing (NLP) Bahasa Indonesia: Tinjauan tentang sejarah, perkembangan teknologi, dan aplikasi NLP dalam bahasa Indonesia

Figure 3 for Sejarah dan Perkembangan Teknik Natural Language Processing (NLP) Bahasa Indonesia: Tinjauan tentang sejarah, perkembangan teknologi, dan aplikasi NLP dalam bahasa Indonesia

Abstract:This study provides an overview of the history of the development of Natural Language Processing (NLP) in the context of the Indonesian language, with a focus on the basic technologies, methods, and practical applications that have been developed. This review covers developments in basic NLP technologies such as stemming, part-of-speech tagging, and related methods; practical applications in cross-language information retrieval systems, information extraction, and sentiment analysis; and methods and techniques used in Indonesian language NLP research, such as machine learning, statistics-based machine translation, and conflict-based approaches. This study also explores the application of NLP in Indonesian language industry and research and identifies challenges and opportunities in Indonesian language NLP research and development. Recommendations for future Indonesian language NLP research and development include developing more efficient methods and technologies, expanding NLP applications, increasing sustainability, further research into the potential of NLP, and promoting interdisciplinary collaboration. It is hoped that this review will help researchers, practitioners, and the government to understand the development of Indonesian language NLP and identify opportunities for further research and development.

* Paper in Indonesian and for Indonesian researcher

Via

Access Paper or Ask Questions

Reduce Indonesian Vocabularies with an Indonesian Sub-word Separator

Jul 01, 2022

Mukhlis Amien, Feng Chong, Huang Heyan

Figure 1 for Reduce Indonesian Vocabularies with an Indonesian Sub-word Separator

Figure 2 for Reduce Indonesian Vocabularies with an Indonesian Sub-word Separator

Figure 3 for Reduce Indonesian Vocabularies with an Indonesian Sub-word Separator

Figure 4 for Reduce Indonesian Vocabularies with an Indonesian Sub-word Separator

Abstract:Indonesian is an agglutinative language since it has a compounding process of word-formation. Therefore, the translation model of this language requires a mechanism that is even lower than the word level, referred to as the sub-word level. This compounding process leads to a rare word problem since the number of vocabulary explodes. We propose a strategy to address the unique word problem of the neural machine translation (NMT) system, which uses Indonesian as a pair language. Our approach uses a rule-based method to transform a word into its roots and accompanied affixes to retain its meaning and context. Using a rule-based algorithm has more advantages: it does not require corpus data but only applies the standard Indonesian rules. Our experiments confirm that this method is practical. It reduces the number of vocabulary significantly up to 57\%, and on the English to Indonesian translation, this strategy provides an improvement of up to 5 BLEU points over a similar NMT system that does not use this technique.

Via

Access Paper or Ask Questions

Location-based Twitter Filtering for the Creation of Low-Resource Language Datasets in Indonesian Local Languages

Jun 15, 2022

Mukhlis Amien, Chong Feng, Heyan Huang

Figure 1 for Location-based Twitter Filtering for the Creation of Low-Resource Language Datasets in Indonesian Local Languages

Figure 2 for Location-based Twitter Filtering for the Creation of Low-Resource Language Datasets in Indonesian Local Languages

Figure 3 for Location-based Twitter Filtering for the Creation of Low-Resource Language Datasets in Indonesian Local Languages

Figure 4 for Location-based Twitter Filtering for the Creation of Low-Resource Language Datasets in Indonesian Local Languages

Abstract:Twitter contains an abundance of linguistic data from the real world. We examine Twitter for user-generated content in low-resource languages such as local Indonesian. For NLP to work in Indonesian, it must consider local dialects, geographic context, and regional culture influence Indonesian languages. This paper identifies the problems we faced when constructing a Local Indonesian NLP dataset. Furthermore, we are developing a framework for creating, collecting, and classifying Local Indonesian datasets for NLP. Using twitter's geolocation tool for automatic annotating.

Via

Access Paper or Ask Questions

Synthetic Source Language Augmentation for Colloquial Neural Machine Translation

Dec 30, 2020

Asrul Sani Ariesandy, Mukhlis Amien, Alham Fikri Aji, Radityo Eko Prasojo

Figure 1 for Synthetic Source Language Augmentation for Colloquial Neural Machine Translation

Figure 2 for Synthetic Source Language Augmentation for Colloquial Neural Machine Translation

Figure 3 for Synthetic Source Language Augmentation for Colloquial Neural Machine Translation

Figure 4 for Synthetic Source Language Augmentation for Colloquial Neural Machine Translation

Abstract:Neural machine translation (NMT) is typically domain-dependent and style-dependent, and it requires lots of training data. State-of-the-art NMT models often fall short in handling colloquial variations of its source language and the lack of parallel data in this regard is a challenging hurdle in systematically improving the existing models. In this work, we develop a novel colloquial Indonesian-English test-set collected from YouTube transcript and Twitter. We perform synthetic style augmentation to the source of formal Indonesian language and show that it improves the baseline Id-En models (in BLEU) over the new test data.

* 5 pages

Via

Access Paper or Ask Questions