Natural Language Processing (NLP) has emerged as a crucial technology for understanding and generating human language, playing an essential role in tasks such as machine translation, sentiment analysis, and, most pertinently here, question classification. As a subfield within NLP, question classification focuses on determining the type of information being sought, a fundamental step for downstream applications such as question answering systems. This study presents an innovative ensemble approach for question classification that combines the strengths of Electra, GloVe, and LSTM models. Evaluated on the widely used TREC dataset, the model shows how integrating these complementary technologies yields strong results: Electra contributes transformer-based contextual language understanding, GloVe provides global vector representations that capture word-level semantics, and LSTM adds sequence learning to model long-term dependencies. By fusing these elements strategically, our ensemble delivers a robust and efficient solution to the complex task of question classification. In rigorous comparisons with well-known models such as BERT, RoBERTa, and DistilBERT, the ensemble approach demonstrates its effectiveness, attaining 80% accuracy on the test set.
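A minimal PyTorch sketch of one plausible realization of this ensemble follows. The abstract does not specify the fusion strategy, so late fusion is assumed here: the Electra [CLS] representation is concatenated with the final hidden state of a GloVe-fed LSTM. The class name, layer sizes, and the six-class output (TREC's coarse label set) are illustrative assumptions, not the authors' exact configuration.

```python
# Hedged sketch: Electra branch + GloVe/LSTM branch with late fusion
# by concatenation (assumed; the abstract does not state the scheme).
import torch
import torch.nn as nn
from transformers import ElectraModel

class ElectraGloveLstmEnsemble(nn.Module):
    def __init__(self, glove_weights, num_classes=6, lstm_hidden=128):
        super().__init__()
        # Transformer branch: contextual features from Electra.
        self.electra = ElectraModel.from_pretrained(
            "google/electra-base-discriminator")
        # GloVe branch: frozen pre-trained word vectors feeding an LSTM.
        self.embedding = nn.Embedding.from_pretrained(glove_weights,
                                                      freeze=True)
        self.lstm = nn.LSTM(glove_weights.size(1), lstm_hidden,
                            batch_first=True)
        # Classifier over the concatenated branch representations.
        fused_dim = self.electra.config.hidden_size + lstm_hidden
        self.classifier = nn.Linear(fused_dim, num_classes)

    def forward(self, input_ids, attention_mask, glove_ids):
        # [CLS]-position hidden state summarizes the Electra branch.
        out = self.electra(input_ids=input_ids,
                           attention_mask=attention_mask)
        cls_vec = out.last_hidden_state[:, 0]
        # Final LSTM hidden state summarizes the GloVe branch.
        _, (h_n, _) = self.lstm(self.embedding(glove_ids))
        fused = torch.cat([cls_vec, h_n[-1]], dim=-1)
        return self.classifier(fused)
```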
In the rapidly evolving domain of machine learning, ensuring model generalizability remains a central challenge. Overfitting, where a model performs well on training data but falters on unseen data, is a recurrent concern. This paper introduces the Overfitting Index (OI), a novel metric devised to quantitatively assess a model's tendency to overfit. Through extensive experiments on the Breast Ultrasound Images (BUS) dataset and the MNIST dataset using architectures such as MobileNet, U-Net, ResNet, Darknet, and ViT-32, we illustrate the utility and discriminative power of the OI. Our results underscore the variable overfitting behaviors across architectures and highlight the mitigating effect of data augmentation, especially on smaller, more specialized datasets. The performance of ViT-32 on MNIST further emphasizes the robustness of certain models and the comprehensive nature of that dataset. By providing an objective lens through which to gauge overfitting, the OI offers a promising avenue for advancing model optimization and ensuring real-world efficacy.
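The abstract does not reproduce the OI formula, so the sketch below is only an illustrative stand-in in the same spirit: a normalized train/validation accuracy gap in which higher values indicate stronger overfitting. The function name and normalization are assumptions, not the paper's definition.

```python
# Illustrative only: the paper's exact OI formula is not given in the
# abstract; this assumes a normalized per-epoch train/validation gap.
import numpy as np

def overfitting_index(train_acc, val_acc):
    """Mean normalized gap between training and validation accuracy.

    train_acc, val_acc: per-epoch accuracy sequences of equal length.
    Returns a value in [0, 1]; higher suggests stronger overfitting.
    """
    train_acc = np.asarray(train_acc, dtype=float)
    val_acc = np.asarray(val_acc, dtype=float)
    gap = np.clip(train_acc - val_acc, 0.0, None)  # ignore underfitting
    return float(np.mean(gap / np.clip(train_acc, 1e-8, None)))

# Example: a model that memorizes its training data scores high.
print(overfitting_index([0.90, 0.95, 0.99], [0.80, 0.78, 0.75]))
```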
This study presents an ensemble model combining LSTM, BiLSTM, CNN, GRU, and GloVe to classify gene mutations using Kaggle's Personalized Medicine: Redefining Cancer Treatment dataset. The results were compared against well-known transformers such as BERT, Electra, RoBERTa, XLNet, DistilBERT, and their LSTM ensembles. Our model outperformed all other models in terms of accuracy, precision, recall, F1 score, and Mean Squared Error. Notably, it also required less training time, offering a strong balance of performance and efficiency. This study demonstrates the utility of ensemble models for difficult tasks such as gene mutation classification.
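One plausible reading of this architecture is a set of parallel recurrent and convolutional branches over a shared frozen GloVe embedding, fused by concatenation; the PyTorch sketch below makes that assumption explicit. Branch widths, the fusion scheme, and the nine-class output (matching the Kaggle dataset's mutation classes) are illustrative, not the authors' exact design.

```python
# Hedged sketch: parallel LSTM/BiLSTM/GRU/CNN branches over shared
# GloVe embeddings, fused by concatenation (assumed architecture).
import torch
import torch.nn as nn

class GeneMutationEnsemble(nn.Module):
    def __init__(self, glove_weights, num_classes=9, hidden=128):
        super().__init__()
        emb_dim = glove_weights.size(1)
        self.embedding = nn.Embedding.from_pretrained(glove_weights,
                                                      freeze=True)
        self.lstm = nn.LSTM(emb_dim, hidden, batch_first=True)
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True,
                              bidirectional=True)
        self.gru = nn.GRU(emb_dim, hidden, batch_first=True)
        self.conv = nn.Conv1d(emb_dim, hidden, kernel_size=3, padding=1)
        # LSTM (1x) + BiLSTM (2x) + GRU (1x) + CNN (1x) widths, fused.
        self.classifier = nn.Linear(hidden * 5, num_classes)

    def forward(self, token_ids):
        x = self.embedding(token_ids)              # (B, T, E)
        _, (h_l, _) = self.lstm(x)
        _, (h_b, _) = self.bilstm(x)               # forward + backward
        _, h_g = self.gru(x)
        # Global max pooling over time for the CNN branch.
        c = torch.amax(torch.relu(self.conv(x.transpose(1, 2))), dim=-1)
        fused = torch.cat([h_l[-1], h_b[-2], h_b[-1], h_g[-1], c], dim=-1)
        return self.classifier(fused)
```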
Vision Transformers (ViTs) have emerged as a promising approach for visual recognition tasks, revolutionizing the field by leveraging the power of transformer-based architectures. Among the various ViT models, Swin Transformers have gained considerable attention due to their hierarchical design and their ability to capture both local and global visual features effectively. This paper evaluates the performance of the Swin ViT model when trained with the gradient accumulation optimization (GAO) technique, investigating its impact on the model's accuracy and training time. Our experiments show that applying GAO leads to a significant decrease in the accuracy of the Swin ViT model compared to the standard Swin Transformer, and we also observe a significant increase in training time when GAO is applied. These findings suggest that the GAO technique may not be suitable for the Swin ViT model, and that caution should be exercised when applying it to other transformer-based models.
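For reference, gradient accumulation itself is a standard technique: gradients from several micro-batches are accumulated before a single optimizer step, emulating a larger effective batch size. A minimal PyTorch training-loop sketch follows; the model, loader, criterion, and the accum_steps value are placeholders, not the paper's exact experimental setup.

```python
# Minimal sketch of gradient accumulation: one optimizer step per
# accum_steps micro-batches, simulating a larger effective batch.
import torch

def train_epoch_with_accumulation(model, loader, optimizer, criterion,
                                  accum_steps=4):
    model.train()
    optimizer.zero_grad()
    for step, (images, labels) in enumerate(loader):
        loss = criterion(model(images), labels)
        # Scale the loss so accumulated gradients average, not sum.
        (loss / accum_steps).backward()
        if (step + 1) % accum_steps == 0:
            optimizer.step()   # one update per accum_steps batches
            optimizer.zero_grad()
    # Note: a trailing partial group of batches could be flushed with
    # one final optimizer.step() if the loader length is not divisible.
```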