Volume 1 Number 2 (Jun. 2011)
Home > Archive > 2011 > Volume 1 Number 2 (Jun. 2011) >
IJEEEE 2011 Vol.1(2): 169-174 ISSN: 2010-3654
DOI: 10.7763/IJEEEE.2011.V1.27

The Knowledge Discovery of β-Thalassemia Using Principal Components Analysis: PCA and Machine Learning Techniques

Patcharaporn Paokanta, Napat Harnpornchai, Somdat Srichairatanakool, and Michele Ceccarelli
Abstract—Feature Selection plays an important role in many areas especially in classification tasks. It is also an important pre-treatment for every classification process and not only decreases the computational time and cost, but selecting an appropriate variable also increases the classification accuracy. In this research, the comparison of classification performance of machine learning techniques using Principal Components Analysis (PCA) for screening the genotypes of β-Thalassemia patients will be proposed. The aim of this study is to reduce the dimensions of data before classification. According to the PCA method and classification techniques, results show that the Multi-Layer Perceptron (MLP) is the best algorithm providing that the percentage of accuracy reaches 86.61. K- Nearest Neighbors (KNN), Naive Bayes, Bayesian Networks (BNs) and Multinomial Logistic Regression require accuracy percentages of 85.83, 85.04, 85.04 and 82.68 respectively. In the future, we will search for the other feature selection techniques in order to improve the classification performance such as the hybrid method, filtering method etc.

Index Terms—β-Thalassemia, Classification Techniques, Principal Components Analysis (PCA), Feature Selection, Machine Leaning Techniques.

Dr. Somdat Srichairatanakool is Associate Professor in Biochemistry at Faculty of Medical, Chiang Mai University, Thailand. He can be contacted by E-mail: ssrichai@med.cmu.ac.th.
Dr. Napat Harnpornchai is Assistant Professor in Knowledge management at College of Arts, Media and Technology, Chiang Mai University, Thailand. He can be contacted by E-mail: napatresearch@gmail.com.
Michele Ceccarelli is Associate Professor in Computer Sciences at Faculty of Medical, Chiang Mai University, Thailand. He can be contacted by E-mail: ceccarelli@unisannio.it.

Cite: Patcharaporn Paokanta, Napat Harnpornchai, Somdat Srichairatanakool, and Michele Ceccarelli, "The Knowledge Discovery of β-Thalassemia Using Principal Components Analysis: PCA and Machine Learning Techniques," International Journal of e-Education, e-Business, e-Management and e-Learning vol. 1, no. 2, pp. 169-174, 2011. 

General Information

ISSN: 2010-3654 (Online)
Frequency: Quarterly (Since 2015)
Editor-in-Chief: Prof. Kuan-Chou Chen
Executive Editor: Ms. Nancy Lau
Abstracting/ Indexing: EBSCO, Google Scholar, Electronic Journals Library, QUALIS, ProQuest, EI (INSPEC, IET)
E-mail: ijeeee@iap.org
  • May 14, 2019 News!

    Vol.7, No.4-Vol.8, No.2 have been indexed by EI (Inspec).   [Click]

  • May 21, 2019 News!

    The papers published in Vol.9, No.2 have all received dois from Crossref.

  • May 09, 2019 News!

    IJEEEE Vol. 9, No. 2 is available online!    [Click]

  • Oct 08, 2018 News!

    The papers published in Vol.9, No.1 have all received dois from Crossref.

  • Aug 06, 2018 News!

    Vol.7, No.1-No.3 have been indexed by EI (Inspec).   [Click]

  • Read more>>