A New Filter Approach Based on Effective Ranges for Classification of Gene Expression Data

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

Over the years, many studies have sought to reduce or eliminate the effects of diseases on human health. Gene expression data sets play a critical role in diagnosing and treating diseases. These data sets contain thousands of genes but only a small number of samples. This imbalance gives rise to the curse of dimensionality and makes such data sets difficult to analyze. One of the most effective strategies for addressing this problem is feature selection, a preprocessing step that improves classification accuracy by selecting the most relevant and informative features. In this article, we propose a new statistically based filter method for feature selection, named the Effective Range-based Feature Selection Algorithm (FSAER). As an extension of the earlier Effective Range based Gene Selection (ERGS) and Improved Feature Selection based on Effective Range (IFSER) algorithms, our method combines the advantages of both while also taking the disjoint area into account. To illustrate the efficacy of the proposed algorithm, experiments were conducted on six benchmark gene expression data sets. FSAER was compared with other filter methods in terms of classification accuracy, using support vector machine, naive Bayes, and k-nearest neighbor classifiers.
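
The abstract does not state the FSAER scoring rule, so the following is only a minimal sketch of the general effective-range idea behind filters of this family, not the authors' algorithm. It assumes that each class gets a range of mean ± (1 − prior) · γ · standard deviation for a feature, and that a feature scores higher when these class ranges overlap less. The names `effective_range`, `overlap_score`, and `rank_features`, the default γ = 1.732, the prior weighting, and the pairwise-overlap normalization are all illustrative assumptions; the disjoint-area term that the abstract mentions is not modeled here.

```python
from statistics import mean, pstdev


def effective_range(values, prior, gamma=1.732):
    """Return (low, high) for one class: mean -/+ (1 - prior) * gamma * std.

    Assumption: this form and the default gamma are illustrative only.
    """
    mu = mean(values)
    half_width = (1.0 - prior) * gamma * pstdev(values)
    return mu - half_width, mu + half_width


def overlap_score(values_by_class, gamma=1.732):
    """Score one feature as 1 - (summed pairwise overlap / total span).

    A score of 1.0 means the class ranges do not overlap at all.
    With many heavily overlapping classes the score can drop below 0;
    it is meant for ranking features, not as a probability.
    """
    n_samples = sum(len(v) for v in values_by_class.values())
    ranges = [
        effective_range(v, len(v) / n_samples, gamma)
        for v in values_by_class.values()
    ]
    overlap = 0.0
    for i in range(len(ranges)):
        for j in range(i + 1, len(ranges)):
            low = max(ranges[i][0], ranges[j][0])
            high = min(ranges[i][1], ranges[j][1])
            overlap += max(0.0, high - low)
    span = max(r[1] for r in ranges) - min(r[0] for r in ranges)
    if span == 0:
        return 0.0  # constant feature: carries no class information
    return 1.0 - overlap / span


def rank_features(X, y, gamma=1.732):
    """Return feature indices sorted from most to least discriminative."""
    n_features = len(X[0])
    scores = []
    for f in range(n_features):
        by_class = {}
        for row, label in zip(X, y):
            by_class.setdefault(label, []).append(row[f])
        scores.append(overlap_score(by_class, gamma))
    return sorted(range(n_features), key=lambda f: scores[f], reverse=True)


# Toy data (made up): feature 0 separates the classes, feature 1 does not.
X = [[1.0, 5.0], [1.2, 3.0], [0.9, 4.0], [5.0, 4.5], [5.1, 3.2], [4.9, 4.1]]
y = [0, 0, 0, 1, 1, 1]
ranking = rank_features(X, y)
```

In a pipeline like the one the abstract describes, the top-ranked features would then be passed to a classifier such as a support vector machine, naive Bayes, or k-nearest neighbor model, and the filters compared by the resulting accuracy.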

Original language: English
Pages (from-to): 312-330
Number of pages: 19
Journal: Big Data
Volume: 12
Issue number: 4
Publication status: Published - 1 Aug 2024

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being

Keywords

  • classification methods
  • effective range
  • feature selection
  • filter methods
  • gene expression data
