Abstract
Bioinformatics has numerous approaches for evaluating the similarities between RNA-seq data for disease classification. Processing RNA-sequencing (RNA-seq) data using clustering or classification approach is extremely challenging, although analysis of ribonucleic acid (RNA-Seq) helps understand differentially expressed genes and classify the patient in a risk-free method. In this study, we present a hybrid end-to-end pipeline for analyzing, processing, and classifying the RNA-Seq data with a major focus on the covid-19 data set. The pipeline has been developed in three phases initially the raw data is normalized. Then the normalized data is pushed to a colonization algorithm to remove the noise data. The optimized data set is passed to a Deep Learning (DL) classifier. Further, a comparative analysis is performed with state of art methods discussed in the literature. The results prove that our proposed hybrid pipeline achieved the best accuracy over other methods. Gene set enrichment analysis was also performed to analyze the genes that are informative towards COVID-19 identification.
Original language | English |
---|---|
Title of host publication | Proceeding of 2023 Australasian Computer Science Week, ACSW 2023 |
Publisher | Association for Computing Machinery |
Pages | 183-189 |
Number of pages | 7 |
ISBN (Electronic) | 9798400700057 |
ISBN (Print) | 9798400700057 |
DOIs | |
Publication status | Published - 30 Jan 2023 |
Event | Australasian Computer Science Week - Duration: 31 Jan 2023 → … |
Publication series
Name | ACM International Conference Proceeding Series |
---|
Conference
Conference | Australasian Computer Science Week |
---|---|
Period | 31/01/23 → … |
Bibliographical note
Publisher Copyright:© 2023 ACM.
Keywords
- Classification
- Covid-19
- Deep learning
- Gene set Enrichment Analysis
- RNA-Seq