Analysis of high dimensionality yeast gene expression data using data mining

Mazin Aouf, Liwan Liyanage

    Research output: Contribution to journalArticlepeer-review

    Abstract

    Data Mining is the process of discovering interesting knowledge from large amounts of data stored either in databases, data warehouses, or other information repositories. From biological studies, the Yeast Proteome Database (YPD) is a model for the organization and presentation of genome-wide functional data. Accordingly, a yeast gene expression which is a unicellular DNA is selected which contains 6103 genes and the database combined with a number of related dataset to create a general dataset. DNA-binding transcriptional regulators interpret the genome's regulatory code by binding to specific sequences to induce or repress gene expression. The gene products including RNA and protein are responsible for the development and functioning of all living membranes by 2 steps process, transcription and translation. Various transcription factors control gene transcription by binding to the promoter regions. Translation is the production of proteins from mRNA produced in transcription. In this study, out of the 169 transcription factors known to access yeast, we are considering those thought to be involved in the response of Hydrogen Peroxide (H2O2). They are 22 transcription factors. Each one is partitioned to 3 parts: TF with No H2O2, TF with Low H2O2 and TF with High H2O2. The aim of this paper was to enhance the effectiveness of the integration of hydrogen peroxide response data related to yeast gene expression.
    Original languageEnglish
    Pages (from-to)515-522
    Number of pages8
    JournalApplied Mechanics and Materials
    Volume197
    DOIs
    Publication statusPublished - 2012

    Keywords

    • data
    • data mining
    • decision trees
    • gene expression
    • hydrogen peroxide
    • protein response
    • yeast

    Fingerprint

    Dive into the research topics of 'Analysis of high dimensionality yeast gene expression data using data mining'. Together they form a unique fingerprint.

    Cite this