Constructing a synthetic longitudinal health dataset for data mining

Shima Ghassem Pour, Anthony Maeder, Louisa Jorm

    Research output: Chapter in Book / Conference PaperConference Paperpeer-review

    Abstract

    ![CDATA[The traditional approach to epidemiological research is to analyse data in an explicit statistical fashion, attempting to answer a question or test a hypothesis. However, increasing experience in the application of data mining and exploratory data analysis methods suggests that valuable information can be obtained from large datasets using these less constrained approaches. Available data mining techniques, such as clustering, have mainly been applied to cross-sectional point-in-time data. However, health datasets often include repeated observations for individuals and so researchers are interested in following their health trajectories. This requires methods for analysis of multiple-points-over-time or longitudinal data. Here, we describe an approach to construct a synthetic longitudinal version of a major population health dataset in which clusters merge and split over time, to investigate the utility of clustering for discovering time sequence based patterns.]]
    Original languageEnglish
    Title of host publicationThe Fourth International Conference on Advances in Databases, Knowledge, and Data Applications : DBKDA 2012 : February 29 - March 5, 2012, Saint Gilles, Reunion Island
    PublisherCurran Associates
    Pages86-90
    Number of pages5
    ISBN (Print)9781612081854
    Publication statusPublished - 2012
    EventInternational Conference on Advances in Databases_Knowledge_and Data Applications -
    Duration: 29 Feb 2012 → …

    Conference

    ConferenceInternational Conference on Advances in Databases_Knowledge_and Data Applications
    Period29/02/12 → …

    Keywords

    • cluster analysis
    • medical informatics
    • data mining

    Fingerprint

    Dive into the research topics of 'Constructing a synthetic longitudinal health dataset for data mining'. Together they form a unique fingerprint.

    Cite this