A case study on parsing chemotherapy related free-text data

Ante Prodan, Joanne Curry

Research output: Chapter in Book / Conference PaperConference Paperpeer-review

Abstract

When modelling and simulating healthcare related processes free-text data is often the only possible source of information. This data may contain vocabulary variations such as mistyped, misspelled and/or abbreviated words. This paper describes a semi-automated approach to free-text normalisation based on a combination of commonly used techniques and local expertise of oncological nurses. In our approach particular emphasis is given to the effectiveness of the vocabulary creation process through an interactive software application. When local knowledge is successfully captured, normalisation of large data sets can be done very rapidly with a high accuracy rate achieved. Furthermore, the techniques for localised normalisation can have significant benefits to free-text parsing accuracy when data is aggregated from multiple sites (hospitals). A vocabulary created at one site can be reused at multiple other sites to speed up the vocabulary creation but at the same time facilitate the capture of local idiosyncrasies that otherwise would be lost. We believe that this research may lead to better understanding the free-text data which in turn may impact patient treatment and outcomes.
Original languageEnglish
Title of host publicationInvesting in E-health: People, Knowledge and Technology for a Healthy Future: Selected Papers from the 22nd Australian National Health Informatics Conference (HIC 2014), 11-14 August 2014, Melbourne Convention and Exhibition Centre
PublisherIOS Press
Pages116-122
Number of pages7
ISBN (Print)9781614994275
DOIs
Publication statusPublished - 2014
EventAustralian National Health Informatics Conference -
Duration: 11 Aug 2014 → …

Publication series

Name
ISSN (Print)1879-8365

Conference

ConferenceAustralian National Health Informatics Conference
Period11/08/14 → …

Keywords

  • hospital information systems
  • medical oncology
  • medical records
  • user-computer interface

Fingerprint

Dive into the research topics of 'A case study on parsing chemotherapy related free-text data'. Together they form a unique fingerprint.

Cite this