TY - GEN
T1 - A case study on parsing chemotherapy related free-text data
AU - Prodan, Ante
AU - Curry, Joanne
PY - 2014
Y1 - 2014
N2 - When modelling and simulating healthcare related processes free-text data is often the only possible source of information. This data may contain vocabulary variations such as mistyped, misspelled and/or abbreviated words. This paper describes a semi-automated approach to free-text normalisation based on a combination of commonly used techniques and local expertise of oncological nurses. In our approach particular emphasis is given to the effectiveness of the vocabulary creation process through an interactive software application. When local knowledge is successfully captured, normalisation of large data sets can be done very rapidly with a high accuracy rate achieved. Furthermore, the techniques for localised normalisation can have significant benefits to free-text parsing accuracy when data is aggregated from multiple sites (hospitals). A vocabulary created at one site can be reused at multiple other sites to speed up the vocabulary creation but at the same time facilitate the capture of local idiosyncrasies that otherwise would be lost. We believe that this research may lead to better understanding the free-text data which in turn may impact patient treatment and outcomes.
AB - When modelling and simulating healthcare related processes free-text data is often the only possible source of information. This data may contain vocabulary variations such as mistyped, misspelled and/or abbreviated words. This paper describes a semi-automated approach to free-text normalisation based on a combination of commonly used techniques and local expertise of oncological nurses. In our approach particular emphasis is given to the effectiveness of the vocabulary creation process through an interactive software application. When local knowledge is successfully captured, normalisation of large data sets can be done very rapidly with a high accuracy rate achieved. Furthermore, the techniques for localised normalisation can have significant benefits to free-text parsing accuracy when data is aggregated from multiple sites (hospitals). A vocabulary created at one site can be reused at multiple other sites to speed up the vocabulary creation but at the same time facilitate the capture of local idiosyncrasies that otherwise would be lost. We believe that this research may lead to better understanding the free-text data which in turn may impact patient treatment and outcomes.
KW - hospital information systems
KW - medical oncology
KW - medical records
KW - user-computer interface
UR - http://handle.uws.edu.au:8081/1959.7/565543
UR - https://hisa.site-ym.com/page/hic2014
U2 - 10.3233/978-1-61499-427-5-116
DO - 10.3233/978-1-61499-427-5-116
M3 - Conference Paper
SN - 9781614994275
SP - 116
EP - 122
BT - Investing in E-health: People, Knowledge and Technology for a Healthy Future: Selected Papers from the 22nd Australian National Health Informatics Conference (HIC 2014), 11-14 August 2014, Melbourne Convention and Exhibition Centre
PB - IOS Press
T2 - Australian National Health Informatics Conference
Y2 - 11 August 2014
ER -