Skip to main navigation Skip to search Skip to main content

Classification of kinetic-related injury in hospital triage data using NLP

  • Western Sydney University
  • Ingham Institute of Applied Medical Research
  • Liverpool Hospital

Research output: Chapter in Book / Conference PaperChapterpeer-review

Abstract

Triage notes, created at the start of a patient’s hospital visit, contain a wealth of information that can help medical staff and researchers understand Emergency Department patient epidemiology and the degree of time-dependent illness or injury. Unfortunately, applying modern Natural Language Processing and Machine Learning techniques to analyse triage data faces some challenges: Firstly, hospital data contains highly sensitive information that is subject to privacy regulation thus need to be analysed on site; Secondly, most hospitals and medical facilities lack the necessary hardware to fine-tune a Large Language Model (LLM), much less training one from scratch; Lastly, to identify the records of interest, expert inputs are needed to manually label the datasets, which can be time-consuming and costly. We present in this paper a pipeline that enables the classification of triage data using LLM and limited compute resources. We first fine-tuned a pre-trained LLM with a classifier using a small (2k) open sourced dataset on a GPU; and then further fine-tuned the model with a hospital specific dataset of 1000 samples on a CPU. We demonstrated that by carefully curating the datasets and leveraging existing models and open sourced data, we can successfully classify triage data with limited compute resources.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications: 21st International Conference, ADMA 2025, Kyoto, Japan, October 22-24, 2025, Proceedings, Part III
EditorsMasatoshi Yoshikawa, Xiaofeng Meng, Yang Cao, Chuan Xiao, Weitong Chen, Yanda Wang
Place of PublicationSingapore
PublisherSpringer
Pages209-216
Number of pages8
ISBN (Electronic)9789819534593
ISBN (Print)9789819534586
DOIs
Publication statusPublished - 2026
Event21st International Conference on Advanced Data Mining and Applications, ADMA 2025 - Kyoto, Japan
Duration: 22 Oct 202524 Oct 2025

Publication series

NameLecture Notes in Computer Science
Volume16199 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Conference on Advanced Data Mining and Applications, ADMA 2025
Country/TerritoryJapan
CityKyoto
Period22/10/2524/10/25

Keywords

  • Bio-Clinical BERT
  • Classification
  • Clinical Notes
  • Electronic Health Record
  • NLP
  • Triage Notes

Fingerprint

Dive into the research topics of 'Classification of kinetic-related injury in hospital triage data using NLP'. Together they form a unique fingerprint.

Cite this