Abstract
Automatic biomedical text summarization is maturing and can provide a solution for biomedical researchers to access the information they need efficiently. Biomedical summarization approaches often rely on the similarity measure to model the source document, mainly when they employ redundancy removal or graph structures. In this paper, we examine the impact of the similarity measure on the performance of the summarization methods. We model the document as a weighted graph. Various similarity measures are used to build different graphs based on biomedical concepts, semantic types and a combination of them. We next use the graphs to generate and evaluate the automatic summaries. The results suggest that the selection of the similarity measure has a substantial effect on the quality of the summaries (≈37% improvement in ROUGE-2 metric, and ≈29% in ROUGE-SU4). The results also demonstrate that exploiting both biomedical concepts and semantic types yields slightly better performance.
Original language | English |
---|---|
Title of host publication | Intelligent Systems Design and Applications: 17th International Conference on Intelligent Systems Design and Applications (ISDA 2017) Held in Delhi, India, December 14–16, 2017 |
Editors | Ajith Abraham, Pranab Kr. Muhuri, Azah Kamilah Muda, Niketa Gandhi |
Place of Publication | Switzerland |
Publisher | Springer |
Pages | 305-314 |
Number of pages | 10 |
ISBN (Electronic) | 9783319763484 |
ISBN (Print) | 9783319763477 |
DOIs | |
Publication status | Published - 2018 |