Abstract
We present a novel natural language processing (NLP) approach to deriving plain English descriptors for science cases otherwise restricted by obfuscating technical terminology. We address the limitations of common radio galaxy morphology classifications by applying this approach. We experimentally derive a set of semantic tags for the Radio Galaxy Zoo EMU (Evolutionary Map of the Universe) project and the wider astronomical community. We collect 8486 plain English annotations of radio galaxy morphology, from which we derive a taxonomy of tags. The tags are plain English. The result is an extensible framework, which is more flexible, more easily communicated, and more sensitive to rare feature combinations, which are indescribable using the current framework of radio astronomy classifications.
Original language | English |
---|---|
Pages (from-to) | 2584-2600 |
Number of pages | 17 |
Journal | Monthly Notices of the Royal Astronomical Society |
Volume | 522 |
Issue number | 2 |
Publication status | Published - 1 Jun 2023 |
Bibliographical note
Publisher Copyright:© 2023 The Author(s). Published by Oxford University Press on behalf of Royal Astronomical Society.
Keywords
- galaxies
- statistics-radio continuum
- statistical-catalogues-galaxies
- standards-methods