Determining the need for multi-label classifiers by measuring unexplained covariance

Laurence A.F. Park, Jesse Read

Research output: Chapter in Book / Conference PaperChapterpeer-review

Abstract

Multi-label classifiers make use of associations between labels in multi-label data to increase the accuracy of prediction. Before using a multi-label classifier, the data should be analysed to identify if there are associations between labels. If sets of independent labels are found, the data can be split in to multiple smaller data sets for analysis. Unfortunately, each label is dependent on the set of observations and so measuring label dependence is futile. What we actually seek is independence after taking the observations into account. In this article, we examine the concepts of explained and unexplained label covariance for measuring label dependence. We explore the use of a Normal copula model for modelling the label dependence/covariance and show that it is not able to measure conditional covariance directly. We then propose a new statistical model that allows direct measurement of label covariance (both constant and conditional). The model is validated using generated data and it is also used to examine the label covariance in real world data, allowing us to build simpler multi-label models.

Original languageEnglish
Title of host publicationData Science: Foundations and Applications: 29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025, Sydney, Australia, June 10-13, 2025, Proceedings, Part VI
EditorsXintao Wu, Myra Spiliopoulou, Can Wang, Vipin Kumar, Longbing Cao, Xiangmin Zhou, Guansong Pang, Joao Gama
Place of PublicationSingapore
PublisherSpringer
Pages250-261
Number of pages12
ISBN (Electronic)9789819682959
ISBN (Print)9789819682942
DOIs
Publication statusPublished - 2025
EventPacific-Asia Conference on Knowledge Discovery and Data Mining - Sydney, Australia
Duration: 10 Jun 202513 Jun 2025
Conference number: 29th

Publication series

NameLecture Notes in Computer Science
Volume15875
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferencePacific-Asia Conference on Knowledge Discovery and Data Mining
Abbreviated titlePAKDD
Country/TerritoryAustralia
CitySydney
Period10/06/2513/06/25

Fingerprint

Dive into the research topics of 'Determining the need for multi-label classifiers by measuring unexplained covariance'. Together they form a unique fingerprint.

Cite this