Abstract
Multi-label classifiers make use of associations between labels in multi-label data to increase the accuracy of prediction. Before using a multi-label classifier, the data should be analysed to identify if there are associations between labels. If sets of independent labels are found, the data can be split in to multiple smaller data sets for analysis. Unfortunately, each label is dependent on the set of observations and so measuring label dependence is futile. What we actually seek is independence after taking the observations into account. In this article, we examine the concepts of explained and unexplained label covariance for measuring label dependence. We explore the use of a Normal copula model for modelling the label dependence/covariance and show that it is not able to measure conditional covariance directly. We then propose a new statistical model that allows direct measurement of label covariance (both constant and conditional). The model is validated using generated data and it is also used to examine the label covariance in real world data, allowing us to build simpler multi-label models.
| Original language | English |
|---|---|
| Title of host publication | Data Science: Foundations and Applications: 29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025, Sydney, Australia, June 10-13, 2025, Proceedings, Part VI |
| Editors | Xintao Wu, Myra Spiliopoulou, Can Wang, Vipin Kumar, Longbing Cao, Xiangmin Zhou, Guansong Pang, Joao Gama |
| Place of Publication | Singapore |
| Publisher | Springer |
| Pages | 250-261 |
| Number of pages | 12 |
| ISBN (Electronic) | 9789819682959 |
| ISBN (Print) | 9789819682942 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | Pacific-Asia Conference on Knowledge Discovery and Data Mining - Sydney, Australia Duration: 10 Jun 2025 → 13 Jun 2025 Conference number: 29th |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 15875 |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | Pacific-Asia Conference on Knowledge Discovery and Data Mining |
|---|---|
| Abbreviated title | PAKDD |
| Country/Territory | Australia |
| City | Sydney |
| Period | 10/06/25 → 13/06/25 |