Abstract
The iVAT (asiVAT) algorithms reorder symmetric (asymmetric) dissimilarity data so that an image of the data may reveal cluster substructure. Images formed from incomplete data don't offer a very rich interpretation of cluster structure. In this paper, we examine four methods for completing the input data with imputed values before imaging. We choose a best method using contaminated versions of the complete Iris data, for which the desired results are known. Then, we analyze two real world data sets from social networks that are incomplete using the best imputation method chosen in the juried trials with Iris: (i) Sampson's monastery data, an incomplete, asymmetric relation matrix; and (ii) the karate club data, comprising a symmetric similarity matrix that is about 86 percent incomplete.
Original language | English |
---|---|
Pages (from-to) | 3409-3422 |
Number of pages | 14 |
Journal | IEEE Transactions on Knowledge and Data Engineering |
Volume | 28 |
Issue number | 12 |
DOIs | |
Publication status | Published - 1 Dec 2016 |
Bibliographical note
Publisher Copyright:© 1989-2012 IEEE.
Keywords
- cluster heat maps
- incomplete data
- visualization