Visual assessment of clustering tendency for incomplete data

Laurence A. F. Park, James C. Bezdek, Christopher Leckie, Kotagiri Ramamohanarao, James Bailey, Marimuthu Palaniswami

Research output: Contribution to journalArticlepeer-review

11 Citations (Scopus)

Abstract

The iVAT (asiVAT) algorithms reorder symmetric (asymmetric) dissimilarity data so that an image of the data may reveal cluster substructure. Images formed from incomplete data don't offer a very rich interpretation of cluster structure. In this paper, we examine four methods for completing the input data with imputed values before imaging. We choose a best method using contaminated versions of the complete Iris data, for which the desired results are known. Then, we analyze two real world data sets from social networks that are incomplete using the best imputation method chosen in the juried trials with Iris: (i) Sampson's monastery data, an incomplete, asymmetric relation matrix; and (ii) the karate club data, comprising a symmetric similarity matrix that is about 86 percent incomplete.
Original languageEnglish
Pages (from-to)3409-3422
Number of pages14
JournalIEEE Transactions on Knowledge and Data Engineering
Volume28
Issue number12
DOIs
Publication statusPublished - 1 Dec 2016

Bibliographical note

Publisher Copyright:
© 1989-2012 IEEE.

Keywords

  • cluster heat maps
  • incomplete data
  • visualization

Fingerprint

Dive into the research topics of 'Visual assessment of clustering tendency for incomplete data'. Together they form a unique fingerprint.

Cite this