Efficient constrained k-center clustering with background knowledge

Longkun Guo, Chaoqi Jia, Kewen Liao, Zhigang Lu, Minhui Xue

Research output: Chapter in Book / Conference PaperConference Paperpeer-review

Abstract

Center-based clustering has attracted significant research interest from both theory and practice. In many practical applications, input data often contain background knowledge that can be used to improve clustering results. In this work, we build on widely adopted k-center clustering and model its input background knowledge as must-link (ML) and cannot-link (CL) constraint sets. However, most clustering problems including k-center are inherently NP-hard, while the more complex constrained variants are known to suffer severer approximation and computation barriers that significantly limit their applicability. By employing a suite of techniques including reverse dominating sets, linear programming (LP) integral polyhedron, and LP duality, we arrive at the first efficient approximation algorithm for constrained k-center with the best possible ratio of 2. We also construct competitive baseline algorithms and empirically evaluate our approximation algorithm against them on a variety of real datasets. The results validate our theoretical findings and demonstrate the great advantages of our algorithm in terms of clustering cost, clustering quality, and running time.

Original languageEnglish
Title of host publicationProceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, February 20-27, 2024
PublisherAssociation for the Advancement of Artificial Intelligence
Pages20709-20717
Number of pages9
ISBN (Print)9781577358879
DOIs
Publication statusPublished - 25 Mar 2024
EventAAAI Conference on Artificial Intelligence -
Duration: 1 Jan 2024 → …

Conference

ConferenceAAAI Conference on Artificial Intelligence
Period1/01/24 → …

Fingerprint

Dive into the research topics of 'Efficient constrained k-center clustering with background knowledge'. Together they form a unique fingerprint.

Cite this