Abstract
The major steps of an overall clustering task are preclustering, clustering, and postclustering. Preclustering involves data preparation, including feature extraction, selection, transformation normalization, cleaning, and data reduction, whereas postclustering involves cluster usability encompassing cluster validity, reasoning, interpretation, and visualization. This article focuses on the second step, “clustering,” which is further divided into two key modules: clustering criterion and clustering method. This clustering step takes a set X={x1, x2, …, xn} of preprocessed points (synonymously elements, objects, instances, cases or patterns) as an input and produces a clustered result as an output (either partitioning or hierarchical) for postclustering. It first requires a clustering criterion to be built and needs a clustering algorithm to optimize the clustering criterion.
| Original language | English |
|---|---|
| Title of host publication | Comprehensive Chemometrics |
| Subtitle of host publication | Chemical and Biochemical Data Analysis, Second Edition: Four Volume Set |
| Publisher | Elsevier |
| Pages | 531-564 |
| Number of pages | 34 |
| Volume | 2 |
| ISBN (Electronic) | 9780444641656 |
| DOIs | |
| Publication status | Published - 1 Jan 2020 |
Bibliographical note
Publisher Copyright:© 2020 Elsevier B.V. All rights reserved
Keywords
- k-Means clustering
- k-Medoid clustering
- Agglomerative clustering
- Clustering
- Constrained clustering
- Divisive clustering
- Hierarchical clustering
- Hybrid clustering
- Partitioning clustering
- Unsupervised learning