Entropy-weighted medoid shift: An automated clustering algorithm for high-dimensional data

dc.contributor.authorKumar, Abhishek
dc.contributor.authorAjani, Oladayo S.
dc.contributor.authorDas, Swagatam
dc.contributor.authorMallipeddi, Rammohan
dc.date.accessioned2026-05-29T10:39:21Z
dc.date.available2026-05-29T10:39:21Z
dc.date.issued2025
dc.description.abstractUnveiling the intrinsic structure within high-dimensional data presents a significant challenge, particularly when clusters manifest themselves in lower-dimensional subspaces rather than in the full feature space. This complexity is prevalent in real-world datasets, such as text documents and images, which often contain numerous noisy or sparse features. Traditional clustering methods often overlook these latent subspace structures. This paper introduces a novel subspace-based clustering algorithm designed explicitly to address this challenge. Building upon the robust medoid shift framework, we integrate a dimensionality reduction scheme that dynamically projects data onto evolving subspaces determined through entropy-constrained optimization. This approach effectively filters irrelevant information and identifies underlying clusters, optimizing subspace representation while avoiding trivial solutions. Unlike existing methods, our algorithm ensures convergence without necessitating stopping criteria, thereby enabling efficient processing of large datasets. We validate the efficacy of our approach through extensive experiments on synthetic and real-world datasets, demonstrating substantial performance enhancements over state-of-the-art techniques. By explicitly uncovering the underlying subspace structures, our method opens new avenues for effective high-dimensional data clustering and offers valuable insights into complex data environments.
dc.description.firstpageart. no. 112347
dc.description.sourceWeb of Science
dc.description.volume169
dc.identifier.citationApplied Soft Computing. 2025, vol. 169, art. no. 112347.
dc.identifier.doi10.1016/j.asoc.2024.112347
dc.identifier.issn1568-4946
dc.identifier.issn1872-9681
dc.identifier.urihttp://hdl.handle.net/10084/158732
dc.identifier.wos001387829800001
dc.language.isoen
dc.publisherElsevier
dc.relation.ispartofseriesApplied Soft Computing
dc.relation.urihttps://doi.org/10.1016/j.asoc.2024.112347
dc.rights© 2024 Published by Elsevier B.V.
dc.subjectmedoid shift
dc.subjectdata clustering
dc.subjectunsupervised learning
dc.subjecthigh-dimensional data
dc.titleEntropy-weighted medoid shift: An automated clustering algorithm for high-dimensional data
dc.typearticle
dc.type.statusPeer-reviewed
dc.type.versionpublishedVersion

Files

License bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
license.txt
Size:
718 B
Format:
Item-specific license agreed upon to submission
Description: