High-dimensional text clustering by dimensionality reduction and improved density peak
| dc.contributor.author | Sun, Yujia | |
| dc.contributor.author | Platoš, Jan | |
| dc.date.accessioned | 2021-02-04T11:35:37Z | |
| dc.date.available | 2021-02-04T11:35:37Z | |
| dc.date.issued | 2020 | |
| dc.description.abstract | This study focuses on high-dimensional text data clustering, given the inability of K-means to process high-dimensional data and the need to specify the number of clusters and randomly select the initial centers. We propose a Stacked-Random Projection dimensionality reduction framework and an enhanced K-means algorithm DPC-K-means based on the improved density peaks algorithm. The improved density peaks algorithm determines the number of clusters and the initial clustering centers of K-means. Our proposed algorithm is validated using seven text datasets. Experimental results show that this algorithm is suitable for clustering of text data by correcting the defects of K-means. | cs |
| dc.description.firstpage | art. no. 8881112 | cs |
| dc.description.source | Web of Science | cs |
| dc.description.volume | 2020 | cs |
| dc.identifier.citation | Wireless Communications and Mobile Computing. 2020, vol. 2020, art. no. 8881112. | cs |
| dc.identifier.doi | 10.1155/2020/8881112 | |
| dc.identifier.issn | 1530-8669 | |
| dc.identifier.issn | 1530-8677 | |
| dc.identifier.uri | http://hdl.handle.net/10084/142640 | |
| dc.identifier.wos | 000594623600001 | |
| dc.language.iso | en | cs |
| dc.publisher | Hindawi and Wiley | cs |
| dc.relation.ispartofseries | Wireless Communications and Mobile Computing | cs |
| dc.relation.uri | http://doi.org/10.1155/2020/8881112 | cs |
| dc.rights | Copyright © 2020 Yujia Sun and Jan Platoš. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. | cs |
| dc.rights.access | openAccess | cs |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | cs |
| dc.title | High-dimensional text clustering by dimensionality reduction and improved density peak | cs |
| dc.type | article | cs |
| dc.type.status | Peer-reviewed | cs |
| dc.type.version | publishedVersion | cs |
Files
Original bundle
1 - 1 out of 1 results
Loading...
- Name:
- 1530-8669-2020v2020an8881112.pdf
- Size:
- 1.27 MB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 out of 1 results
Loading...
- Name:
- license.txt
- Size:
- 718 B
- Format:
- Item-specific license agreed upon to submission
- Description: