A hierarchical set-enumeration tree enabling high occupancy item set mining and the use of an adaptive occupancy threshold

dc.contributor.authorTran, Thanh-Nam
dc.contributor.authorHoang, Vinh Truong
dc.contributor.authorTruong, Thanh-Cong
dc.contributor.authorVozňák, Miroslav
dc.date.accessioned2026-04-24T10:55:03Z
dc.date.available2026-04-24T10:55:03Z
dc.date.issued2025
dc.description.abstractThe highly efficient HEP algorithm is a useful tool for mining High Occupancy (HO) item sets. Occupancy is an important measure that describes the interestingness of frequent item sets. The current study examines the efficiency problems in mining HO item sets and proposes an improved HEP algorithm, named advanced HEP (A-HEP), based on set theory rules which eliminate a large number of redundant iterations. The study also proposes a novel adaptive-and-modified HEP (NAM-HEP) algorithm that uses HO Set-Enumeration (SE) trees to store HO item sets. The study proposes definitions for adaptive thresholds such as support threshold and occupancy threshold based on the attributes of the transaction database for efficient pruning of the HO-SE tree. Two pseudo-code blocks are presented in addition to a detailed description of the A-HEP and NAM-HEP algorithms and their advantages. Using the A-HEP and NAM-HEP algorithms, HO item sets are investigated from the practical transaction databases named mushroom and retail. The results indicate that the proposed A-HEP and NAM-HEP algorithms enhance mining performance and runtime benchmarks.
dc.description.issue2
dc.description.sourceWeb of Science
dc.description.volume55
dc.identifier.citationApplied Intelligence. 2025, vol. 55, issue 2, art. no. 205.
dc.identifier.doi10.1007/s10489-024-06166-7
dc.identifier.issn0924-669X
dc.identifier.issn1573-7497
dc.identifier.urihttp://hdl.handle.net/10084/158480
dc.identifier.wos001383333200008
dc.language.isoen
dc.publisherSpringer Nature
dc.relation.ispartofseriesApplied Intelligence
dc.relation.urihttps://link.springer.com/content/pdf/10.1007/s10489-024-06166-7.pdf?utm_source=clarivate&getft_integrator=clarivate
dc.rights© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024
dc.subjectsupport threshold
dc.subjectoccupancy threshold
dc.subjecthigh occupancy mining
dc.subjectpre-pruned high occupancy (HO) set-enumeration (SE) tree
dc.subjectutility mining
dc.titleA hierarchical set-enumeration tree enabling high occupancy item set mining and the use of an adaptive occupancy threshold
dc.type.statusPeer-reviewed
dc.type.versionpublishedVersion
local.files.count1
local.files.size1313937
local.has.filesyes

Files

Original bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
0924-669X-2025v55i2an205.pdf
Size:
1.25 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
license.txt
Size:
718 B
Format:
Item-specific license agreed upon to submission
Description: