An efficient algorithm to mine high average-utility itemsets

Lin, Jerry Chun-Wei

doi:10.1016/j.aei.2016.04.002

An efficient algorithm to mine high average-utility itemsets

dc.contributor.author	Lin, Jerry Chun-Wei
dc.contributor.author	Li, Ting
dc.contributor.author	Fournier-Viger, Philippe
dc.contributor.author	Hong, Tzung-Pei
dc.contributor.author	Zhan, Justin
dc.contributor.author	Vozňák, Miroslav
dc.date.accessioned	2016-07-12T06:28:39Z
dc.date.available	2016-07-12T06:28:39Z
dc.date.issued	2016
dc.description.abstract	With the ever increasing number of applications of data mining, high-utility itemset mining (HUIM) has become a critical issue in recent decades. In traditional HUIM, the utility of an itemset is defined as the sum of the utilities of its items, in transactions where it appears. An important problem with this definition is that it does not take itemset length into account. Because the utility of larger itemset is generally greater than the utility of smaller itemset, traditional HUIM algorithms tend to be biased toward finding a set of large itemsets. Thus, this definition is not a fair measurement of utility. To provide a better assessment of each itemset’s utility, the task of high average-utility itemset mining (HAUIM) was proposed. It introduces the average utility measure, which considers both the length of itemsets and their utilities, and is thus more appropriate in real-world situations. Several algorithms have been designed for this task. They can be generally categorized as either level-wise or pattern-growth approaches. Both of them require, however, the amount of computation to find the actual high average-utility itemsets (HAUIs). In this paper, we present an efficient average-utility (AU)-list structure to discover the HAUIs more efficiently. A depth-first search algorithm named HAUI-Miner is proposed to explore the search space without candidate generation, and an efficient pruning strategy is developed to reduce the search space and speed up the mining process. Extensive experiments are conducted to compare the performance of HAUI-Miner with the state-of-the-art HAUIM algorithms in terms of runtime, number of determining nodes, memory usage and scalability.	cs
dc.description.firstpage	233	cs
dc.description.issue	2	cs
dc.description.lastpage	243	cs
dc.description.source	Web of Science	cs
dc.description.volume	30	cs
dc.identifier.citation	Advanced Engineering Informatics. 2016, vol. 30, issue 2, p. 233-243.	cs
dc.identifier.doi	10.1016/j.aei.2016.04.002
dc.identifier.issn	1474-0346
dc.identifier.issn	1873-5320
dc.identifier.uri	http://hdl.handle.net/10084/111822
dc.identifier.wos	000376694600011
dc.language.iso	en	cs
dc.publisher	Elsevier	cs
dc.relation.ispartofseries	Advanced Engineering Informatics	cs
dc.relation.uri	http://dx.doi.org/10.1016/j.aei.2016.04.002	cs
dc.rights	© 2016 Elsevier Ltd. All rights reserved.	cs
dc.subject	high average-utility itemsets	cs
dc.subject	list structure	cs
dc.subject	data mining	cs
dc.subject	HAUIM	cs
dc.title	An efficient algorithm to mine high average-utility itemsets	cs
dc.type	article	cs
dc.type.status	Peer-reviewed	cs

Files

License bundle

Now showing 1 - 1 out of 1 results

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Publikační činnost VŠB-TUO ve Web of Science / Publications of VŠB-TUO in Web of Science
Publikační činnost Katedry telekomunikačních technologií / Publications of Department of Telecommunications (440)
Články z časopisů s impakt faktorem / Articles from Impact Factor Journals