Discovering periodic itemsets using novel periodicity measures

Loading...
Thumbnail Image

Downloads

3

Date issued

Journal Title

Journal ISSN

Volume Title

Publisher

VŠB - Technická univerzita Ostrava

Location

Signature

License

Abstract

Discovering periodic patterns in a customer transaction database is the task of identifying item-sets (sets of items or values) that periodically appear in a sequence of transactions. Numerous methods can identify patterns exhibiting a periodic behavior. Nonetheless, a problem of these traditional approaches is that the concept of periodic behavior is defined very strictly. Indeed, a pattern is considered to be periodic if the amount of time or number of transactions between all pairs of its consecutive occurrences is less than a fixed maxPer (maximum periodicity) threshold. As a result, a pattern can be eliminated by a traditional algorithm for mining periodic patterns even if all of its periods but one respect the maxPer constraint. Consequently, many patterns that are almost always periodic are not presented to the user. But these patterns could be considered as interesting as they generally appear periodically. To address this issue, this paper suggests to use three measures to identify periodic patterns. These measures are named average, maximum and minimum periodicity, respectively. They are each designed to evaluate a different aspect of the periodic behavior of patterns. By using them together in a novel algorithm called Periodic Frequent Pattern Miner, more flexibility is given to users to select patterns meeting specific periodic requirements. The designed algorithm has been evaluated on several datasets. Results show that the proposed solution is scalable, efficient, and can identify a small sets of patterns compared to the Eclat algorithm for mining all frequent patterns in a database.

Description

Subject(s)

itemset mining, periodic pattern, periodicity, average periodicity

Citation

Advances in Electrical and Electronic Engineering. 2019, vol. 17, issue 1, p. 33-44.