Scalable and Efficient Approach for High Temporal Fuzzy Utility Pattern Mining
提出一种无需生成候选集的新数据结构与剪枝技术,用于高效挖掘考虑时间因素的高模糊效用模式,在真实与合成数据集上相比现有算法在运行时间、内存使用和可扩展性上表现更优。
Fuzzy utility (FU) pattern mining with an advantage in human reasoning has become one of the interesting topics in studies of knowledge discovery. The discovered information in FU pattern mining from real-life quantitative databases with item profits is suitable for interpreting data from a human perspective because it is not expressed using numerical values but linguistic terms which consist of natural languages. State-of-the-art approaches in this literature provide extended results by considering temporal factors, such as seasons, which can be influential in real-life situations. However, they still suffer from scalability issues because they are based on level-wise approaches which generate a number of candidates. In this article, we propose a scalable and efficient approach with a novel data structure for mining high temporal FU patterns without generating candidates. Efficient pruning techniques and algorithms are presented to improve the performance of the proposed approach. Performance experiments on both real and synthetic datasets show that the suggested algorithm has better performance than the state-of-the-art algorithms in terms of runtime, memory usage, and scalability.