Mining High Utility Time Interval Sequences Using MapReduce Approach: Multiple Utility Framework

SALETI, Sumalatha, TANGIRALA, Jaya Lakshmi and AHMAD, Mohd Wazih (2022). Mining High Utility Time Interval Sequences Using MapReduce Approach: Multiple Utility Framework. IEEE Access, 10, 123301-123315.

[img]
Preview
PDF
Gebreslassie-MiningHighUtility(VoR).pdf - Published Version
Creative Commons Attribution.

Download (1MB) | Preview
Official URL: http://dx.doi.org/10.1109/access.2022.3224217
Open Access URL: https://ieeexplore.ieee.org/document/9961173 (Published version)
Link to published version:: https://doi.org/10.1109/access.2022.3224217

Abstract

Mining high utility sequential patterns is observed to be a significant research in data mining. Several methods mine the sequential patterns while taking utility values into consideration. The patterns of this type can determine the order in which items were purchased, but not the time interval between them. The time interval among items is important for predicting the most useful real-world circumstances, including retail market basket data analysis, stock market fluctuations, DNA sequence analysis, and so on. There are a very few algorithms for mining sequential patterns those consider both the utility and time interval. However, they assume the same threshold for each item, maintaining the same unit profit. Moreover, with the rapid growth in data, the traditional algorithms cannot handle the big data and are not scalable. To handle this problem, we propose a distributed three phase MapReduce framework that considers multiple utilities and suitable for handling big data. The time constraints are pushed into the algorithm instead of pre-defined intervals. Also, the proposed upper bound minimizes the number of candidate patterns during the mining process. The approach has been tested and the experimental results show its efficiency in terms of run time, memory utilization, and scalability.

Item Type: Article
Uncontrolled Keywords: 08 Information and Computing Sciences; 09 Engineering; 10 Technology; 40 Engineering; 46 Information and computing sciences
Identification Number: https://doi.org/10.1109/access.2022.3224217
Page Range: 123301-123315
SWORD Depositor: Symplectic Elements
Depositing User: Symplectic Elements
Date Deposited: 01 Mar 2024 10:13
Last Modified: 01 Mar 2024 10:15
URI: https://shura.shu.ac.uk/id/eprint/33298

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics