Nguyễn Thị Thu Hiền * , Văn Thế Thành Quản Thành Thơ

* Tác giả liên hệ (nguyenthithuhien_toasoanctu@gmail.com)

Abstract

Mining the frequent item-set with time factor from transaction database requires much execcution time. Thus, this paper presents an approach of tree structure known as FS-Tree (Frequent Sequence Tree) to store event sequences with appearance time. From there, we propose an algorithm known as FS-Alg (Frequent Sequence Algorithm) to mine the  frequent sequence from FS-Tree. To illustrate the effectiveness of the algorithm, the paper compares it with the existing approach of TSET-Miner.
Keywords: Event, Event-based sequence, Frequent item-set

Tóm tắt

Khai thác tập phổ biến có yếu tố thời gian từ cơ sở dữ liệu giao dịch đòi hỏi nhiều thời gian thực thi. Vì vậy, bài báo tiến hành xây dựng cấu trúc cây FS-Tree (Frequent Sequence Tree) để lưu trữ các dãy sự kiện tổ hợp với thời điểm xuất hiện tương ứng. Từ đó, bài báo đề xuất thuật toán FS-Alg (Frequent Sequence Algorithm) để khai thác các dãy sự kiện phổ biến từ cây FS-Tree. Để minh họa tính hiệu quả, bài báo đánh giá thuật toán đề xuất so với thuật toán tương đồng là TSET-Miner.
Từ khóa: Sự kiện, dãy sự kiện, tập phổ biến

Article Details

Tài liệu tham khảo

Tung, A.K.H., H. Lu, J. Han and L. Feng, 2003. Efficient mining of inter-transaction association rules. IEEE Trans. Knowl. Data Eng., 15: 43-56.

Mkaouar M., Bouaziz R., Moalla M., 2011. Modelling temporal databases and temporal constraints, paper at the Third International Conference on Advances in Databases, Knowledge, and Data Applications.

Mohamed Mkaouar, Rafik Bouaziz, and Mohamed Moalla, 2011. Querying and manipulating temporal databases, International Journal of Database Management Systems, Vol.3, No.1.

H. Mannila, H. Toivonen, A.I. Verkamo, 1997. Discovery of frequent episodes in event sequences, Data Mining and Knowledge Discovery 1 (3): 259–289.

Chun-Sheng Wang, Anthony J.T. Lee, 2009. Mining inter-sequence patterns, Expert Systems with Applications 36: 8649–8658.

S. Nandagopal, V.P. Arunachalam, S. Karthik, 2004. A novel approach for mining inter-transaction itemsets, vol. 8, No.14: 1857 – 7881.

Francisco Guil, Roque Marín, 2012. A tree structure for event-based sequence mining, Knowledge-Based Systems 35: 186–200.

Park, J.S., M.S. Chen, P.S. Yu, 1995. An effective hash-based algorithm for mining association rules. Proceedings of the ACM-SIGMOD International Conference on Management of Data, (ICMD’ 95), ACM Press New York, NY, USA, pp: 175-186.

Lu, H.S., G. West and S. Venkatesh, 2005. An extended frequent pattern tree for intertransaction association rule mining. Curtin University of Technology, Perth, Western Australia.

M.J. Zaki, SPADE, 2001. An efficient algorithm for mining frequent sequences, Machine Learning 42 (1/2): 31–60.

K. Bouandas, A. Osmani, 2007. Mining association rules in temporal sequences, in: Proc. of the 2007 IEEE Symposium on Computational Intelligence and Data Mining (CIDM’07), IEEE Computer Society.

C.H. Lee, C.R. Lin, M.S. Chen, 2001. On mining general temporal association rules in a publication database, in: Proc. of the IEEE Int. Conf. on Data Mining (ICDM’01), IEEE Computer Society.

Chun-Sheng Wang, Ying-Ho Liu, Kuo-Chung Chu, 2013. Closed inter-sequence pattern mining, Journal of Systems and Software Volume 86, Issue 6: 1603–1612

Tarek F. Ghari, Hamed Nassar, Mohamed Taha, Ajith Abraham, 2010. An efficient algorithm for incremental mining of temporal association rules, Data & Knowledge Engineering, Volume 69, Issue 8: 800–815.

Wei-Wen Wu, Yu-Ting Lee, Ming-Lang Tseng, Yi-Hui Chiang, 2010. Data mining for exploring hidden patterns between KM and its performance, Knowledge-Based Systems 23: 397–401.

Nguyễn Thị Thu Hiền, Lê Hữu Hà, Văn Thế Thành. Khai thác dãy sự kiện phổ biến sử dụng cây Seq-Tree, Tạp chí Khoa học – Đại học Huế, Tập 106, Số 07, 2015, Tr. 109-120

Vincent S. Tseng, Cheng-Wei Wu, Bai-En Shie, and Philip S. Yu, 2010. UP-Growth: An Efficient Algorithm for High Utility Itemset Mining, KDD’10, Washington, DC, USA.

Ming-Yen Lin, Tzer-Fu Tu, Sue-Chen Hsueh, 2012. High utility pattern mining using the maximal itemset property and lexicographic tree structures, Information Sciences, No. of Pages 14, Model 3G.