Giải thuật ước lượng số cụm dữ liệu cải tiến cho tập dữ liệu lớn
Abstract
Tóm tắt
Article Details
Tài liệu tham khảo
Barioni, M. C. N., Razente, H., Marcelino, A. M. R., Traina, A. J. M. and Traina, C. (2014). Open Issues for Partitioning Clustering Methods: An Overview. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery. 4.3, (2014) : 161-177.
Fahad, A., Alshatri, N., Tari, Z., Alamri, A., Khalil, I., Zomaya, A. Y., Foufou, S. and Bouras, A. (2014). A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis. IEEE Trans. on Emerging Topics in Computing. 2.3: 267-279.
Kokol, P. (2015). Introduction To Data Mining and Knowledge Discovery. In: Encyclopedia of Complexity and Systems Science. Robert A. Meyers (editor). New York: Springer Science+Business Media, pp 1-3.
Kolesnikov, A., Trichina, E. and Kauranne, T. (2015). Estimating the Number of Clusters in a Numerical Data Set Via Quantization Error Modeling. Pattern Recognition. 48.3: 941-952.
Romero, C. and Ventura, S. (2013). Data Mining in Education. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery. 3.1: 12-27.
Sedgewick, R. and Wayne, K. (2011). Algorithms (4th Edition). Addison-Wesley Professional.
Jesus M., Julian L., and Salvador G. (2017). Exact fuzzy k-nearest neighbor classification for big datasets. Proceedings of 2017 IEEE International Conference on Fuzzy Systems
Shao, X., Pi, J. and Liu, L. (2013). A Method of Dynamically Determining the Number of Clusters and Cluster Centers. Proceedings of 2013 8th International conference on Computer Science Education (ICCSE):283-286.
Starczewski, A. and Krzyak, K. (2015). Performance Evaluation of the Silhouette Index. Artificial Intelligence and Soft Computing:49-58.
Stokes, K. (2014). Graph K-Anonymity through K-Means and as Modular Decomposition.
Texas Tech University (2015). Recommended Software and Hardware Configurations. 2015. https://www.depts.ttu.edu/ithelpcentral/configurations.php (ngày truy cập 17/8/2015).
Van Hieu, D. and Meesad, P. (2015). A Cell-MST-Based Method for Big Dataset Clustering on Limited Memory Computers. Proccedings of 2015 7th International Conference on Information Technology and Electrical Engineering. 632-637.
Van Hieu, D. and Meesad, P. (2016). Cell-RDOS: A Fast Outlier Detection Method for Big Datasets. International Jurnal of Advances in Soft Computing and Its Aplication. 8(3):1-15.
Yan, M. and Ye, K. (2007). Determining the Number of Clusters Using the Weighted Gap Statistic. Biometrics. 63.4, (2007) : 1031-1037.
Yu, H., Liu, Z. and Wang, G. (2014). An Automatic Method to Determine the Number of Clusters Using Decision-Theoretic Rough Set. International Journal of Approximate Reasoning. 55.1: 101-115.
Zhong, C., Malinen, M., Miao, D. and Frnti, P. (2015). A Fast Minimum Spanning Tree Algorithm Based on K-Means. Information Sciences. 295.0: 1-17.