Phân loại nhạc Việt Nam theo thể loại dựa trên âm sắc và nhịp điệu

Phan Anh Cang; Nguyen Thi Kim Khanh; Phan Thuong Cang

doi:10.22144/ctu.jsi.2017.020

Phan Anh Cang ^* , Nguyen Thi Kim Khanh and Phan Thuong Cang

* Corresponding author (pacang@nomail.com)

Full Text: PDF

Received: 15 Sep 2017

Revised: 10 Oct 2017

Accepted: 20 Oct 2017

Published: 20 Oct 2017

Title: Vietnamese music classification by genre based on timbral texture and rhythmic content

DOI: 10.22144/ctu.jsi.2017.020

Views

336

Downloads

341

How to Cite

Cang, P. A., Khánh, N. T. K., & Cang, P. T. (2017). Vietnamese music classification by genre based on timbral texture and rhythmic content. CTU Journal of Science, (CĐ Công nghệ TT), 145-154. https://doi.org/10.22144/ctu.jsi.2017.020

Issue

No. Special issue on Information Technology (2017)

Section

Information Technology

Abstract

These days, digital music storage systems (DMSS) in Vietnam usually arrange pieces of music according to the composer’s name and the song’s title, whereas listeners need to search for songs based on genres and contents. This increases the demand for categorizing songs in accordance with genres in DMSS, which enables listeners to search for the most wanted music. However, with a large number of songs collected, the way to classify them for easy management becomes a challenge for all DMSS. Therefore, it is necessary to build up an automatic sorting system. This paper suggests a new method of extracting specific timbral disposition including timbral texture, rhythmic content by using wavelet convert. Thanks to such distinctive features, KNN and SVM methods are utilized to identify types of music files. This study is conducted on four types of music: Bolero, Cai luong (reformed theatre), Cheo (classical theatre) and Hat Boi (traditional opera). The findings show that the reliability is up to 93.75% and 94% corresponding to KNN and SVM on the timbral texture. Moreover, these suggested methods are simple, effective, speedy, and suitable for Vienamese music sorting systems today.

Keywords: Classification, digital music in Vietnam, extracting specific timbral disposition, wavelet transform

Tóm tắt

Hiện nay, các hệ thống lưu trữ nhạc số Việt nam thường sắp xếp các bản nhạc theo tên nhạc sĩ hoặc theo tên bài hát trong khi người nghe nhạc cũng cần tìm kiếm các bản nhạc theo thể loại và nội dung. Điều này đã nảy sinh nhu cầu phân loại nhạc theo thể loại trong các hệ thống lưu trữ nhạc số để cho phép người nghe nhạc có thể tìm kiếm bản nhạc theo yêu cầu. Tuy nhiên, với số lượng lớn nhạc số sưu tập được, việc phân loại chúng để dễ dàng quản lý trở thành một thách thức đối với các hệ thống lưu trữ nhạc số. Điều này cho thấy việc xây dựng một hệ thống phân loại nhạc tự động là rất cần thiết. Trong luận văn này, chúng tôi đề xuất phương pháp rút trích tập đặc trưng bố cục âm sắc của tín hiệu audio bao gồm kết cấu âm sắc (timbral texture) và nhịp điệu (rhythmic content) sử dụng phép biến đổi wavelet rời rạc. Dựa trên tập đặc trưng này, phương pháp KNN và SVM được sử dụng để nhận dạng thể loại của các tập tin nhạc. Nghiên cứu của chúng tôi thực hiện minh họa trên bốn thể loại Bolero, Cải lương, Chèo và Hát bội. Kết quả thực nghiệm cho thấy độ chính xác là 93,75 % và 94 % đối với phương pháp phân loại KNN và SVM tương ứng trên tập đặc trưng về bố cục âm sắc. Hơn nữa, phương pháp đề xuất này đơn giản, hiệu quả và có thời gian thực hiện nhanh phù hợp cho các hệ thống phân loại nhạc Việt hiện nay.

Từ khóa: Nhạc Việt, phân loại nhạc, rút trích đặc trưng tín hiệu audio, tín hiệu âm nhạc, wavelet rời rạc

References

Anan, Yoko, Hatano, Kohei, Bannai, Hideo, and Takeda, Masayuki, "Music Genre Classification Using Similarity Functions", Proceedings of the 12th International Society for Music Information Retrieval Conference (Miami (Florida), USA, pp. 693-698, 2011.

Đỗ Thanh Nghị, “Khai mỏ dữ liệu – Minh học bằng ngôn ngữ R”, Nhà xuất bản Đại học Cần Thơ, trang 7-52, Cần Thơ 2008.

G. Tzanetakis and P. Cook, “Musical genre classification of audio signals”, IEEE Trans. on speech and audio process, vol. 10, no. 5, pages 293–302, July 2002.

George Tzanetakis, Georg Essl and Perry Cook, “Automatic Musical Genre Classification of Audio Signals”, 2nd Annual International Symposium on Music Information Retrieval 2001 ( ISMIR 2001), pages 1-6, 2001.

Li, Tao and Tzanetakis, G. , "Factors in automatic musical genre classification of audio signals", Applications of Signal Processing to Audio and Acoustics, IEEE Workshop, pp. 143-146, 2003.

Logan, Beth. "Mel Frequency Cepstral Coefficients for Music Modeling", Proceedings of the 1st International Conference on Music Information Retrieval (Plymouth (Massachusetts), USA October 23, 2000.

Phạm Thị Hòa và Ngô Thị Nam, “Giáo dục âm nhạc”, Tập 1-Nhạc lý cơ bản – xướng âm, Nhà xuất bản Đại học Sư phạm, trang 7 -8, ĐH 2006.

Phạm Thị Hòa, “Giáo dục âm nhạc”, Tập 2- Phương pháp giáo dục âm nhạc, Nhà xuất bản Đại học Sư phạm, trang 7 -9, ĐH 2007.

R. Tao, Z. Li, Y. Ji, and E. Bakker, “Music genre classification using temporal information and support vector machine”, Proceedings of the Sixteenth annual conference of the Advanced School for Computing and Imaging (ASCI 2010), pages 1-7, 2010.

Rini Wongso and Diaz D. Santika, “Automatic music genre classification using dual tree complex wavelet transform and support vector machine”, in Journal of Theoretical and Applied Information Technology, Vol. 63 No.1, pages 1-8, May 2014.

Tzanetakis, George, Essl, Georg, and Cook, Perry, "Automatic Musical Genre Classification of Audio Signals", Proceedings of the 2nd Annual International Symposium on Music Information Retrieval (Bloomington (Indiana), USA, pp. 205-210, 2001.

Y.M.D. Chathuranga and K.L. Jayaratne, “Automatic Music Genre Classification of Audio Signals with Machine Learning Approaches”, in GSTF International Journal on Computing (JoC), Vol. 3 No.2, pages 1-12, July 2013.

Article Sidebar

Abstract

Tóm tắt

Article Details

References