Geravanchizadeh M, Mobasheri P, Jamshidi Avanaki H. Classification of Iranian Traditional Music Dastgahs Using Features Based on Pitch Frequency. JSDP 2022; 19 (3) : 8 URL: http://jsdp.rcisp.ac.ir/article-1-1155-en.html
The Iranian traditional music is composed of seven majors Dastgahs: Chahargah, Homayoun, Mahour, Segah, Shour, Nava, and Rast-Panjgah. In this paper, a new algorithm for the classification of the Iranian traditional music Dastgahs based on pitch frequency is proposed. In this algorithm, the features of Lagrange coefficients of pitch logarithm (LCPL), Fuzzy similarity sets type 2 (FSST2), and their combination are used as the representation of music signals which are fed into the multi-class support vector machine (MSVM) as the classifier. The features of LCPL and FSST2 are obtained by applying some modifications on the pitch frequency of the desired music. To compute LCPL, first, the values of pitch frequency are extracted by the PRAAT algorithm. Then, after the applying a logarithmic operation, the tracks of pitch frequency are partitioned into smaller segments. The method of feature extraction is based on detecting the trough or valley points of the pitch tracks. In the following, the coordinates of trough points (i.e., the index of pitch frequency and the logarithmic value of the frequency) are considered as each segment boundaries. In the next step, the track between the two boundaries of each segment is approximated by a 6th order Lagrange polynomial and the computed polynomial coefficients are considered as a 6-dimensional feature vector. The first step in extracting the FSST2 feature is to compute the pitch frequencies of the input signal by the PRAAT algorithm. The second step involves the classification of music notes. Then, the subtractive clustering method is used to eliminate the incorrectly estimated pitch frequencies of the previous step. Next, the process of folding notes (i.e., transferring the extracted pitch frequencies into the reference octave band of 220-440 Hz) is performed followed by translating the frequency points to the cents with respect to 220 Hz. After folding notes in one octave, the Mahalanobis distance is applied to recognize which point on the reference octave corresponds to each musical note. These same procedures are conducted for the information pattern (theoretical data) of each Dastgah. In the final step, the folded frequency points of the unknown input signal and the information pattern of all Dastgahs are transferred to the Fuzzy logicType-2 domain and compared to determine a similarity measure which is considered as the extracted feature. The dataset used in the proposed classification algorithm contains the excerpts from solo performances with Tar played by Alizadeh, the well-known Iranian music master. The performances of the baselines and proposed classification algorithms are evaluated by the measures of Accuracy, Recall, Precision, F-measure, and MCC. The results show that the proposed algorithm has a better performance as compared with the baseline methods in terms of different classification criteria.