Dataset Ratio Experiment for Tajweed Law Recognition Using MFCC and CNN Features

Authors

  • Mas Muhammad Aqil Salim UPN Veteran Jawa Timur University
  • Ani Dijah Rahajoe UPN Veteran Jawa Timur University
  • Anggraini Puspita Sari UPN Veteran Jawa Timur University

DOI:

https://doi.org/10.26905/jeemecs.v8i2.15546

Keywords:

Audio Classification, Convolutional Neural Network, Mel-Frequency Cepstral Coefficient, Nun Sukun, Tajwid Detection

Abstract

This research aims to develop a tajwid classification system focusing on the detection of nun sukun and tanwin rules in Quranic recitation using the Convolutional Neural Network (CNN) and Mel-Frequency Cepstral Coefficient (MFCC) methods. The dataset used includes 1,344 audio samples collected from both direct recordings and YouTube observations. Audio preprocessing involved silence removal and noise filtering, followed by feature extraction using MFCC with 40 coefficients. These features were then classified into six categories: Idghom Bighunnah, Idghom Bilaghunnah, Idzhar Halqi, Ikhfa Haqiqi, Iqlab, and No Class. The CNN architecture implemented includes three convolutional layers with Batch Normalization and Leaky ReLU activation, optimized with a softmax classifier. Three different dataset split scenarios (80:10:10, 70:15:15, and 60:20:20) were evaluated to determine the best performance. The highest accuracy of 89% was achieved using the 80:10:10 data split, with macro-average F1-score reaching 0.87. Results show that CNN combined with MFCC provides reliable classification of tajwid rules, particularly in identifying distinctive acoustic patterns. The study confirms that data partitioning significantly influences model performance and highlights the importance of optimal preprocessing and architecture selection in deep learning-based speech recognition tasks.

Downloads

Download data is not yet available.

Author Biographies

Mas Muhammad Aqil Salim, UPN Veteran Jawa Timur University

Informatics, Computer Science Faculty, UPN Veteran Jawa Timur University, Surabaya, Indonesia

Ani Dijah Rahajoe, UPN Veteran Jawa Timur University

Informatics, Computer Science Faculty, UPN Veteran Jawa Timur University, Surabaya, Indonesia

Anggraini Puspita Sari, UPN Veteran Jawa Timur University

Informatics, Computer Science Faculty, UPN Veteran Jawa Timur University, Surabaya, Indonesia

References

[1] N. Rihladhatul and A. Sayoga, “AL-AFKAR : Journal for Islamic Studies The Living Qur ’ an : Ethnographic Study and Understanding of Al- Isra ’: 9 in Urban Communities,” vol. 7, no. 4, pp. 334–346, 2024, doi: 10.31943/afkarjournal.v7i4.1648.Abstract.

[2] H. Fajrussalam, W. Widaningsih, N. Nurdiansyah, L. N. Istighna, C. Nurjanah, and C. R. A. Widjaya, “Implementasi Program Maghrib Mengaji sebagai Sarana Pembelajaran Al-Quran pada Anak Usia Dini: Studi Deskriptif Kualitatif di TPQ Al Ittihad Kab. Cirebon,” At-Ta’lim Media Inf. Pendidik. Islam, vol. 22, no. 2, p. 259, 2023, doi: 10.29300/attalim.v22i2.2546.

[3] R. D. Arifin, “No Title,” Dianisa. [Online]. Available: https://dianisa.com/nun-sukun-dan-tanwin/

[4] M. H. Ashadiqi, A. Erlansari, and F. Farady, “Aplikasi Pembelajaran Tajwid Berbasis Android,” J. Rekursif, vol. 8, no. 1, pp. 59–70, 2020.

[5] A. Wirdiani, S. N. Machetho, I. K. G. D. Putra, M. Sudarma, R. S. Hartati, and H. A. Ferdian, “Improvement Model for Speaker Recognition using MFCC-CNN and Online Triplet Mining,” Int. J. Adv. Sci. Eng. Inf. Technol., vol. 14, no. 2, pp. 420–427, 2024, doi: 10.18517/ijaseit.14.2.19396.

[6] L. Syafa’ah, R. Prasetyono, and H. Hariyady, “Enhancing Qur’anic Recitation Experience with CNN and MFCC Features for Emotion Identification,” Kinet. Game Technol. Inf. Syst. Comput. Network, Comput. Electron. Control, vol. 4, no. 2, 2024, doi: 10.22219/kinetik.v9i2.2007.

[7] J. J. M. Montemayor, G. P. G. Escuadra, M. A. G. Nambatac, and D. T. Tenoria, “Detecting Rice Weevils in Stored Grains using MFCC and CNN,” Procedia Comput. Sci., vol. 234, no. 2023, pp. 1681–1688, 2024, doi: 10.1016/j.procs.2024.03.173.

[8] Z. K. Abdul and A. K. Al-Talabani, “Mel Frequency Cepstral Coefficient and its Applications: A Review,” IEEE Access, vol. 10, no. October, pp. 122136–122158, 2022, doi: 10.1109/ACCESS.2022.3223444.

[9] F. Wang and X. Shen, “Research on Speech Emotion Recognition Based on Teager Energy Operator Coefficients and Inverted MFCC Feature Fusion,” Electron., vol. 12, no. 17, 2023, doi: 10.3390/electronics12173599.

[10] S. Kulkarni, “Impact of Various Data Splitting Ratios on the Performance of Machine Learning Models in the Classification of Lung Cancer,” in Proceedings of the Second International Conference on Emerging Trends in Engineering (ICETE 2023), Springer Nature, 2023, p. 96.

[11] Y. Dokuz and Z. Tufekci, “Mini-batch sample selection strategies for deep learning based speech recognition,” Appl. Acoust., vol. 171, p. 107573, 2021, doi: 10.1016/j.apacoust.2020.107573.

[12] W. Mu, B. Yin, X. Huang, J. Xu, and Z. Du, “Environmental sound classification using temporal-frequency attention based convolutional neural network,” Sci. Rep., vol. 11, no. 1, pp. 1–14, 2021, doi: 10.1038/s41598-021-01045-4.

[13] A. P. Sari, H. Suzuki, T. Kitajima, T. Yasuno, D. A. Prasetya, and N. Nachrowie, “Prediction model of wind speed and direction using convolutional neural network - Long short term memory,” PECon 2020 - 2020 IEEE Int. Conf. Power Energy, no. 1, pp. 356–361, 2020, doi: 10.1109/PECon48942.2020.9314474.

[14] Y. Furusho and K. Ikeda, “Theoretical analysis of skip connections and batch normalization from generalization and optimization perspectives,” APSIPA Trans. Signal Inf. Process., vol. 9, 2020, doi: 10.1017/ATSIP.2020.7.

[15] Y. Liu, X. Chen, M. Cheng, C. Hsieh, Y. You, and L. Angeles, “C ONCURRENT A DVERSARIAL L EARNING FOR L ARGE -,” pp. 1–17, 2021.

[16] S. H. Golzar, H. Bagherpour, and J. A. Parian, “Research Article A New Method to Optimize Deep CNN Model for Classification of Regular Cucumber Based on Global Average Pooling,” vol. 2024, no. Dl, 2024, doi: 10.1155/2024/5818803.

Downloads

Published

2025-08-26

Issue

Section

Articles