Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia

Vincent Elbert Budiman; Andreas Widjaja

doi:10.28932/jutisi.v6i2.2684

PDF

Published: Aug 10, 2020

DOI: https://doi.org/10.28932/jutisi.v6i2.2684

Vincent Elbert Budiman

Maranatha Christian University

Andreas Widjaja

Maranatha Christian University

Abstract

Here a development of an Acoustic and Language Model is presented. Low Word Error Rate is an early good sign of a good Language and Acoustic Model. Although there are still parameters other than Words Error Rate, our work focused on building Bahasa Indonesia with approximately 2000 common words and achieved the minimum threshold of 25% Word Error Rate. There were several experiments consist of different cases, training data, and testing data with Word Error Rate and Testing Ratio as the main comparison. The language and acoustic model were built using Sphinx4 from Carnegie Mellon University using Hidden Markov Model for the acoustic model and ARPA Model for the language model. The models configurations, which are Beam Width and Force Alignment, directly correlates with Word Error Rate. The configurations were set to 1e-80 for Beam Width and 1e-60 for Force Alignment to prevent underfitting or overfitting of the acoustic model. The goals of this research are to build continuous speech recognition in Bahasa Indonesia which has low Word Error Rate and to determine the optimum numbers of training and testing data which minimize the Word Error Rate.

Downloads

Download data is not yet available.

How to Cite

[1]

V. E. Budiman and A. Widjaja, “Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia”, JuTISI, vol. 6, no. 2, Aug. 2020.

Issue

Vol. 6 No. 2 (2020): JuTISI

Section

Articles

This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial used, distribution and reproduction in any medium.

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Most read articles by the same author(s)

Ariel Elbert Budiman, Andreas Widjaja, Analisis Pengaruh Teks Preprocessing Terhadap Deteksi Plagiarisme Pada Dokumen Tugas Akhir , Jurnal Teknik Informatika dan Sistem Informasi: Vol. 6 No. 3 (2020): JuTISI
Joseph Sanjaya, Erick Renata, Vincent Elbert Budiman, Francis Anderson, Mewati Ayub, Prediksi Kelalaian Pinjaman Bank Menggunakan Random Forest dan Adaptive Boosting , Jurnal Teknik Informatika dan Sistem Informasi: Vol. 6 No. 1 (2020): JuTISI
Kristiawan Kristiawan, Andreas Widjaja, Perbandingan Algoritma Machine Learning dalam Menilai Sebuah Lokasi Toko Ritel , Jurnal Teknik Informatika dan Sistem Informasi: Vol. 7 No. 1 (2021): JuTISI
Feliks Victor Parningotan Samosir, Loudry Palmarums Mustamu, Erik Dwi Anggara, Albertus Indarko Wiyogo, Andreas Widjaja, Exploratory Data Analysis terhadap Kepadatan Penumpang Kereta Rel Listrik , Jurnal Teknik Informatika dan Sistem Informasi: Vol. 7 No. 2 (2021): JuTISI
Erik Dwi Anggara, Andreas Widjaja, Bernard Renaldy Suteja, Prediksi Kinerja Pegawai sebagai Rekomendasi Kenaikan Golongan dengan Metode Decision Tree dan Regresi Logistik , Jurnal Teknik Informatika dan Sistem Informasi: Vol. 8 No. 1 (2022): JuTISI
Kristiawan Kristiawan, Deon Diamanta Somali, Try Atmaja Linggan jaya, Andreas Widjaja, Deteksi Buah Menggunakan Supervised Learning dan Ekstraksi Fitur untuk Pemeriksa Harga , Jurnal Teknik Informatika dan Sistem Informasi: Vol. 6 No. 3 (2020): JuTISI
Sendy Ferdian, Tjatur Kandaga, Andreas Widjaja, Hapnes Toba, Ronaldo Joshua, Julio Narabel, Continuous Integration and Continuous Delivery Platform Development of Software Engineering and Software Project Management in Higher Education , Jurnal Teknik Informatika dan Sistem Informasi: Vol. 7 No. 1 (2021): JuTISI
Joseph Sanjaya, Erick Renata, Vincent Elbert Budiman, Francis Anderson, Mewati Ayub, Integrasi Micro-Apps Individual menjadi One-Stop Services Maranatha Application Suite , Jurnal Teknik Informatika dan Sistem Informasi: Vol. 5 No. 3 (2019): JuTISI
Yosef Ariyanto Irawan, Andreas Widjaja, Pembangkitan Pola Batik dengan Menggunakan Neural Transfer Style dengan Penggunaan Cost Warna , Jurnal Teknik Informatika dan Sistem Informasi: Vol. 6 No. 2 (2020): JuTISI
Oktavianus Yopi Wardana, Mewati Ayub, Andreas Widjaja , Accuracy’s Comparison of Machine Learning Models for Predicting State College Admission Selection , Jurnal Teknik Informatika dan Sistem Informasi: Vol. 9 No. 1 (2023): JuTISI

1 2 > >>

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details

Most read articles by the same author(s)