APPLICATION OF THE N-GRAM MODEL TO THE KARAKALPAK LANGUAGE

Abstract

Most automatic speech recognition and text processing systems use statistical models called n-grams that specify the probability of occurrence for different sequences of words in a language. This article discusses the application of the n-gram model to the text in the Karakalpak language in order to analyze individual Karakalpak words or phrases and in which part of the sentence a given word occurs.

Article Info

Author(s) Norov A. M., Jorabekov T. K.

DOI

Keywordsautomatic speech analysis., formal grammar, hiding Markov models, Karakalpak language, n-gram, statistical model

DOWNLOAD