⬆️ ⬇️

Speech recognition. Part 2. Typical speech recognition system structure

Speech recognition is a multi-level pattern recognition problem in which acoustic signals are analyzed and structured into a hierarchy of structural elements (for example, phonemes), words, phrases, and sentences. Each level of the hierarchy may provide for some time constants, for example, possible word sequences or known types of pronunciation that can reduce the number of recognition errors at a lower level. The more we know (or assume) a priori information about the input signal, the better we can process and recognize it. image The structure of the standard speech recognition system is shown in the figure. Consider the basic elements of this system.UPD: Transferred to "Artificial Intelligence." If there is interest, I will continue to publish in it.


')

Source: https://habr.com/ru/post/64594/



All Articles