⬆️ ⬇️

HD-Voice for cellular communication - Khariton, Dmitry, Vladimir, Oleg, Irina, Glory!





We enabled HD-Voice on the Beeline network in Moscow. Below are sound recordings before and after, as well as technical details.



The HD-voice mode is based on AMR-WB technology (Adaptive Multi Rate Rate Wadeband) - a broadband adaptive coding with variable speed, also this codec is known as the G.722.22 standard. In theory, a person can hear sounds in the range of 20 Hz - 20 kHz, but in practice it is enough to transmit sound in a much narrower range. When encoding speech in AMR-WB, a frequency band of 50Hz-7kHz is used, which is absolutely enough for full voice transmission. For example, the AMR codec uses a range of 300-3 kHz. The method can be used in 2G and 3G networks.

')

Simply put, now the human voice will sound more natural , because it will cease to “squeeze” into a rather narrow band, the need of which was once dictated by the limitations of infrastructure. In addition, with HD coding, ambient noise has less of an effect on speech intelligibility.



How did we get to HD-Voice, and how does it work?



A person is able to hear a certain band of sound frequencies, usually it is assumed to be equal to 20Hz-20kHz. Transmitting the entire spectrum over wireless communication channels without compression is difficult, so the voice without cutting the band is transmitted only in wired systems. For speech compression, voice codecs or vocoders are used. Their side effect is the distortion of the transmitted voice by limiting the frequency band of the voice message.



The very first voice codec used in mobile networks was “Full Rate” - FR. Simultaneously with the FR, the Half Rate codec was introduced, the only purpose of which is to increase the number of simultaneously served subscribers, and, as its name implies, it occupies half of the FR codec band in the radio channel. However, the voice quality after transcoding to FR was not very high, and for some countries (for example, Arabic, where speech speed is high and many high-frequency sounds), it is completely unacceptable. Therefore, the FR codec has been reworked, and the “Enhanced Full Rate” appeared, providing significantly better voice quality, with a lower bitrate being created.



With the increasing penetration of mobile networks, the requirements for the quality of voice services began to grow, and a problem appeared with the deterioration of voice quality, with a low radio signal level from the base station. To solve this problem, a new codec was developed - “Adaptive Multi Rate”, which used the same frequency range for analysis and compression, but the algorithm was implemented in such a way that the codec bitrate dynamically changed, depending on the quality of the received signal. Due to this, an opportunity to provide excellent voice quality under good radio conditions, and for bad ones, to keep the conversation even where FR / EFR codecs could not work at all.



All 3G networks work only with the use of the AMR codec or its subsequent implementations, for example AMR-WB. But since the network remains a large number of subscribers using phones that support only work with EFR / HR codecs, the 2G network continues to work with all codecs (FR / EFR / HR / AMR FR / AMR HR) at once, providing voice to any phone.



Currently, voice quality requirements in mobile networks continue to grow, and the time has come to use AMR-WB technology. All the codecs described above use the frequency range up to 200Hz-3.4kHz for compression, while AMR-WB uses the voice message band at 50Hz-7kHz for coding. AMR-WB allows you to transmit twice the bandwidth, which gives an increase in the quality and saturation of the high and low frequencies of the transmitted voice.



Some characteristics of codecs are listed in the table:



Codec



Standard



Year of creation



Compressible frequency range



Generated bitrate



Full Rate - FR



GSM 06.10



1990



200-3400 Hz



13 kbit / s



Half Rate - HR



GSM 06.20



1990



200-3400 Hz



5.6 kbit / s



Enhanced Full Rate - EFR



GSM 06.60



1995



200-3400 Hz



12.2 kbit / s



Adaptive Multi Rate - AMR



3GPP TS 26.071



1999



200-3400 Hz



4.75 - 12.20 kbit / s



Adaptive Multi Rate - WideBand - AMR-WB



3GPP TS 26.190



2001



50-7000 Hz



6.60 - 23.85 kbit / s



Adaptive Multi Rate-WideBand + - AMR-WB +



3GPP TS 26.290



2004



50-7000 Hz



6 - 36 kbit / s (mono)

7 - 48 kbit / s (stereo)







Hint for curious programmers: a sample of the implementation of the AMR codec in the C language is available in the 3GPP TS 26.073 standard (you can easily find it, download it and try to implement it).



In the diagram below you can see a comparison of voice quality, according to the MOS (Mean Opinion Score) scale, depending on the codec used. The MOS scale is a subjective assessment of voice quality from 0 to 5, where 0 is a total lack of audibility, and 5 is live speech. G.711 codec is used in wired telephony, that is, this is the quality that you should hear from your landline phone (if it is, of course, not transferred to the IP channel, but this is a completely different topic).







Update: The comments rightly indicated that the average MOS score for the G.711 codec is about 4.1, this picture distorts reality a little.

As can be seen from the diagram (and taking into account the information from the Update), the AMR-WB codec provides, under good radio conditions, the voice quality is similar to the one we are used to when using a wired home phone.



For example, two fragments of a musical composition, which were compressed by AMR and AMR-WB codecs, you can compare the sound quality yourself and visually assess the difference in the sound picture created by different codecs.



AMR coded top track



Bottom track encoded in AMR-WB



Tests



We are not the first to launch this technology in Russia, so some developments have already been made. Our first part of the network with HD-Voice was built within the Moscow Ring Road, on the equipment of Ericsson. Activating AMR-WB took a long time: first, it was tested on one controller (with numerous drive tests to measure speech quality). And even after everything was tested on one controller, the subsequent activation proceeded in stages, controller by controller. The whole activation process took almost half a year.



Here are the sample samples from the final tests , which were evaluated by hardware using accurate measurements at different times.



Does it already work?



Yes, now the technology is implemented in the 3G network for Moscow. In order to use the mode, you do not need to make additional settings, it is activated itself, if both subscribers are in the 3G network and both their phones support the work of the codec.







The diagram shows the conditions that are necessary for the operation of AMR-WB. Now, as mentioned above, the codec is running on the 3G network, the next stage is to ensure the codec works between the 2G and 3G networks, then between the networks and between the operators.



What devices are supported?



List. Carefully, under the spoiler a lot of letters.
Alcatel One Touch 903

Alcatel One Touch 916 / 916D

Alcatel One Touch 918

Alcatel One Touch 983

Alcatel One Touch 985 / 985D

Alcatel One Touch 991 / 991D / 991T

Alcatel One Touch 992D

Alcatel One Touch 993 / 993D

Alcatel One Touch 995/996

Alcatel One Touch Idol

Alcatel One Touch Idol Ultra

Alcatel One Touch Scribe Easy

Alcatel One Touch Scribe HD

Alcatel One Touch Star

Alcatel One Touch Tribe

Alcatel One X'POP

Apple iPhone 5

Apple iPhone 5S

Apple iPhone 5C

Blackberry q10

Blackberry q5

BlackBerry Torch 9810

Blackberry z10

Blackberry z30

HTC Desire 500

HTC ChaCha

HTC Desire C

HTC Desire HD

HTC Desire S

HTC Desire X

HTC Desire Z

HTC EVO 3D

HTC Incredible

HTC One

HTC One mini

HTC One S

HTC One SV

HTC One V

HTC One X

HTC One X +

HTC One XL

HTC One (M8)

HTC Radar

HTC Raider

HTC Rhyme

HTC Sensation

HTC Sensation XE

HTC Sensation XL

HTC Titan

HTC Touch HD

HTC Wildfire S

HTC Windows Phone 8X

HTC Windows Phone 8S

Huawei Ascend D Quad

Huawei Ascend D2 LTE (VoLTE)

Huawei Ascend G510

Huawei Ascend G740

Huawei Ascend P1 U9200

Huawei Ascend P2

Huawei Ascend P6

Huawei Boulder U8350

LG A310

LG Nexus 4 E960

LG Nexus 5

LG Optimus 3D

LG Optimus G

LG Optimus G Pro

LG Optimus L4 II

LG Optimus L5

LG Optimus L5 II

LG Optimus L7

LG Optimus L7 II

LG Optimus L9

LG Optimus LTE2 F160LV (VoLTE)

LG Optimus F7

LG Optimus F180S (VoLTE)

LG Optimus Vu2 F200 (K, L, S) (VoLTE)

LG Optimus Vu3 F300 (K, L, S) (VoLTE)

LG Optimus GX F310L (VoLTE)

LG Optimus G2 F320S (VoLTE)

LG Optimus G Pro F240S (VoLTE)

LG Optimus LTE III F260S (VoLTE)

LG Optimus G Flex F340S (VoLTE)

LG Optimus 3D Max (P720)

LG Optimus 3D (P920)

LG Optimus 4X HD (P880)

LG Prada (P940)

LG Prada 3.0

Motorola quench

Motorola RAZR (XT910)

Motorola RAZR i (XT890)

Motorola Moto G

Nokia 300

Nokia 301

Nokia 302

Nokia 500

Nokia 515

Nokia 600

Nokia 603

Nokia 700

Nokia 701

Nokia 808 PureView

Nokia 5230

Nokia 5330 Mobile TV Edition

Nokia 5530 XpressMusic

Nokia 5630 XpressMusic

Nokia 5730 XpressMusic

Nokia 6260 Slide

Nokia 6600i slide

Nokia 6700 classic

Nokia 6700 slide

Nokia 6710 Navigator

Nokia 6720 classic

Nokia 6760 slide

Nokia 6790 slide

Nokia 7230

Nokia Asha 300

Nokia Asha 301

Nokia Asha 302

Nokia Asha 303

Nokia Asha 311

Nokia C2-01

Nokia C3-01 Touch and Type

Nokia C5-03

Nokia C5-04

Nokia C6-00

Nokia C6-01

Nokia C7-00

Nokia E5-00

Nokia E52

Nokia E55

Nokia E6-00

Nokia E63

Nokia E7 Communicator

Nokia E7-00

Nokia E71x

Nokia E72

Nokia E73 Mode

Nokia Lumia 510

Nokia Lumia 520

Nokia Lumia 520.2

Nokia Lumia 521

Nokia Lumia 610

Nokia Lumia 610 NFC

Nokia Lumia 620

Nokia Lumia 625

Nokia Lumia 710

Nokia Lumia 720

Nokia Lumia 800

Nokia Lumia 810

Nokia Lumia 820

Nokia Lumia 822

Nokia Lumia 900

Nokia Lumia 920

Nokia Lumia 920T

Nokia Lumia 925

Nokia Lumia 925 Superman Edition

Nokia Lumia 925T

Nokia Lumia 928

Nokia Lumia 929 LTE-A (Cat 4)

Nokia Lumia 1020.2

Nokia Lumia 1020 3G

Nokia Lumia 1020 LTE

Nokia Lumia 1520

Nokia N7

Nokia N8

Nokia N86 8MP

Nokia N9

Nokia N97

Nokia N97 mini

Nokia X3-02

Nokia X5

Nokia X6-00

Nokia X7-00

Samsung Galaxy Star

Samsung Galaxy Light

Samsung Ativ S

Samsung galaxy ace

Samsung Galaxy Ace 2

Samsung Galaxy Express

Samsung Galaxy Fame

Samsung Galaxy Mini

Samsung Galaxy Mini 2

Samsung Galaxy Nexus

Samsung Galaxy Note

Samsung Galaxy Note II

Samsung Galaxy Note 3

Samsung Galaxy Note 10.1

Samsung Galaxy S II

Samsung Galaxy S III

Samsung Galaxy S4

Samsung Galaxy S4 Mini

Samsung Galaxy S5

Samsung Galaxy S Advance

Samsung Galaxy S Plus

Samsung Galaxy W

Samsung Galaxy Y

Samsung Omnia 7

Samsung Omnia Pro B7350

Samsung S5610

Samsung Wave 3

Samsung Trend Plus

Sony Ericsson Elm

Sony Ericsson Hazel

Sony Ericsson Live with Walkman

Sony Ericsson Xperia X8

Sony Ericsson Xperia X10 Mini

Sony Ericsson Xperia X10 mini pro

Sony Ericsson Xperia acro

Sony Ericsson Xperia active

Sony Ericsson Xperia Arc

Sony Ericsson Xperia arc S

Sony Ericsson Xperia mini

Sony Ericsson Xperia Mini Pro

Sony Ericsson Xperia neo

Sony Ericsson Xperia Neo V

Sony Ericsson Xperia Play

Sony Ericsson Xperia ray

Sony Ericsson W995

Sony Xperia A SO-04E

Sony xperia acro s

Sony xperia e

Sony Xperia E dual C1605

Sony Xperia Go (ST27i)

Sony Xperia i1 HSPA + C6902

Sony Xperia i1 LTE

Sony xperia ion

Sony xperia j

Sony xperia l

Sony xperia m

Sony Xperia M35t (VoLTE)

Sony xperia miro

Sony Xperia P

Sony xperia s

Sony xperia sola

Sony Xperia SP

Sony xperia t

Sony xperia tipo

Sony Xperia Tipo Dual

Sony xperia u

Sony xperia v

Sony Xperia Z

Sony Xperia ZL

Sony Xperia ZR

Sony Xperia Z1

Sony Xperia Z1S (T Mobile US only)

Sony Xperia Z1 Compact LTE-A D5503

Sony Xperia Z2 LTE-A D6543 / D6503

TechFaith Wildfire 80

ZTE Blade V880

ZTE Crescent (San Francisco 2 on Orange)

ZTE Era

ZTE F160 Atlanta

ZTE Grand X

ZTE Grand X IN

ZTE Orbit

ZTE R252 / Orange Tara

ZTE Kis Pro / Orange Zali

ZTE Skate (Monte Carlo on Orange)

ZTE Smart Netphone 701

ZTE Tania

ZTE TMN smart A15




Recognition problems



Human speech is such that in the radio and on a narrow band of the audio spectrum, the sounds of “C” and “F” are poorly distinguished. Couples “T” and “P”, “M” and “H” also suffer. Hence the mnemonic system with the names: “I dictate, number two-zero-zero, three Olga, as I understand it, reception?”. From here, things like the assignment of the most critical parts of the call sign to “p” grow up - for example, a sniper who in no case should confuse his call sign will most likely be “thirty-third” because it is very well discernible on air in narrow conditions. stripes.



Voice recognition is also growing with HD-Voice. This is important both for personal communication and for additional voice authentication gaining popularity, for example, in banks.

Source: https://habr.com/ru/post/228231/



All Articles