
Call Center Analysis

To test the technology, I recorded several calls to different call centers. Below they appear under the code names water, mosenergo, rigla, transaero and worldclass.


The first step is to split the recording into utterances. As it turned out, this process is called diarization. There are several ready-made tools: LIUM, ALIZE, SDT. I chose LIUM because it looked more solid than the rest. I wrote a small wrapper and marked up my recordings. The quality turned out to be acceptable; the picture shows the markup for water:

[Figures: diarization markup for mosenergo, rigla, transaero and worldclass]
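
To make the diarization step more concrete, here is a minimal sketch of how segmentation output could be turned into a list of speaker turns. It is not the wrapper mentioned above. It assumes the eight-column LIUM `.seg` layout (show, channel, start, length, gender, band, environment, speaker label) with start and length counted in 10 ms frames; the names `Segment`, `parse_seg` and `merge_turns` are hypothetical.

```python
from collections import namedtuple

# One diarized stretch of audio: who spoke and when (in seconds).
Segment = namedtuple("Segment", ["speaker", "start", "end"])

FRAMES_PER_SECOND = 100  # assumption: one feature frame covers 10 ms


def parse_seg(text):
    """Parse LIUM-style .seg text into Segments sorted by start time."""
    segments = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and ';;' comment lines (cluster headers).
        if not line or line.startswith(";;"):
            continue
        fields = line.split()
        start_frames = int(fields[2])
        length_frames = int(fields[3])
        segments.append(Segment(
            speaker=fields[7],
            start=start_frames / FRAMES_PER_SECOND,
            end=(start_frames + length_frames) / FRAMES_PER_SECOND,
        ))
    return sorted(segments, key=lambda s: s.start)


def merge_turns(segments):
    """Join consecutive segments of the same speaker into one utterance."""
    turns = []
    for seg in segments:
        if turns and turns[-1].speaker == seg.speaker:
            # Same speaker keeps talking: extend the previous turn.
            turns[-1] = turns[-1]._replace(end=max(turns[-1].end, seg.end))
        else:
            turns.append(seg)
    return turns
```

Each resulting turn gives a time window that can be cut out of the recording and sent to a recognizer as a separate utterance.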


To transcribe the utterances, I used Yandex SpeechKit. To estimate the error of their API, I first fed it the utterances cut according to the manual reference markup. The errors are significant, but tolerable. The picture shows the transcription for water:





[Figures: transcription for mosenergo, rigla, transaero and worldclass]
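
The text does not say how the recognition error was measured. One common choice is word error rate (WER): the word-level edit distance between a reference transcript and the recognizer output, divided by the number of reference words. The sketch below is an illustration under that assumption; `word_error_rate` is a hypothetical helper, not part of any SpeechKit API.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word dropped by the recognizer
                curr[j - 1] + 1,     # extra word inserted by the recognizer
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)
```

Computing this once for utterances cut by the manual markup and once for the automatic segmentation would give two numbers that can be compared directly.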



Then I transcribed the utterances obtained automatically. I made sure that the errors did not change much, imagined myself as the owner of a call center, and tried to extract some benefit:
  1. I tried to evaluate the success of the calls by keywords:
     show_query_results(query_transcripts(u'  ', transcripts, top=10))

     show_query_results(query_transcripts(u'    ', transcripts, top=5)) 

     show_query_results(query_transcripts(u' ', transcripts)) 

     show_query_results(query_transcripts(u' ', transcripts)) 

  2. I checked the operators' skill at pushing services:
     show_query_results(query_transcripts(u'   ', transcripts)) 

     show_query_results(query_transcripts(u'     ', transcripts)) 

     show_query_results(query_transcripts(u'        ', transcripts)) 

     show_query_results(query_transcripts(u'  ', transcripts)) 

     show_query_results(query_transcripts(u'      ', transcripts)) 

     show_query_results(query_transcripts(u'     ', transcripts)) 

     show_query_results(query_transcripts(u'       ', transcripts)) 

     show_query_results(query_transcripts(u'    ', transcripts)) 

     show_query_results(query_transcripts(u'    ', transcripts)) 

  3. I tried to determine the purpose of the call:
     show_query_results(query_transcripts(u'            ', transcripts, top=5)) 

     show_query_results(query_transcripts(u' ', transcripts)) 
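
The implementation of `query_transcripts` is not shown here. As a rough illustration of what keyword search over transcripts can look like, the sketch below ranks calls by how often the query words occur in them. The function `rank_by_keywords`, the helper `tokenize`, and the assumption that `transcripts` maps a call name to its recognized text are all hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into words."""
    # \w is Unicode-aware in Python 3, so Cyrillic words are kept intact.
    return re.findall(r"\w+", text.lower())


def rank_by_keywords(query, transcripts, top=None):
    """Return (name, score) pairs, best first.

    The score is the total number of occurrences of the query words
    in the transcript; calls with no hits are left out.
    """
    keywords = set(tokenize(query))
    scored = []
    for name, text in transcripts.items():
        counts = Counter(tokenize(text))
        score = sum(counts[word] for word in keywords)
        if score > 0:
            scored.append((name, score))
    # Highest score first; ties are broken alphabetically by name.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:top]
```

Exact word matching like this misses inflected forms, which matters for Russian transcripts, so a practical version would probably stem or lemmatize both the query and the text first.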



In my opinion, the technology is viable and can bring some benefit.


Source: https://habr.com/ru/post/261993/

