Hello, Habr. As part of the course "Machine Learning. Advanced", we have prepared a translation of an interesting article for you.
We also invite everyone to an open lesson on "Multi-armed bandits for optimizing A/B tests".
This article shows how to turn the speech in an audio recording into text using Python.
The recognition itself is done through the IBM Speech to Text API.
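Contents:
Step 1 - Libraries
Step 2 - Importing the audio clip
Step 3 - Defining the recognizer
Step 4 - The recognizer in action
Final step - Exporting the result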
Speech recognition is available as an API service from several large providers, for example Microsoft Azure.
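Let's begin:
Step 1 - Libraries
This project needs a single library: SpeechRecognition.
SpeechRecognition is a wrapper around several speech recognition APIs; Microsoft Azure Speech, Google Cloud Speech, and the IBM Watson Speech to Text API are among those it supports. This project uses the IBM Watson Speech to Text API, and SpeechRecognition is the only package that has to be installed.
Install it with pip, Python's package manager.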
pip install SpeechRecognition
Once it is installed, import the library, for example in a Jupyter Notebook.
import speech_recognition as s_r
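Step 2 - Importing the audio clip
The clip was recorded in m4a format, but the library does not read m4a files, so it was converted to wav first.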
audio_file = s_r.AudioFile('my_clip.wav')
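Since AudioFile works with WAV (PCM), AIFF, and FLAC input, it can help to confirm that the converted clip really is a readable WAV file before handing it to the recognizer. The following is a minimal sketch using only the standard library wave module. The helper names write_silent_wav and describe_wav are illustrative assumptions and are not part of the original walkthrough.

```python
import wave


def write_silent_wav(path, seconds=1.0, sample_rate=16000):
    """Write a mono 16-bit PCM WAV file containing silence (demo input only)."""
    n_frames = int(seconds * sample_rate)
    with wave.open(path, "wb") as wav_out:
        wav_out.setnchannels(1)            # mono
        wav_out.setsampwidth(2)            # 2 bytes per sample = 16-bit PCM
        wav_out.setframerate(sample_rate)
        wav_out.writeframes(b"\x00\x00" * n_frames)


def describe_wav(path):
    """Return basic header facts for a PCM WAV file."""
    with wave.open(path, "rb") as wav_in:
        frames = wav_in.getnframes()
        rate = wav_in.getframerate()
        return {
            "channels": wav_in.getnchannels(),
            "sample_width_bytes": wav_in.getsampwidth(),
            "sample_rate_hz": rate,
            "duration_s": frames / rate,
        }
```

Calling describe_wav('my_clip.wav') on the converted clip should return without raising wave.Error; if it raises, the file is probably not PCM WAV and the conversion is worth repeating.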
Step 3 - Defining the recognizer
Next, create the recognizer object that will do the work.
rcgnzr = s_r.Recognizer()
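Step 4 - The recognizer in action
Now for the main part: recognition with IBM Speech to Text. Before the recognizer runs, the clip goes through two Recognizer methods, “adjust_for_ambient_noise” and “record”. The first calibrates the recognizer for background noise, and the second reads the audio into an object the recognizer can work with.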
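with audio_file as source:
    # Calibrate the energy threshold against the background noise in the clip
    rcgnzr.adjust_for_ambient_noise(source)
    # Read the rest of the file into an AudioData object
    clean_audio = rcgnzr.record(source)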
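adjust_for_ambient_noise samples the start of the clip to estimate how loud the background is and sets the recognizer's energy threshold from that. To make the idea concrete, here is a rough, standard-library-only sketch that measures the RMS level of the first second of a 16-bit mono WAV file. It illustrates the concept only and is not how SpeechRecognition implements its calibration; leading_rms and write_tone are hypothetical helpers, and the sine tone is synthetic demo input.

```python
import math
import struct
import wave


def leading_rms(path, seconds=1.0):
    """RMS amplitude of the first `seconds` of a 16-bit mono PCM WAV file."""
    with wave.open(path, "rb") as wav_in:
        if wav_in.getsampwidth() != 2 or wav_in.getnchannels() != 1:
            raise ValueError("this sketch only handles 16-bit mono PCM")
        n_frames = min(wav_in.getnframes(), int(seconds * wav_in.getframerate()))
        raw = wav_in.readframes(n_frames)
    if not raw:
        return 0.0
    # WAV PCM samples are little-endian signed 16-bit integers
    samples = struct.unpack("<%dh" % (len(raw) // 2), raw)
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def write_tone(path, amplitude, seconds=1.0, sample_rate=8000, freq_hz=440.0):
    """Write a sine tone as 16-bit mono PCM (synthetic demo input)."""
    n_frames = int(seconds * sample_rate)
    frames = b"".join(
        struct.pack("<h", int(amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)))
        for i in range(n_frames)
    )
    with wave.open(path, "wb") as wav_out:
        wav_out.setnchannels(1)
        wav_out.setsampwidth(2)
        wav_out.setframerate(sample_rate)
        wav_out.writeframes(frames)
```

One practical consequence of calibrating on a file: with the default one-second duration, the audio read for calibration is consumed from the stream, so record picks up after it. If the speech starts immediately, passing a shorter duration to adjust_for_ambient_noise is worth considering.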
With the audio prepared, the IBM recognizer can be run. (The IBM Speech-to-Text API is called here through the Python SpeechRecognition library.) The API call:
# The username is the literal string "apikey"; the password is your own IBM API key
recognized_speech_ibm = rcgnzr.recognize_ibm(clean_audio, username="apikey", password="your API Key")
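Note: the IBM API does not work without an API key. You can get one from the IBM Watson page by creating a Speech-to-Text service. An IBM account is required, and the free plan covers 500 minutes of recognition per month, which is plenty for learning purposes.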
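Because the key is a secret, it is usually safer to keep it out of the notebook itself. A minimal sketch, assuming the key has been stored in an environment variable; the variable name IBM_STT_API_KEY and the helper load_api_key are made up for this example and are not defined by IBM or by SpeechRecognition.

```python
import os


def load_api_key(env_var="IBM_STT_API_KEY"):
    """Read the API key from an environment variable (the name is hypothetical)."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable to your IBM API key"
        )
    return key
```

The returned value can then be passed as the password argument of the recognize_ibm call instead of a hard-coded string.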
Final step - Exporting the result
First, print the recognized text to check it:
print(recognized_speech_ibm)
To keep the result, write it to a text file. The script prints "ready!" when the export is done.
# Save the recognized text to a file
with open('recognized_speech.txt', mode='w') as file:
    file.write("Recognized Speech:")
    file.write("\n")
    file.write(recognized_speech_ibm)
print("ready!")
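If the transcript may contain non-ASCII characters, it can be worth fixing the file encoding explicitly rather than relying on the platform default. A small sketch of the same export step as a reusable function, using pathlib; save_transcript is a hypothetical helper name, not something from the original article.

```python
from pathlib import Path


def save_transcript(text, path="recognized_speech.txt"):
    """Write the transcript under a header line, using UTF-8 explicitly."""
    out_path = Path(path)
    out_path.write_text("Recognized Speech:\n" + text + "\n", encoding="utf-8")
    return out_path
```

In the walkthrough above, this would be called as save_transcript(recognized_speech_ibm).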