The so-called “Web Speech API” is a set of APIs that let you add speech support to your websites or applications. It is made up of two main parts:

  1. SpeechSynthesis, or Text to Speech (TTS). This API allows you to generate spoken output from text content within a browser, leveraging the voices available on the user’s system.
  2. SpeechRecognition, or Speech to Text. This API allows you to convert the user’s spoken input into the equivalent text within a browser, leveraging the recognition engine that the browser makes available (provided by the operating system or, in some browsers, by a remote service).
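
To make the two parts more concrete, here is a minimal sketch of each. The helper names `speak` and `listen`, and the injected `win` parameter, are assumptions made for illustration (they are not part of the API); only `SpeechSynthesisUtterance`, `speechSynthesis.speak()`, `SpeechRecognition` (or its prefixed form `webkitSpeechRecognition`), `onresult` and `start()` come from the Web Speech API itself.

```javascript
// Text to Speech: wrap the text in an utterance and hand it to the synthesizer.
// `win` is a hypothetical injection point; in a browser it defaults to the global object.
function speak(text, win = globalThis) {
  const utterance = new win.SpeechSynthesisUtterance(text);
  win.speechSynthesis.speak(utterance);
  return utterance;
}

// Speech to Text: start a recognition session and pass the first transcript
// to the `onText` callback (a name chosen for this sketch).
function listen(onText, win = globalThis) {
  // Some browsers only expose the prefixed constructor.
  const Recognition = win.SpeechRecognition || win.webkitSpeechRecognition;
  if (typeof Recognition !== 'function') {
    throw new Error('SpeechRecognition is not available in this environment');
  }
  const recognition = new Recognition();
  recognition.onresult = (event) => {
    // results[0] is the first result, [0] its top alternative.
    onText(event.results[0][0].transcript);
  };
  recognition.start();
  return recognition;
}
```

In a browser you would call `speak('Hello')` or `listen((text) => console.log(text))`, typically from a click handler, since browsers usually ask for microphone permission and may require a user gesture before speaking.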

You can decide whether you want to use TTS, speech to text, or a combination of both, depending on your specific needs. The API is easily accessible in “modern” browsers via JavaScript.
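
Since support varies between browsers, it can help to check what is available before using either part. The following is a small sketch of such a check; the function name `detectSpeechSupport` and its `win` parameter are assumptions made for this example, not part of the API.

```javascript
// Report which parts of the Web Speech API the given global object exposes.
// `win` defaults to the global object, so in a browser you can call it with no arguments.
function detectSpeechSupport(win = globalThis) {
  // Some browsers only expose the prefixed recognition constructor.
  const Recognition = win.SpeechRecognition || win.webkitSpeechRecognition;
  return {
    synthesis:
      'speechSynthesis' in win &&
      typeof win.SpeechSynthesisUtterance === 'function',
    recognition: typeof Recognition === 'function',
  };
}
```

You could then enable only the features that are present, for example by hiding a "dictate" button when `detectSpeechSupport().recognition` is `false`.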

