Automatic Speech Recognition - Behind the Scenes

Alexa, Google, Cortana, Watson... these are household names today. Speech has become an important way of interacting with digital products. Whether as a developer or as a user, you have probably wondered how this works. Automatic Speech Recognition is the buzzword. This presentation gives a sneak peek into how audio gets converted into text and explains the technologies and algorithms behind it.

Outline/Structure of the Talk

Automatic Speech Recognition (ASR)

  • Use-cases of ASR
  • Difficulties faced in building ASR systems
  • History of ASR
  • Introduction to Deep Learning
  • ASR process and models
  • Different types of ASR systems
  • Customising ASRs
  • Demo of ASR

Learning Outcome

This presentation aims to introduce the audience to the basics of Automatic Speech Recognition. It also covers some deep learning basics where they are needed to get the point across. Overall, the audience will get a good understanding of the steps involved in converting speech to text.

Target Audience

The session is generic in nature and caters to any kind of developer who is interested in knowing how Automatic Speech Recognition (ASR) works.

Prerequisites for Attendees

There are no prerequisites for this session. It is a general session for anyone with a technical background.

Submitted 1 year ago

