How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition

1Title of the Presentation Goes Here© 2018 Carnegie Mellon University

SATURN 201814th Annual SEI Architecture Technology User Network Conference

MAY 7–10, 2018 | PLANO, TEXAS

How to “Talk” to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition APIs

Arila Atanassova-Barnes


Agenda

Use CasesTechnology MaturityTechnology OptionsArchitecture ConsiderationsDemo


SATURN 2018

Use Cases

Ticketing Systemhttps://cloud.google.com/solutions/architecture-of-a-serverless-ml-model

Question and Answers a.k.a Chatbotshttps://dev.botframework.com/

Voice First Experiencehttps://developer.amazon.com/alexa-voice-service

Context, Content and Insightshttps://www.ibm.com/watson/services/discovery/devresources/


Technology Maturity

Functionality Watson AWS Azure Google

Speech to Text Speech to Text Amazon Transcribe Speech Services Google Speech

Translation LanguageTranslator

Amazon Translate Speech Services Google Translation

Entities, Intent, Categories

Natural Language Understanding

Amazon Comprehend

LUIS Google NLP

Sentiment Tone Analyzer Amazon Comprehend

Text Analytics Google NLP

Text to Speech Text to Speech Amazon Poly Speech Services Speech Synthesis

Bots Watson Assistant Amazon Lex Bot Framework Dialogflow

Custom Models Knowledge Studio Sage Maker LUIS App AutoML


Technology Options - Speech Recognition APIs

Cloud Vendor IBM WatsonSpeech to Text

AWS Transcribe Google Speech API Azure Speech to Text

Price $0.02 /min$1.2 /hour

$0.0004 per second60 minutes per month free$1.44/hour

60 min free$0.006 USD / 15 seconds*

$1.44/hour

5 hours per month$0.50/ hour

Interesting Features • Speaker labels• High Noise Environment

• Recognize voices• Custom vocabulary

• 120 Languages support• Automatic Punctuation

• Speaker Verification

Tools • SDKs• REST• Node-red

• SDK• REST

• SDK• REST

• SDK• REST• Node-red


SATURN 2018

Architecture Considerations

• Complexity

• Streaming or Asynchronous• Voice Commands and Events

• Custom Models and out of the box

• Smart automation with bots

• Security and Compliance - How ready for HIPPA is Alexa?• Ease of use and documentation

• Accuracy and Training

• Cost to Serve


How to “Talk” to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition APIs

Node-red Demo


SATURN 2018

Documents

How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition