8
1 Title of the Presentation Goes Here © 2018 Carnegie Mellon University SATURN 2018 14 th Annual SEI Architecture Technology User Network Conference MAY 7–10, 2018 | PLANO, TEXAS How to “Talk” to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition APIs Arila Atanassova-Barnes

How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition

  • Upload
    others

  • View
    10

  • Download
    0

Embed Size (px)

Citation preview

Page 1: How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition

1Title of the Presentation Goes Here© 2018 Carnegie Mellon University

SATURN 201814th Annual SEI Architecture Technology User Network Conference

MAY 7–10, 2018 | PLANO, TEXAS

How to “Talk” to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition APIs

Arila Atanassova-Barnes

Page 2: How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition

2Title of the Presentation Goes Here© 2018 Carnegie Mellon University

Agenda

Use CasesTechnology MaturityTechnology OptionsArchitecture ConsiderationsDemo

Page 3: How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition

3Title of the Presentation Goes Here© 2018 Carnegie Mellon University

SATURN 2018

Use Cases

Ticketing Systemhttps://cloud.google.com/solutions/architecture-of-a-serverless-ml-model

Question and Answers a.k.a Chatbotshttps://dev.botframework.com/

Voice First Experiencehttps://developer.amazon.com/alexa-voice-service

Context, Content and Insightshttps://www.ibm.com/watson/services/discovery/devresources/

Page 4: How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition

4Title of the Presentation Goes Here© 2018 Carnegie Mellon University

Technology Maturity

Functionality Watson AWS Azure Google

Speech to Text Speech to Text Amazon Transcribe Speech Services Google Speech

Translation LanguageTranslator

Amazon Translate Speech Services Google Translation

Entities, Intent, Categories

Natural Language Understanding

Amazon Comprehend

LUIS Google NLP

Sentiment Tone Analyzer Amazon Comprehend

Text Analytics Google NLP

Text to Speech Text to Speech Amazon Poly Speech Services Speech Synthesis

Bots Watson Assistant Amazon Lex Bot Framework Dialogflow

Custom Models Knowledge Studio Sage Maker LUIS App AutoML

Page 5: How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition

5Title of the Presentation Goes Here© 2018 Carnegie Mellon University

Technology Options - Speech Recognition APIs

Cloud Vendor IBM WatsonSpeech to Text

AWS Transcribe Google Speech API Azure Speech to Text

Price $0.02 /min$1.2 /hour

$0.0004 per second60 minutes per month free$1.44/hour

60 min free$0.006 USD / 15 seconds*

$1.44/hour

5 hours per month$0.50/ hour

Interesting Features • Speaker labels• High Noise Environment

• Recognize voices• Custom vocabulary

• 120 Languages support• Automatic Punctuation

• Speaker Verification

Tools • SDKs• REST• Node-red

• SDK• REST

• SDK• REST

• SDK• REST• Node-red

Page 6: How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition

6Title of the Presentation Goes Here© 2018 Carnegie Mellon University

SATURN 2018

Architecture Considerations

• Complexity

• Streaming or Asynchronous• Voice Commands and Events

• Custom Models and out of the box

• Smart automation with bots

• Security and Compliance - How ready for HIPPA is Alexa?• Ease of use and documentation

• Accuracy and Training

• Cost to Serve

Page 7: How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition

7Title of the Presentation Goes Here© 2018 Carnegie Mellon University

How to “Talk” to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition APIs

Node-red Demo

Page 8: How to 'Talk' to Your Software: Alexa, Google, Watson, and ... · How to "Talk" to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition

8Title of the Presentation Goes Here© 2018 Carnegie Mellon University

SATURN 2018