Upload
others
View
10
Download
0
Embed Size (px)
Citation preview
1Title of the Presentation Goes Here© 2018 Carnegie Mellon University
SATURN 201814th Annual SEI Architecture Technology User Network Conference
MAY 7–10, 2018 | PLANO, TEXAS
How to “Talk” to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition APIs
Arila Atanassova-Barnes
2Title of the Presentation Goes Here© 2018 Carnegie Mellon University
Agenda
Use CasesTechnology MaturityTechnology OptionsArchitecture ConsiderationsDemo
3Title of the Presentation Goes Here© 2018 Carnegie Mellon University
SATURN 2018
Use Cases
Ticketing Systemhttps://cloud.google.com/solutions/architecture-of-a-serverless-ml-model
Question and Answers a.k.a Chatbotshttps://dev.botframework.com/
Voice First Experiencehttps://developer.amazon.com/alexa-voice-service
Context, Content and Insightshttps://www.ibm.com/watson/services/discovery/devresources/
4Title of the Presentation Goes Here© 2018 Carnegie Mellon University
Technology Maturity
Functionality Watson AWS Azure Google
Speech to Text Speech to Text Amazon Transcribe Speech Services Google Speech
Translation LanguageTranslator
Amazon Translate Speech Services Google Translation
Entities, Intent, Categories
Natural Language Understanding
Amazon Comprehend
LUIS Google NLP
Sentiment Tone Analyzer Amazon Comprehend
Text Analytics Google NLP
Text to Speech Text to Speech Amazon Poly Speech Services Speech Synthesis
Bots Watson Assistant Amazon Lex Bot Framework Dialogflow
Custom Models Knowledge Studio Sage Maker LUIS App AutoML
5Title of the Presentation Goes Here© 2018 Carnegie Mellon University
Technology Options - Speech Recognition APIs
Cloud Vendor IBM WatsonSpeech to Text
AWS Transcribe Google Speech API Azure Speech to Text
Price $0.02 /min$1.2 /hour
$0.0004 per second60 minutes per month free$1.44/hour
60 min free$0.006 USD / 15 seconds*
$1.44/hour
5 hours per month$0.50/ hour
Interesting Features • Speaker labels• High Noise Environment
• Recognize voices• Custom vocabulary
• 120 Languages support• Automatic Punctuation
• Speaker Verification
Tools • SDKs• REST• Node-red
• SDK• REST
• SDK• REST
• SDK• REST• Node-red
6Title of the Presentation Goes Here© 2018 Carnegie Mellon University
SATURN 2018
Architecture Considerations
• Complexity
• Streaming or Asynchronous• Voice Commands and Events
• Custom Models and out of the box
• Smart automation with bots
• Security and Compliance - How ready for HIPPA is Alexa?• Ease of use and documentation
• Accuracy and Training
• Cost to Serve
7Title of the Presentation Goes Here© 2018 Carnegie Mellon University
How to “Talk” to Your Software: Alexa, Google, Watson, and Cortana, a Side-by-Side Comparison of Cloud Speech Recognition APIs
Node-red Demo
8Title of the Presentation Goes Here© 2018 Carnegie Mellon University
SATURN 2018