Not lex but Comprehend?

Not lex but Comprehend?

Neil-TutorialsDojo updated 3 months, 2 weeks ago 5 Members · 6 Posts
AWS Certified Solutions Architect Professional
aparna

Member
January 10, 2023 at 7:48 pm

for the below question, i think it is comprehend to identify sentiment but the correct answer says it’s lex…can someone help why

9. QUESTION

Category: CSAP – Continuous Improvement for Existing Solutions

A retail company runs its customer support call system on its in-house data center. The Solutions Architect was tasked to migrate the call system to AWS and leverage the use of managed services to reduce management overhead. The solution must be able to handle the current tasks such as receiving calls and creating contact flows. It must be able to scale to handle more calls as the customer base grows. The company also wants to add deep learning capabilities to the call system to reduce the need to speak to an agent. It must be able to recognize the intent of the caller based on certain keywords and handle basic tasks, as well as provide information to the call center agents.

Which combination of actions should the Solutions Architect implement to meet the company’s requirements? (Select TWO.)

<ul data-question_id=”11063″ data-type=”multiple”>

Correct

Amazon Connect is an easy-to-use omnichannel cloud contact center that helps companies provide superior customer service across voice, chat, and tasks at a lower cost than traditional contact center systems. You can set up a contact center in a few steps, add agents who are located anywhere, and start engaging with your customers.

You can create personalized experiences for your customers using omnichannel communications. For example, you can dynamically offer chat and voice contact based on such factors as customer preference and estimated wait times. Agents, meanwhile, conveniently handle all customers from just one interface. For example, they can chat with customers and create or respond to tasks as they are routed to them.

The following diagram shows these key characteristics of Amazon Connect.

To help provide a better contact center, you can use Amazon Connect to integrate with several AWS services to provide Machine Learning (ML) and Artificial Intelligence (AI) capabilities.

Amazon Connect uses the following services for ML/AI:

Amazon Lex—Let you create a chatbot to use as an Interactive Voice Response (IVR).

Amazon Polly—Provides text-to-speech in all contact flows.

Amazon Transcribe—Grabs conversation recordings from Amazon S3 and transcribes them to text so you can review them.

Amazon Comprehend—Takes the transcription of recordings and applies speech analytics machine learning to the call to identify sentiment, keywords, adherence to company policies, and more.

Amazon Lex is a service for building conversational interfaces into any application using voice and text. Amazon Lex provides the advanced deep learning functionalities of automatic speech recognition (ASR) for converting speech to text and natural language understanding (NLU) to recognize the intent of the text to enable you to build applications with highly engaging user experiences and lifelike conversational interactions.

By using an Amazon Lex chatbot in your call center, callers can perform tasks such as changing a password, requesting a balance on an account, or scheduling an appointment without needing to speak to an agent. These chatbots use automatic speech recognition and natural language understanding to ascertain a caller’s intent, maintain context and fluidly manage the conversation. Amazon Lex uses AWS Lambda functions to query your business applications, provide information back to callers, and make updates as requested.

The option that says: Use the Amazon Connect service to create an omnichannel cloud-based contact center for the agents is correct. Amazon Connect is the AWS service you will want to use to build a cloud-based call center system.

The option that says: Buffer the incoming customer calls to an Amazon SQS queue and process their voice on Amazon Lex to recognize their intent is correct. Amazon Lex has deep learning functionalities of automatic speech recognition as well as natural language understanding (NLU) to recognize the intent of the text.

The option that says: Leverage Amazon Rekognition to identify the caller and process the voice through Amazon Polly to determine the intent based on the customer’s voice is incorrect. Amazon Rekognition is used to identify persons on photos or videos, not on voice calls. Amazon Polly is used for text-to-speech services. You should use Amazon Lex for this scenario.

The option that says: Build a conversational interface on Amazon Alexa for Business to have AI-based answers to customer queries thereby reducing the need to speak to an agent is incorrect. Alexa for Business gives you the tools you need to manage Alexa devices, enroll your users, and assign skills at scale for your organization. For conversational interfaces such as chatbots, you can use Amazon Lex.

The option that says: Send incoming customer calls to an Amazon Kinesis stream and process their voice through Amazon Comprehend to determine the customer’s intent is incorrect. Amazon Polly provides text-to-speech services. It can’t recognize or interpret the intent of the customer based on their voice. The scenario requires understanding the intent of the user based on their speech, so Amazon Lex is better suited for this.

References:

https://aws.amazon.com/connect/

https://docs.aws.amazon.com/connect/latest/adminguide/related-services-amazon-connect.html

https://aws.amazon.com/lex/

https://docs.aws.amazon.com/lexv2/latest/dg/what-is.html

Check out the Amazon Lex Cheat Sheet:

https://tutorialsdojo.com/amazon-lex/

Kenneth-Samonte-Tutorials-Dojo

Member

January 13, 2023 at 4:23 pm

Hi aparna,

Thank you for the feedback.

Amazon Lex—Let you create a chatbot to use as an Interactive Voice Response (IVR).

Amazon Polly—Provides text-to-speech in all contact flows.

Amazon Transcribe—Grabs conversation recordings from Amazon S3 and transcribes them to text so you can review them.

Amazon Comprehend—Takes the transcription of recordings and applies speech analytics machine learning to the call to identify sentiment, keywords, adherence to company policies, and more.

The correct answer is Lex, because Comprehend only works with Text based input. The question says they are receiving calls – thus voice based input.

Hope this helps.

Let us know if you need further assistance. The Tutorials Dojo team is dedicated to helping you pass your AWS exam!

Regards,

Kenneth Samonte @ Tutorials Dojo

kaws8902

Member

December 12, 2023 at 7:45 am

Hello Tutorial Dojo, “Buffer the incoming customer calls to an Amazon SQS queue” is incorrect. Streamer audio uses SIP and RTP and those are not supported over SQS. There is no reference architecture showing audio packets being processed through SQS.

ccatchings

Member

December 12, 2023 at 4:03 pm

While you can technically store the output of an audio in an SQS message in binary format, it would most likely be limited to 15 seconds per message since SQS messages have a max size of 256kB. I’d like to see a reference architecture for this as well. How many SQS queues do you use? One per Lex bot? How do the audio packets get sent to Lex? Lambda? I’d like to see sample code for that too.

AWS Documentation actually recommends using Kinesis Video Streams to manage live audio streaming in Amazon Connect. Maybe then you could pass the streaming audio data to the Lex bot?

https://docs.aws.amazon.com/connect/latest/adminguide/access-media-stream-data.html

I think this question needs to be re-visited as the explanation contains multiple architectural gaps and creates more confusion than clarifying a solution.

kaws8902

Member

December 13, 2023 at 12:28 am

Thanks @ccatchings, your explanation of 256 kB per binary packet makes sense. But it would be difficult for consumers to to reassemble the packets in the correct sequence, especially since typically SQS may have multiple consumers in parallel. At least SQS FIFO queue should be mentioned in the question. I agree that sample code/reference architecture is needed.

Neil-TutorialsDojo

Member

January 16, 2024 at 5:52 pm

Hi @Aparna-Bl, @ccatchings , @kaws8902

Thank you for raising your question, aparna. Following up Kenneth’s response Amazon Lex is the preferred choice for recognizing caller intent based on voice input. Amazon Comprehend, on the other hand, is more suited for text-based input. In the context of receiving calls and creating contact flows, the voice-based capabilities of Amazon Lex make it the correct solution for this scenario.

To kaws8902 and ccatchings, your concerns about SQS limitations for streaming audio data are valid. Thanks for pointing this out. In a call center setup where someone is talking directly to a Lex bot and getting immediate responses, you don’t really need SQS. Amazon Lex has its own API that you can use to facilitate the conversation between the caller and the bot. Though here’s how I imagine SQS can be used with Amazon Lex. Instead of putting the audio itself in the SQS message, we can store the audio somewhere like in Amazon S3, and just reference its URL in the message. Then, a Lambda function can pick up this message from SQS, process the audio file from S3, and subsequently invoke the Lex API for further actions.

With all that said, we apologize for any confusion this question might have caused. We really appreciate your input and will make sure to refine and improve this question based on your feedback.

Viewing 1 - 6 of 6 replies