Member, October 19, 2022 at 3:36 pm
<div>Confusion about the selected solution for the question below.</div><div>
Q. A retail company stores inventory and historical transaction records in an Amazon S3 bucket integrated with AWS Glue Data Catalog. The customer sales report data is sent to an Amazon Redshift cluster every evening and stored there. To complete the processing, the historical transactions must be joined with the sales report data. The Data Analyst is looking for a solution to significantly reduce the workload of the cluster, as it is already overutilized. The solution must be serverless and require minimal configuration effort.
Which of the following configurations will be able to meet the above requirements?
Ans: Using Amazon Redshift Spectrum, provision an external table for the customer sales report data and join the tables using Redshift SQL
I think the answer should be rephrased as “Using Amazon Redshift Spectrum, provision an external table for the historical transaction data in S3 and join the tables using Redshift SQL”.
The customer sales data is already in Redshift and the historical transaction data is in S3, so the external table is needed for the S3 data. Please correct me if my understanding is wrong.
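To make the point concrete, here is a rough sketch of what I mean, written as Python helpers that only build the Redshift SQL text (nothing is executed). All names are my own assumptions and not from the question: the schema `spectrum_hist`, the Glue database `retail_catalog`, the IAM role ARN, the tables `public.customer_sales_report` and `historical_transactions`, and the join column `transaction_id`.

```python
def external_schema_sql(schema: str, glue_database: str, iam_role_arn: str) -> str:
    """Build a CREATE EXTERNAL SCHEMA statement that maps a Glue Data Catalog
    database into Redshift, so its S3-backed tables can be queried via Spectrum."""
    return (
        f"CREATE EXTERNAL SCHEMA IF NOT EXISTS {schema}\n"
        f"FROM DATA CATALOG\n"
        f"DATABASE '{glue_database}'\n"
        f"IAM_ROLE '{iam_role_arn}';"
    )


def join_sql(local_table: str, external_table: str, key: str) -> str:
    """Build a query joining a local Redshift table with an external (S3) table."""
    return (
        f"SELECT s.*, h.*\n"
        f"FROM {local_table} AS s\n"
        f"JOIN {external_table} AS h\n"
        f"  ON s.{key} = h.{key};"
    )


# Hypothetical names, for illustration only.
schema_stmt = external_schema_sql(
    schema="spectrum_hist",
    glue_database="retail_catalog",
    iam_role_arn="arn:aws:iam::123456789012:role/ExampleSpectrumRole",
)
join_stmt = join_sql(
    local_table="public.customer_sales_report",          # already in Redshift
    external_table="spectrum_hist.historical_transactions",  # data stays in S3
    key="transaction_id",
)

print(schema_stmt)
print(join_stmt)
```

The external side of the join is the historical transaction table in S3, while the sales report table is the local one, which is why I think the answer wording is reversed.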
Administrator, October 20, 2022 at 2:55 am
Good catch, shreyask. Yes. The Redshift Spectrum table should be referencing the data in S3. Thanks for the feedback and suggestion as well. We’ll have the correct answer reworded.
Let me know if there’s something I can help you with.
Carlo @ Tutorials Dojo