Ends in

AWS Cloud Practitioner Sale - Get a Chance to Win FREE Access to our new DEA Practice Exams

Find answers, ask questions, and connect with our
community around the world.

Home Forums AWS AWS Certified Data Engineer Associate DEA-C01 Confusing question about Glue & Quicksight

  • Confusing question about Glue & Quicksight

  • or-3

    April 18, 2024 at 8:04 am

    The question: “An organization collects daily user interaction and app performance logs from its mobile app. These initially raw logs are transformed by AWS Glue for Spark Jobs and then stored in an Amazon S3 bucket for visualization with Amazon QuickSight.

    The company’s Data Engineer has noticed a gradual slowdown in the QuickSight Dashboard’s query performance over time. Upon investigation, the engineer determined that the long duration of the AWS Glue Jobs is the root cause.

    What combination of actions should the Data Engineer consider to improve query performance?”

    -The question says that the data is “stored in an Amazon S3 bucket for visualization with Amazon QuickSight.”, so presumably Quicksight is reading the data directly from S3. If so, how could it also be that “the long duration of the AWS Glue Jobs is the root cause.”? So the premise here is confusing right from the start.

    -Furthermore, the correct answers to the questions are said to be:

    <ul data-question_id=”14075″ data-type=”multiple”>

  • Modify the AWS Glue Job worker type configuration to a larger instance type.
  • Arrange the data within the S3 bucket into partitions by year, month, and day.
  • One of these affects the Glue Job, the other the way the data is stored. How is a Quicksight Report performance affected by a Glue job worker type? It could be that the data would be more readily available (refreshed faster) if the Worker Type is larger but as far as I know, Quicksight doesn’t “kick off” a Glue Job when a Report is run.

    Perhaps I’m missing something?

  • Neil-TutorialsDojo

    April 23, 2024 at 1:39 pm

    Hi or-3,

    Thank you for posting here.
    Amazon QuickSight uses the cleaned and structured data prepared by AWS Glue for visualizations and dashboards. By upgrading the Glue instance size and partitioning the data, you not
    only improve the efficiency of the data processing jobs but also enhance
    the responsiveness and performance of the analytics and dashboards in
    I hope this helps.

    Neil @ Tutorials Dojo

  • Viewing 1 - 2 of 2 replies

    Log in to reply.

    Original Post
    0 of 0 posts June 2018