AWS Certified Machine Learning – Specialty: Incorrect answers?

  • Ziwei Gao

    Member
    August 18, 2025 at 9:00 pm

    Hello, I would like to confirm the answer to the following question. I think the answer is B, but the given answer is C.

    A Data Engineer is designing a solution for customer data analysis using Amazon Athena. An on-premises application produces the data as CSV files in near real-time. The Engineer needs to convert the data to Apache Parquet format before saving it on an Amazon S3 bucket.

    Which method provides the LEAST configuration overhead?

    A. Configure an Amazon EMR cluster with Apache Spark Structured Streaming to consume and transform the customer data into Apache Parquet.

    B. Use Amazon Kinesis Data Streams to ingest customer data and configure a Firehose stream as a consumer to convert the data into Apache Parquet.

    C. Use Amazon Kinesis Data Streams to consume customer data. Create a streaming ETL job in AWS Glue to convert data into Apache Parquet.

    D. Configure an Amazon EC2 instance with Apache Kafka to consume the customer data. Export the data to the S3 bucket in Parquet format with Kafka Connect S3 sink connector.

  • JR-TutorialsDojo

    Administrator
    August 19, 2025 at 11:28 am

    Hello Ziwei Gao,

    Thank you for your feedback.

    Could you please elaborate on why Option B is considered correct?

    From the explanation provided:

    “The option that says: Use Amazon Kinesis Data Streams to ingest customer data and configure a Firehose stream as a consumer to convert the data into Apache Parquet is incorrect. Although this could be a valid solution, it typically entails more development effort as Data Firehose does not support converting CSV files directly into Apache Parquet, unlike JSON.”

    This seems to suggest that Option B is not ideal due to format conversion limitations and added development overhead. I’d appreciate your insights to better understand the rationale.

    Best regards
    JR @ Tutorials Dojo
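
To make the "more development effort" point in the quoted explanation concrete: Data Firehose's record format conversion expects JSON input, so a CSV source would first need a transformation step, typically an AWS Lambda function attached to the Firehose stream. The following is only a rough sketch of such a function, not the exam's reference solution. The column names in `COLUMNS` are hypothetical (the question does not give a schema), and the handler assumes each incoming record holds one or more complete CSV rows.

```python
import base64
import csv
import io
import json

# Hypothetical column order of the incoming CSV rows; the thread gives no schema.
COLUMNS = ["customer_id", "name", "amount"]


def lambda_handler(event, context):
    """Firehose transformation handler: rewrite each CSV record as JSON lines."""
    output = []
    for record in event["records"]:
        try:
            # Firehose passes the record payload base64-encoded.
            text = base64.b64decode(record["data"]).decode("utf-8")
            rows = [row for row in csv.reader(io.StringIO(text)) if row]
            if not rows or any(len(row) != len(COLUMNS) for row in rows):
                raise ValueError("unexpected column count")
            # One JSON object per CSV row, newline-delimited.
            lines = "".join(
                json.dumps(dict(zip(COLUMNS, row))) + "\n" for row in rows
            )
            output.append({
                "recordId": record["recordId"],
                "result": "Ok",
                "data": base64.b64encode(lines.encode("utf-8")).decode("utf-8"),
            })
        except (ValueError, csv.Error):
            # Return the record unchanged and flag it as failed.
            output.append({
                "recordId": record["recordId"],
                "result": "ProcessingFailed",
                "data": record["data"],
            })
    return {"records": output}
```

Beyond this function, Option B would still need a schema defined in the AWS Glue Data Catalog for the JSON-to-Parquet conversion, which is part of the configuration overhead the explanation refers to.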
