Home › Forums › AWS › AWS Certified Data Engineer Associate DEA-C01 › A data engineer needs to perform a one-time, ad-hoc query › Reply To: A data engineer needs to perform a one-time, ad-hoc query
-
Hi drsparrow,
Thank you for pointing this out.
The scenario in which this item is written leaves out an important detail. Athena itself doesn’t automatically know about the data stored in S3. To query it, the dataset must first be registered in the AWS Glue Data Catalog. You can accomplish this by running a Glue crawler, manually creating a table, or using other Glue-supported methods. Once the table is defined in the catalog, Athena can efficiently query only the specific columns you need without managing any infrastructure.
We’ll update this item to reflect the accurate workflow better. Again, thanks for highlighting this.
Regards,
Nikee @ Tutorials Dojo