Administrator, January 16, 2024 at 4:24 pm
Thank you for sending this question over, and apologies for the late response. Let’s further analyze the scenario you shared:
A retail company is using Amazon OpenSearch Service to analyze its sales and inventory data. Every week, new data from an Amazon S3 Standard bucket is indexed and loaded into a 20-data node Amazon OpenSearch cluster. Read-only queries are performed on this data to monitor recent trends. After 1 week, it’s occasionally accessed for identifying long-term patterns. After three months, the index containing the older data is deleted from the system. However, due to audit requirements, the company needs to keep a complete copy of all processed data.
The company is looking for strategies to reduce storage costs without abandoning Amazon OpenSearch. A slower query response time on infrequently accessed data is acceptable as long as it can be retrieved on demand.
Which solution fits the requirements while being the MOST cost-effective?
Note that the scenario states that “a slower query response time on infrequently accessed data is acceptable as long as it can be retrieved on demand” and that the company is looking for the MOST cost-effective solution. Hence, the correct answer is “Downsize the OpenSearch cluster by reducing the number of its data nodes. Add UltraWarm nodes to compensate for the read capacity. Create an Index State Management (ISM) policy that moves data to cold storage after 1 week. Use an S3 lifecycle policy to transition data older than 3 months to the S3 Glacier Deep Archive.”
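To make the ISM part of that answer more concrete, here is a minimal sketch of what such a policy could look like, built as a Python dictionary and printed as JSON. The `warm_migration`, `cold_migration`, and `cold_delete` actions are the ones Amazon OpenSearch Service provides for UltraWarm and cold storage. The index pattern `sales-*`, the timestamp field `@timestamp`, and the helper name `build_ism_policy` are assumptions for illustration only and do not come from the scenario. The sketch also assumes the index passes through UltraWarm on its way to cold storage.

```python
import json


def build_ism_policy(timestamp_field="@timestamp", index_pattern="sales-*"):
    """Return an ISM policy body: hot -> warm -> cold after 7 days, delete after 90 days.

    timestamp_field and index_pattern are hypothetical placeholders.
    """
    return {
        "policy": {
            "description": "Move weekly indices to cold storage, delete after ~3 months.",
            "default_state": "hot",
            "states": [
                {
                    # New data is indexed and queried on the hot data nodes.
                    "name": "hot",
                    "actions": [],
                    "transitions": [
                        {"state_name": "warm", "conditions": {"min_index_age": "7d"}}
                    ],
                },
                {
                    # Migrate the index to UltraWarm, then continue to cold.
                    "name": "warm",
                    "actions": [{"warm_migration": {}}],
                    "transitions": [
                        {"state_name": "cold", "conditions": {"min_index_age": "7d"}}
                    ],
                },
                {
                    # Cold storage: cheapest tier that is still retrievable on demand.
                    "name": "cold",
                    "actions": [
                        {"cold_migration": {"timestamp_field": timestamp_field}}
                    ],
                    "transitions": [
                        {"state_name": "delete", "conditions": {"min_index_age": "90d"}}
                    ],
                },
                {
                    # After three months the index is removed from cold storage.
                    "name": "delete",
                    "actions": [{"cold_delete": {}}],
                },
            ],
            # Attach the policy automatically to matching new indices.
            "ism_template": [{"index_patterns": [index_pattern], "priority": 100}],
        }
    }


policy = build_ism_policy()
print(json.dumps(policy, indent=2))
```

The printed JSON is the kind of body you would submit when creating an ISM policy on the domain. Adjust the ages and the timestamp field to match your own indices before using anything like this.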
Transitioning data older than 3 months to the S3 Glacier Deep Archive meets the company’s requirement to keep a complete copy of all processed data for audit purposes. Amazon S3 Glacier Deep Archive is a secure, durable, and extremely low-cost Amazon S3 storage class, designed for the long-term retention of data that is accessed once or twice a year. On the other hand, S3 Standard-Infrequent Access (S3 Standard-IA) is a more expensive storage class than S3 Glacier Deep Archive, and its minimum storage duration is only 30 days. Since the company must keep the data for at least three months and accesses it only occasionally, S3 Glacier Deep Archive is the more cost-effective choice.
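As an illustration of the S3 side, the lifecycle rule described above could be sketched like this. The dictionary follows the shape that boto3's `put_bucket_lifecycle_configuration` accepts for its `LifecycleConfiguration` argument. The rule ID, the empty prefix (meaning the whole bucket), and the helper name `build_lifecycle_configuration` are assumptions made for the example.

```python
import json


def build_lifecycle_configuration(days=90, prefix=""):
    """Return a lifecycle configuration that moves objects to Deep Archive.

    days=90 approximates "older than 3 months"; prefix="" applies the rule
    to every object in the bucket. Both values are illustrative.
    """
    return {
        "Rules": [
            {
                "ID": "archive-processed-data",  # hypothetical rule name
                "Status": "Enabled",
                "Filter": {"Prefix": prefix},
                "Transitions": [
                    {"Days": days, "StorageClass": "DEEP_ARCHIVE"}
                ],
            }
        ]
    }


lifecycle = build_lifecycle_configuration()
print(json.dumps(lifecycle, indent=2))
```

With boto3, this could then be applied with something like `s3.put_bucket_lifecycle_configuration(Bucket="your-bucket", LifecycleConfiguration=lifecycle)`, where the bucket name is a placeholder.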
For further reading, you can check the links below:
Hope this clears up any confusion. Please don’t hesitate to send a message if you need further assistance.
Nikee @ Tutorials Dojo