On the note of checking chunk content, I think it’s also worth mentioning that AWS doesn’t charge for data transfer between an S3 bucket and other AWS services in the same region.
So, if you really want (or need) to check the content of chunks stored in S3 for some reason, it might be cheaper and/or faster to perform the check using an EC2 instance. (This is especially true if you already have an EC2 instance that you’re willing to use.)
To be clear, there are still some costs associated with checking chunk content with S3:
- You’ll still be charged for S3 GET and LIST requests.
- If you’re using S3 Intelligent Tiering, reading older chunks will still transition them back to the more expensive frequent access tier.
- If you’re using other storage classes that have retrieval fees, you’ll still be billed those retrieval fees.
With that said though, if you already have your mind set on checking chunk content, these costs are relevant regardless of whether you use an EC2 instance or not.
This means the main thing to consider is whether the cost of the EC2 instance would be cheaper than the S3 data transfer costs.
I haven’t personally tried this, so it’s hard for me to say for sure whether it’s actually cheaper or not. I figured I’d at least point it out for those interested in checking chunk content with S3 storage though.