I wanted to capture some observations and documentation that I think are useful for continued exploration of AWS (specifically S3, EMR, and Redshift). The notes below are broken up by service, and I hope they aid others in their research.
Simple Storage Service (S3)
S3 is simple and easy to use. I don’t think it requires much explanation, so here is a link or two to the Amazon documentation.
And because I’m partial to Python:
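To give a feel for how little code basic S3 usage takes, here is a minimal sketch of writing an object and listing keys from Python. It assumes the boto3 library (my choice for illustration, not necessarily the library these notes originally pointed to) and credentials already configured in the environment. The bucket name and key in `main` are hypothetical placeholders.

```python
def upload_text(s3_client, bucket, key, text):
    """Store a string as an S3 object under the given bucket and key."""
    s3_client.put_object(Bucket=bucket, Key=key, Body=text.encode("utf-8"))


def list_keys(s3_client, bucket, prefix=""):
    """Return the keys under a prefix (first page of results only)."""
    response = s3_client.list_objects_v2(Bucket=bucket, Prefix=prefix)
    # "Contents" is absent from the response when nothing matches the prefix.
    return [obj["Key"] for obj in response.get("Contents", [])]


def main():
    # boto3 is a third-party package (pip install boto3); imported here so the
    # helpers above can be exercised without it.
    import boto3

    s3 = boto3.client("s3")
    # "my-example-bucket" and the key are hypothetical; substitute your own.
    upload_text(s3, "my-example-bucket", "notes/hello.txt", "hello from Python")
    print(list_keys(s3, "my-example-bucket", prefix="notes/"))
```

Passing the client in as an argument keeps the helpers easy to try against a stub before pointing them at a real bucket. Calling `main()` performs real requests against your AWS account.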
Elastic MapReduce (EMR)
General EMR docs:
Big Data specific things: