I am about to go live with my first production Hadoop job for a client as a proof of concept.
I found that a lot of the documentation out there is quite text dense, unnecessarily detailed, or out of date, which is frustrating when you’re just trying to get your first cluster up and your first MapReduce job submitted.
For that reason, I’ve decided to write up the guide I wish I had whilst first getting up and running with Hadoop – a simple step by step guide Hadoop up my preferred host, Amazon EC2.
Please click here for the PowerPoint presentation. I hope it’s helpful to someone.