Enterprise Architect at HCL Technologies, a $7 billion IT services organization. My role is to work as a technology partner for large enterprise customers, providing them with low-cost open source solutions around Java, Spring and the vFabric stack. I am also working on various projects involving cloud-based solutions, mobile applications and business analytics around the Spring and vFabric space. Over 23 years, I have built a repository of technologies and tools I liked and used extensively in my day-to-day work. In this blog, I am collecting these best practices and tools to help the people who visit my website. Krishna is a DZone MVB, is not an employee of DZone, and has posted 64 posts at DZone. You can read more from them at their website.

Meet Amazon EC2: Big Data on the Cloud

I have been exploring Amazon EC2 as a cloud-based alternative for a midsized software product company. I stumbled across this slideshow presentation from Netflix, a pioneer in running its entire IT on Amazon’s EC2 platform. According to Netflix, they don’t have any data center. Amazing, isn’t it?

So I started exploring Amazon EC2 and how a company can run its entire IT on Amazon EC2. I was most interested in the technology standpoint.

For starters, on Amazon you can create various Linux instances, including Ubuntu, for free. They are elastic servers, so you can increase the RAM and processing power on demand. Once you set up the instance, you can ssh into the machine and do pretty much whatever you want. Refer to this YouTube link for how to set up Amazon.
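As a rough illustration of the ssh step, here is a minimal Python sketch that only builds the login command and prints it. The host name and key file are hypothetical placeholders, not values from this post, and the default user `ubuntu` is an assumption that holds for the stock Ubuntu images; other images use a different login name.

```python
import shlex


def build_ssh_command(host, key_path, user="ubuntu"):
    """Return the argument list for an SSH login to an EC2 instance.

    `user` defaults to "ubuntu", which is an assumption that matches
    stock Ubuntu images only.
    """
    return ["ssh", "-i", key_path, f"{user}@{host}"]


if __name__ == "__main__":
    # Hypothetical placeholders: substitute your instance's public DNS
    # name and the private key file of the key pair you launched it with.
    cmd = build_ssh_command("ec2-host.example.com", "my-key.pem")
    print(shlex.join(cmd))
```

The sketch deliberately stops short of running the command, so it is safe to try without an instance; passing the list to `subprocess.run` would open the actual session.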

According to Amazon, you get 750 hours of free server usage per month; in simple words, that is plenty for testing your business idea. There are standard Amazon Machine Images (AMIs) that come with various pre-configured stacks, including LAMP. Developing a decent web application and exposing it to users is easy.
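To put the 750 hours in perspective, the longest month has 31 × 24 = 744 hours, so the allowance roughly covers one instance running around the clock. A small sketch of that arithmetic follows; the function names are made up for illustration, and it assumes the allowance is shared across all your instances rather than granted per instance.

```python
import calendar

FREE_HOURS_PER_MONTH = 750  # the figure quoted above


def hours_in_month(year, month):
    """Number of wall-clock hours in the given calendar month."""
    days = calendar.monthrange(year, month)[1]
    return days * 24


def free_hours_left(year, month, instances=1):
    """Free hours remaining if `instances` servers run non-stop all month.

    A negative result means the allowance is exceeded (assuming the
    750 hours are pooled across instances).
    """
    return FREE_HOURS_PER_MONTH - instances * hours_in_month(year, month)


if __name__ == "__main__":
    print(hours_in_month(2023, 1))      # one 31-day month
    print(free_hours_left(2023, 1))     # one instance, always on
    print(free_hours_left(2023, 1, 2))  # two instances, always on
```

One always-on instance fits within the allowance even in a 31-day month, while two always-on instances would not.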

The interesting thing I noticed is that it has good Hadoop and MapReduce support. For more details on how to set up Hadoop on Amazon, refer to this YouTube link. There are a few command-line interface (CLI) tools to manage Elastic MapReduce (EMR).

On Amazon, the equivalent of HDFS is S3, and the equivalent of Hadoop is Elastic MapReduce.
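For readers new to the MapReduce model that Hadoop and Elastic MapReduce run, here is a minimal in-memory word count in plain Python. It is only a sketch of the programming model: it does not call EMR, Hadoop, or S3, and the function names are my own. On a real cluster the input lines would come from storage such as S3 and the phases would run in parallel across machines.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: collapse the values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence over an iterable of lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data on the cloud", "big cloud"]))
```

The framework's job is everything this sketch skips: splitting the input, distributing the map and reduce tasks, and moving the grouped data between them.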


Published at DZone with permission of Krishna Prasad, author and DZone MVB. (source)
