Cloud Zone is brought to you in partnership with:

I am a Webscience PhD student at the university of Koblenz and the Founder of http://www.metalcon.de Social news streams are my research interest. René is a DZone MVB and is not an employee of DZone and has posted 36 posts at DZone. You can read more from them at their website. View Full User Profile

Apache Giraph: Distributed Graph Processing in the Cloud

05.21.2012
| 6893 views |
  • submit to reddit
Claudio Martella introduces Apache Giraph which according to him is a loose implementation of Google Pregel which was introduced  on SIGMOD in 2010. He points out that Map Reduce cannot be used to do graph processing.

He then gave an example on how MapReduce can be used to to do page rank calculation. He points out that Pagerank can be calculated as a local property of a graph in a distributed way by calculating local pagerank from the knowledge of the neighbours. He did this to show what the Drawbacks of this method are in his oppinion:

  • job boostrap take some time
  • disk is hit about 6  times
  • Data is sorted
  • Graph is passed through

Like in the Pregel Paper he says that other Graphalgorithms like singlesource shortest paths have the same problems. 

 

Claudio Martella from Apache explains how giraph works at in the graph dev room @ Fosdem 2012
Claudio Martella from Apache explains how giraph works at in the graph dev room @ Fosdem 2012

 

After introducing more about implementing Pregle ontop of the existing MapReduce structure for distributing he says that this system has some advantages over MapReduce

  • it’s a stateful computation
  • Disk is hit if/only for checkpoints
  • No sorting is necessary
  • Only messages hit the network

He points out that the advantages of Giraph over other methods (Hama, GoldenOrb, Signal/Collect) are especially an active community (Facebook, Yahoo, Linkedin, Twitter) behind this project. I personally think another advantage is that it is run by Apache who already run MapReduce (Hadoop) with great success. So it is something that people trust…

Claudio points out explicitly that they are searching for more contributors and I think this is really an interesting topic to work on! So thank Claudio for your inspiring work!

here the video streams from the graph dev room:

 

 

Published at DZone with permission of René Pickhardt, author and DZone MVB. (source)

(Note: Opinions expressed in this article and its replies are the opinions of their respective authors and not those of DZone, Inc.)

Comments

Herry Johnson replied on Tue, 2012/06/12 - 2:04pm

Wouldn't it be more like write the app once, but make sure it can run as both an application (executable jar, run specific main() method which initializes app as a JFrame) and an applet?

I'm asking out of general interest, I never bother with applets so I've never had to look into it before.

John Smith replied on Sun, 2013/02/17 - 4:16am

 I am definitely enjoying your website. You definitely have some great insight and great stories.
        Relationship Advice by Genesis INC
    

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.