Web Page Ranking using Hadoop

Abstract

With a dramatic growth of the world-wide web exceeding 800 million pages, quality of the search results are given importance more than the content of the page. The quality of the page is determined by using web page ranking where the importance of the page depends on the importance of its parent page. For very large sub-graphs of the web, page rank can be computed with limited memory using Hadoop.

Pages in XML format are given as input for Page Ranking program. The forward and backward links are used to compute the rank of a page.

Related Projects

49 Replies to “Web Page Ranking using Hadoop”

  1. Hi,

    I’m working with Flume on fetching Live Streaming Data i.e . Tweets from Twitter app. Please send me this project so that I can Link this facility for more interactive way of representing data!

    Thanks

Leave a Reply

Your email address will not be published. Required fields are marked *