Web Page Ranking using Hadoop


With the World Wide Web having grown dramatically to more than 800 million pages, the quality of search results matters more than the content of any single page. Page quality is determined by web page ranking, in which the importance of a page depends on the importance of its parent pages, that is, the pages that link to it. For very large subgraphs of the web, page rank can be computed with limited memory using Hadoop.
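
To make the idea concrete, here is a minimal sketch of one way the rank computation can be phrased as map and reduce steps. It is plain Python that imitates the map/shuffle/reduce flow in memory rather than actual Hadoop code, and the damping factor of 0.85, the function names, and the tiny three-page graph are all assumptions for illustration, not details from the original program.

```python
from collections import defaultdict

DAMPING = 0.85  # conventional damping factor; an assumption, not from the post


def map_page(page, rank, out_links):
    """Map step: a page splits its rank evenly among the pages it links to."""
    # Re-emit the link list so the graph structure survives the iteration.
    yield page, ("links", out_links)
    for target in out_links:
        yield target, ("rank", rank / len(out_links))


def reduce_page(page, values, num_pages):
    """Reduce step: sum contributions from parent pages, then apply damping."""
    out_links, incoming = [], 0.0
    for kind, payload in values:
        if kind == "links":
            out_links = payload
        else:
            incoming += payload
    new_rank = (1 - DAMPING) / num_pages + DAMPING * incoming
    return page, new_rank, out_links


def pagerank_iteration(graph, ranks):
    """One map/shuffle/reduce pass over a graph given as {page: [out_links]}."""
    grouped = defaultdict(list)  # stands in for Hadoop's shuffle phase
    for page, out_links in graph.items():
        for key, value in map_page(page, ranks[page], out_links):
            grouped[key].append(value)
    new_ranks = {}
    for page in graph:
        _, new_rank, _ = reduce_page(page, grouped[page], len(graph))
        new_ranks[page] = new_rank
    return new_ranks


def pagerank(graph, iterations=20):
    """Repeat the pass a fixed number of times, starting from uniform ranks.

    Simplification: every link target is assumed to be a key in `graph`, and
    pages without outgoing links are not given any special treatment.
    """
    ranks = {page: 1.0 / len(graph) for page in graph}
    for _ in range(iterations):
        ranks = pagerank_iteration(graph, ranks)
    return ranks


example_graph = {"A": ["B", "C"], "B": ["C"], "C": ["A"]}
example_ranks = pagerank(example_graph)
```

In this example, page C ends up ranked above page B because it is linked from both A and B, while B is linked only from A. Each pass only needs one page's record at a time in the map and reduce functions, which is the property that lets a real Hadoop job handle graphs too large for memory.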

Pages in XML format are given as input to the page ranking program. The forward links (from a page to the pages it points to) and backward links (from the pages that point to it) are used to compute the rank of a page.
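
As a rough illustration of this input step, the sketch below reads pages from XML and builds both link directions. The post does not describe the actual XML schema, so the `<pages>`, `<page>`, `<title>`, and `<link>` element names, the sample data, and the `parse_links` function are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical input format; the real schema is not given in the post.
SAMPLE_XML = """
<pages>
  <page><title>A</title><link>B</link><link>C</link></page>
  <page><title>B</title><link>C</link></page>
  <page><title>C</title><link>A</link></page>
</pages>
"""


def parse_links(xml_text):
    """Return (forward, backward) link maps keyed by page title."""
    root = ET.fromstring(xml_text)
    forward = {}   # page -> pages it links to
    backward = {}  # page -> pages that link to it (its parents)
    for page in root.iter("page"):
        title = page.findtext("title")
        targets = [link.text for link in page.findall("link")]
        forward[title] = targets
        backward.setdefault(title, [])  # keep pages that nobody links to
        for target in targets:
            backward.setdefault(target, []).append(title)
    return forward, backward


forward_links, backward_links = parse_links(SAMPLE_XML)
```

Here `forward_links["A"]` lists the pages A points to, and `backward_links["C"]` lists the parents of C, whose ranks determine the rank of C.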
