pagerank-master(java4)
所属分类:Java编程
开发工具:Java
文件大小:7KB
下载次数:0
上传日期:2018-05-28 19:26:43
上 传 者:
1710218787@qq.com
说明: PageRank是Google算法的重要内容。2001年9月被授予美国专利,专利人是Google创始人之一拉里·佩奇(Larry Page)
(PageRank is an important part of the Google algorithm. In September 2001, it was granted the US patent. The patentee is one of the founders of Google, Larry Page (Larry).)
文件列表:
src (0, 2013-12-07)
src\rank (0, 2013-12-07)
src\rank\Graph.java (3536, 2013-12-07)
src\rank\InitRanks.java (2617, 2013-12-07)
src\rank\PageRank.java (4745, 2013-12-07)
src\rank\Top10.java (4391, 2013-12-07)
src\rank\Trigger.java (1586, 2013-12-07)
pagerank
========
This is implementation of PageRank algorithm using Java
Trigger.java
The Main() function of the program is located in this file. The execution starts from here. It collects the command line arguments and calls the subsequent jobs from the main function. Initially Graph properties job is called first, InitRanks second, PageRank third and Top10 ranks fourth.
Graph.java
This class creates a job, Mapper & Reducer for calculating graph properties like Minimum out-degree, Maximum out- degree, Average out-degree, Number of nodes and Number of edges. It creates the job in GraphPropertiesJob() function.
InitRanks.java
This class contains a job, Mapper & Reducer for initializing the page ranks to all nodes. Call to mapper is made from the job function, InitRankJob().
PageRank.java
This class contains a job, Mapper & Reducer for calculating the Page Ranks for all the nodes. It also considers the Damping factor while calculating the rank for next iteration.
Top10.java
This class contains a job, Mapper & Reducer implementations for getting the Top 10 nodes in the result from Page Rank job. The ranks are ordered in Descending order.
steps to run
============
The pagerank.jar file takes 3 command line arguments They are:
1. input directory name
2. output directory starting name (this name will be used to create the directories for all jobs, eg,. If argument is “output”
then output for graph properties will be created as “outputGraph”, for Initialize ranks it will be created as “output1” and the remaining “output” will be created for calculating page ranks till they converge and for Top 10 will be created as “outputTop10”.
3. Damping Factor
近期下载者:
相关文件:
收藏者: