pagerank-master(java4)

所属分类:Java编程
开发工具:Java
文件大小:7KB
下载次数:0
上传日期:2018-05-28 19:26:43
上 传 者1710218787@qq.com
说明:  PageRank是Google算法的重要内容。2001年9月被授予美国专利,专利人是Google创始人之一拉里·佩奇(Larry Page)
(PageRank is an important part of the Google algorithm. In September 2001, it was granted the US patent. The patentee is one of the founders of Google, Larry Page (Larry).)

文件列表:
src (0, 2013-12-07)
src\rank (0, 2013-12-07)
src\rank\Graph.java (3536, 2013-12-07)
src\rank\InitRanks.java (2617, 2013-12-07)
src\rank\PageRank.java (4745, 2013-12-07)
src\rank\Top10.java (4391, 2013-12-07)
src\rank\Trigger.java (1586, 2013-12-07)

pagerank ======== This is implementation of PageRank algorithm using Java Trigger.java The Main() function of the program is located in this file. The execution starts from here. It collects the command line arguments and calls the subsequent jobs from the main function. Initially Graph properties job is called first, InitRanks second, PageRank third and Top10 ranks fourth. Graph.java This class creates a job, Mapper & Reducer for calculating graph properties like Minimum out-degree, Maximum out- degree, Average out-degree, Number of nodes and Number of edges. It creates the job in GraphPropertiesJob() function. InitRanks.java This class contains a job, Mapper & Reducer for initializing the page ranks to all nodes. Call to mapper is made from the job function, InitRankJob(). PageRank.java This class contains a job, Mapper & Reducer for calculating the Page Ranks for all the nodes. It also considers the Damping factor while calculating the rank for next iteration. Top10.java This class contains a job, Mapper & Reducer implementations for getting the Top 10 nodes in the result from Page Rank job. The ranks are ordered in Descending order. steps to run ============ The pagerank.jar file takes 3 command line arguments They are: 1. input directory name 2. output directory starting name (this name will be used to create the directories for all jobs, eg,. If argument is “output” then output for graph properties will be created as “outputGraph”, for Initialize ranks it will be created as “output1” and the remaining “output” will be created for calculating page ranks till they converge and for Top 10 will be created as “outputTop10”. 3. Damping Factor

近期下载者

相关文件


收藏者