parallel-high-betweenness-centrality

所属分类:图形图像处理
开发工具:C++
文件大小:425KB
下载次数:0
上传日期:2014-06-30 19:54:06
上 传 者sh-1993
说明:  基于MPI的大型图高中心点检测算法
(MPI based algorithm for detecting high centrality vertices in large graphs)

文件列表:
GeneralInstallationGuide.md (5254, 2014-07-01)
QuickStart.md (2761, 2014-07-01)
msl-bc-mpi (0, 2014-07-01)
msl-bc-mpi\4elt.graph (516441, 2014-07-01)
msl-bc-mpi\4elt.graph.part.2 (31212, 2014-07-01)
msl-bc-mpi\4elt.txt (970436, 2014-07-01)
msl-bc-mpi\ebetwn.cpp (4850, 2014-07-01)
msl-bc-mpi\makefile (483, 2014-07-01)
msl-bc-mpi\parallel_safe_leaf_compression.hpp (2584, 2014-07-01)
msl-bc-mpi\parallel_safe_partitioned_sssp.hpp (83917, 2014-07-01)
msl-bc-mpi\run-msl-bc.sh (494, 2014-07-01)

Overview -------- Each community is comprised of several key players which have an impact on its overall behavior and play critical roles. Detecting what these players are is critical in various domains such as identifying key individuals or places when tracking criminal and terrorist activities. Our algorithm is able to quickly uncover the top n influential nodes located at major communication cross points inside a given community. It is designed to work on distributed environments making it suited for processing geographically spread data found in cloud systems. Objectives ---------- The objective of the algorithm is to enable fast identification of top n nodes in a distributed sparse graph. Benefits -------- It is beneficial to data scientists seeking fast identification of top central players in large geographically distributed graph data. Measures of effectiveness ------------------------- The algorithm has been tested using MPI on an HPC cluster with a varying number of nodes ranging from 4 to *** with 2 workers per node, each worker with 8 GB memory. Each node consists of two Quad-core AMD Opteron 2376 2.3 GHz processors. Speed-ups of up to 12x have been observed for synthetic and real-life sparse graphs up to 1M vertices and 3M edges. Required Skill Sets ------------------- * To use the algorithm on a given data set: * Required * Familiarity with Linux * Manipulating graph data (potentially to convert the given data to the Metis graph format) * Ability to run MPI programs on a single node or cluster depending on graph size * Good to have * To use the quick start guide, familiarity with Virtual Box and virtualization * C++ * MPI Library * To setup the environment on a cluster * Cluster administration knowledge * Setting up MPI environment on cluster How to get it --------------- The algorithm is hosted at the git hub repository at https://github.com/usc-cloud/parallel-high-betweenness-centrality Clone the repository using git clone https://github.com/usc-cloud/parallel-high-betweenness-centrality.git Note: You may need to install a git client to download the repository. Installation ------------ A quick start guide can be found [here](https://github.com/usc-cloud/parallel-high-betweenness-centrality/blob/master/QuickStart.md) together with a precompiled VM to help you get started. A detailed guide on how to install the software on a distributed setup can be found [here](https://github.com/usc-cloud/parallel-high-betweenness-centrality/blob/master/GeneralInstallationGuide.md). Future enhancements ------------------- Given the increasing popularity of cloud oriented graph analytics frameworks a GoFFish version of the proposed algorithm will be delivered.

近期下载者

相关文件


收藏者