<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta charset="utf-8">
<meta name="generator" content="pdf2htmlEX">
<meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1">
<link rel="stylesheet" href="https://static.pudn.com/base/css/base.min.css">
<link rel="stylesheet" href="https://static.pudn.com/base/css/fancy.min.css">
<link rel="stylesheet" href="https://static.pudn.com/prod/directory_preview_static/62b726d7405aad31f7009eed/raw.css">
<script src="https://static.pudn.com/base/js/compatibility.min.js"></script>
<script src="https://static.pudn.com/base/js/pdf2htmlEX.min.js"></script>
<script>
try{
pdf2htmlEX.defaultViewer = new pdf2htmlEX.Viewer({});
}catch(e){}
</script>
<title></title>
</head>
<body>
<div id="sidebar" style="display: none">
<div id="outline">
</div>
</div>
<div id="pf1" class="pf w0 h0" data-page-no="1"><div class="pc pc1 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://static.pudn.com/prod/directory_preview_static/62b726d7405aad31f7009eed/bg1.jpg"><div class="c x0 y1 w2 h2"><div class="t m0 x1 h3 y2 ff1 fs0 fc0 sc0 ls0 ws0">-----<span class="_ _0"></span>-----<span class="_ _0"></span>-----<span class="_ _1"> </span><span class="ff2">分布式开发框架</span></div></div></div><div class="pi" data-data='{"ctm":[1.333333,0.000000,0.000000,1.333333,0.000000,0.000000]}'></div></div>
</body>
</html>
<div id="pf2" class="pf w0 h0" data-page-no="2"><div class="pc pc2 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://static.pudn.com/prod/directory_preview_static/62b726d7405aad31f7009eed/bg2.jpg"></div><div class="pi" data-data='{"ctm":[1.333333,0.000000,0.000000,1.333333,0.000000,0.000000]}'></div></div>
<div id="pf3" class="pf w0 h0" data-page-no="3"><div class="pc pc3 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://static.pudn.com/prod/directory_preview_static/62b726d7405aad31f7009eed/bg3.jpg"><div class="c x0 y1 w2 h2"><div class="t m0 x2 h4 y3 ff3 fs1 fc1 sc0 ls0 ws0"></div><div class="t m0 x3 h5 y4 ff2 fs2 fc2 sc0 ls0 ws0">纽约证券交易所每天<span class="_ _0"></span>产生<span class="_ _2"> </span><span class="ff1">1TB<span class="_ _1"> </span></span>的交易<span class="_ _0"></span>数据</div><div class="t m0 x2 h4 y5 ff3 fs1 fc1 sc0 ls0 ws0"></div><div class="t m0 x3 h5 y6 ff2 fs2 fc2 sc0 ls0 ws0">社交网站<span class="_ _2"> </span><span class="ff1">facebook<span class="_ _1"> </span></span>的<span class="_ _0"></span>主机存储着约<span class="_ _2"> </span><span class="ff1">10<span class="_ _1"> </span></span>亿张照片<span class="_ _0"></span>,</div><div class="t m0 x4 h5 y7 ff2 fs2 fc2 sc0 ls0 ws0">占据<span class="_ _1"> </span><span class="ff1">PB<span class="_ _2"> </span></span>级存储空间</div><div class="t m0 x2 h4 y8 ff3 fs1 fc1 sc0 ls0 ws0"></div><div class="t m0 x3 h5 y9 ff2 fs2 fc2 sc0 ls0 ws0">互联网档案馆存储着<span class="_ _0"></span>约<span class="_ _1"> </span><span class="ff1">2PB<span class="_ _2"> </span></span>数据,并以每月<span class="_ _0"></span>至少</div><div class="t m0 x4 h5 ya ff1 fs2 fc2 sc0 ls0 ws0">20TB<span class="_ _1"> </span><span class="ff2">的<span class="_ _0"></span>速度增长。</span></div><div class="t m0 x2 h4 yb ff3 fs1 fc1 sc0 ls0 ws0"></div><div class="t m0 x3 h5 yc ff2 fs2 fc2 sc0 ls0 ws0">瑞士日内瓦附近的大<span class="_ _0"></span>型强子对撞机每年<span class="_ _0"></span>产生约<span class="_ _2"> </span><span class="ff1">15PB</span></div><div class="t m0 x4 h5 yd ff2 fs2 fc2 sc0 ls0 ws0">的数据。</div><div class="t m0 x2 h6 ye ff1 fs2 fc2 sc0 ls0 ws0"> <span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span> </div><div class="t m0 x2 h4 yf ff3 fs1 fc1 sc0 ls0 ws0"></div><div class="t m0 x3 h5 y10 ff4 fs2 fc2 sc0 ls0 ws0"><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span><span class="fc3 sc0"> </span> <span class="_ _3"> </span><span class="ff2">这样的数据该怎么存储和<span class="_ _0"></span>读取?</span></div></div></div><div class="pi" data-data='{"ctm":[1.333333,0.000000,0.000000,1.333333,0.000000,0.000000]}'></div></div>
<div id="pf4" class="pf w0 h0" data-page-no="4"><div class="pc pc4 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://static.pudn.com/prod/directory_preview_static/62b726d7405aad31f7009eed/bg4.jpg"><div class="c x0 y1 w2 h2"><div class="t m0 x2 h5 y11 ff1 fs2 fc2 sc0 ls0 ws0">Facebook<span class="_ _2"> </span><span class="ff2">的服务器<span class="_ _0"></span>大概<span class="_ _1"> </span>1<span class="_ _4"> </span>万台,按照<span class="_ _2"> </span></span>oracle<span class="_ _1"> </span><span class="ff2">的标准</span></div><div class="t m0 x4 h5 y12 ff1 fs2 fc2 sc0 ls0 ws0">10g<span class="_ _1"> </span><span class="ff2">版本计<span class="_ _0"></span>算大约需要<span class="_ _2"> </span></span>21<span class="_ _1"> </span><span class="ff2">亿元</span></div></div></div><div class="pi" data-data='{"ctm":[1.333333,0.000000,0.000000,1.333333,0.000000,0.000000]}'></div></div>
<div id="pf5" class="pf w0 h0" data-page-no="5"><div class="pc pc5 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://static.pudn.com/prod/directory_preview_static/62b726d7405aad31f7009eed/bg5.jpg"><div class="c x0 y1 w2 h2"><div class="t m0 x2 h7 y13 ff3 fs3 fc4 sc0 ls0 ws0"></div><div class="t m0 x5 h3 y14 ff4 fs0 fc2 sc0 ls0 ws0">Hadoop <span class="_ _5"> </span><span class="ff2">一个分布式系统基础架构,由<span class="_ _5"> </span></span>Apache<span class="_ _5"> </span><span class="ff2">基金会开</span></div><div class="t m0 x5 h3 y15 ff2 fs0 fc2 sc0 ls0 ws0">发。用户可以在不了解分布式底层细节的情况下,开发分</div><div class="t m0 x5 h3 y16 ff2 fs0 fc2 sc0 ls0 ws0">布式程序。充分利用集群的威力高速运算和存储 。</div><div class="t m0 x2 h7 y17 ff3 fs3 fc4 sc0 ls0 ws0"></div><div class="t m0 x5 h3 y18 ff4 fs0 fc2 sc0 ls0 ws0"> Hadoop<span class="_ _5"> </span><span class="ff2">是项目的总称,主要是由分布式存储(<span class="_ _5"> </span></span>HDF</div><div class="t m0 x5 h3 y19 ff2 fs0 fc2 sc0 ls0 ws0">S<span class="_ _1"> </span>)、分布式计算(<span class="_ _5"> </span><span class="ff4">MapReduce<span class="_ _5"> </span></span>)组成 。</div><div class="t m0 x2 h7 y1a ff3 fs3 fc4 sc0 ls0 ws0"></div><div class="t m0 x5 h3 y1b ff4 fs0 fc2 sc0 ls0 ws0"> Hadoop<span class="_ _5"> </span><span class="ff2">程序目前只能运行在<span class="_ _5"> </span></span>Linux<span class="_ _5"> </span><span class="ff2">系统上,<span class="_ _5"> </span></span>window</div><div class="t m0 x5 h3 y1c ff2 fs0 fc2 sc0 ls0 ws0">上运行需要安装其他插件,安装过程见《<span class="_ _5"> </span><span class="ff4">hadoop<span class="_ _5"> </span></span>安装说</div><div class="t m0 x5 h3 y1d ff2 fs0 fc2 sc0 ls0 ws0">明<span class="_ _5"> </span><span class="ff4">.docx<span class="_ _5"> </span></span>》 。</div></div></div><div class="pi" data-data='{"ctm":[1.333333,0.000000,0.000000,1.333333,0.000000,0.000000]}'></div></div>