<html xmlns="http://www.w3.org/1999/xhtml"><head><meta charset="utf-8"><meta name="generator" content="pdf2htmlEX"><meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1"><link rel="stylesheet" href="https://csdnimg.cn/release/download_crawler_static/css/base.min.css"><link rel="stylesheet" href="https://csdnimg.cn/release/download_crawler_static/css/fancy.min.css"><link rel="stylesheet" href="https://csdnimg.cn/release/download_crawler_static/8852531/raw.css"><script src="https://csdnimg.cn/release/download_crawler_static/js/compatibility.min.js"></script><script src="https://csdnimg.cn/release/download_crawler_static/js/pdf2htmlEX.min.js"></script><script>try{pdf2htmlEX.defaultViewer = new pdf2htmlEX.Viewer({});}catch(e){}</script><title></title></head><body><div id="sidebar" style="display: none"><div id="outline"></div></div><div id="pf1" class="pf w0 h0" data-page-no="1"><div class="pc pc1 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://csdnimg.cn/release/download_crawler_static/8852531/bg1.jpg"><div class="t m0 x1 h2 y1 ff1 fs0 fc0 sc0 ls0 ws0">www.it-ebooks.info</div><a class="l" rel='nofollow' onclick='return false;'><div class="d m1"></div></a></div><div class="pi" data-data='{"ctm":[1.568627,0.000000,0.000000,1.568627,0.000000,0.000000]}'></div></div></body></html>
<div id="pf2" class="pf w0 h0" data-page-no="2"><div class="pc pc2 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://csdnimg.cn/release/download_crawler_static/8852531/bg2.jpg"><div class="t m0 x2 h3 y2 ff2 fs1 fc1 sc0 ls0 ws0">Hadoop R<span class="_ _0"></span>eal-W<span class="_ _0"></span>or<span class="_ _1"></span>ld </div><div class="t m0 x2 h3 y3 ff2 fs1 fc1 sc0 ls0 ws0">Solutions Cookbook</div><div class="t m0 x2 h4 y4 ff3 fs2 fc1 sc0 ls0 ws0">R<span class="_ _2"></span>ealistic, simple code e<span class="_ _2"></span>xam<span class="_ _2"></span>ples t<span class="_ _2"></span>o solv<span class="_ _2"></span>e pr<span class="_ _2"></span>oblems at </div><div class="t m0 x2 h4 y5 ff3 fs2 fc1 sc0 ls0 ws0">scale with Hadoop and relat<span class="_ _2"></span>ed t<span class="_ _2"></span>echnologies</div><div class="t m0 x2 h5 y6 ff4 fs3 fc1 sc0 ls0 ws0">Jonathan R. Owens</div><div class="t m0 x2 h5 y7 ff4 fs3 fc1 sc0 ls0 ws0">Jon Lentz</div><div class="t m0 x2 h5 y8 ff4 fs3 fc1 sc0 ls0 ws0">Brian F<span class="_ _2"></span>emiano</div><div class="t m0 x2 h6 y9 ff3 fs0 fc1 sc0 ls0 ws0"> BIRMINGHAM - MUMBAI</div><div class="t m0 x1 h2 y1 ff1 fs0 fc0 sc0 ls0 ws0">www.it-ebooks.info</div><a class="l" rel='nofollow' onclick='return false;'><div class="d m1"></div></a></div><div class="pi" data-data='{"ctm":[1.568627,0.000000,0.000000,1.568627,0.000000,0.000000]}'></div></div>
<div id="pf3" class="pf w0 h0" data-page-no="3"><div class="pc pc3 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://csdnimg.cn/release/download_crawler_static/8852531/bg3.jpg"><div class="t m0 x2 h7 ya ff2 fs2 fc1 sc0 ls0 ws0">Hadoop R<span class="_ _2"></span>eal-W<span class="_ _3"></span>or<span class="_ _4"></span>ld Solutions Cookbook</div><div class="t m0 x2 h6 yb ff3 fs0 fc1 sc0 ls0 ws0">Copyright © 20<span class="_ _3"></span>13 Packt Publishing</div><div class="t m0 x2 h6 yc ff3 fs0 fc1 sc0 ls0 ws0">All rights reserved. No par<span class="_ _4"></span>t of this book ma<span class="_ _3"></span>y be reproduced, stored in a re<span class="_ _2"></span>triev<span class="_ _2"></span>al system, </div><div class="t m0 x2 h6 yd ff3 fs0 fc1 sc0 ls0 ws0">or transmitted in an<span class="_ _3"></span>y form or b<span class="_ _2"></span>y any means, without the prior writt<span class="_ _3"></span>en permission of the </div><div class="t m0 x2 h6 ye ff3 fs0 fc1 sc0 ls0 ws0">publisher<span class="_ _3"></span>, ex<span class="_ _3"></span>cept in the case of brief quotations embedded in critical ar<span class="_ _5"></span>ticles or revie<span class="_ _3"></span>ws.</div><div class="t m0 x2 h6 yf ff3 fs0 fc1 sc0 ls0 ws0">Every ef<span class="_ _5"></span>fort has been made in the preparation of this book to ensure the accuracy of the </div><div class="t m0 x2 h6 y10 ff3 fs0 fc1 sc0 ls0 ws0">information present<span class="_ _3"></span>ed. Howe<span class="_ _3"></span>ver<span class="_ _3"></span>, the information contained in this book is sold without </div><div class="t m0 x2 h6 y11 ff3 fs0 fc1 sc0 ls0 ws0">warranty<span class="_ _0"></span>, either express or implied. Neither the authors, nor Packt Publishing, and its </div><div class="t m0 x2 h6 y12 ff3 fs0 fc1 sc0 ls0 ws0">dealers and distributors will be held liable f<span class="_ _3"></span>or any damages caused or alleged to be </div><div class="t m0 x2 h6 y13 ff3 fs0 fc1 sc0 ls0 ws0">caused directly or indirectly by this book.</div><div class="t m0 x2 h6 y14 ff3 fs0 fc1 sc0 ls0 ws0">Packt Publishing has endea<span class="_ _3"></span>vored to pr<span class="_ _2"></span>ovide trademark inf<span class="_ _3"></span>ormation about all of the </div><div class="t m0 x2 h6 y15 ff3 fs0 fc1 sc0 ls0 ws0">companies and pr<span class="_ _2"></span>oducts mentioned in this book by the appr<span class="_ _3"></span>opriate use of capitals. </div><div class="t m0 x2 h6 y16 ff3 fs0 fc1 sc0 ls0 ws0">How<span class="_ _2"></span>ev<span class="_ _3"></span>er<span class="_ _3"></span>, Packt Publishing cannot guarantee the accuracy of this inf<span class="_ _3"></span>ormation.</div><div class="t m0 x2 h6 y17 ff3 fs0 fc1 sc0 ls0 ws0">First published: February 20<span class="_ _2"></span>13</div><div class="t m0 x2 h6 y18 ff3 fs0 fc1 sc0 ls0 ws0">Production R<span class="_ _3"></span>eference: 1280<span class="_ _3"></span>113</div><div class="t m0 x2 h6 y19 ff3 fs0 fc1 sc0 ls0 ws0">Published by Packt Publishing Ltd. </div><div class="t m0 x2 h6 y1a ff3 fs0 fc1 sc0 ls0 ws0">Livery Place </div><div class="t m0 x2 h6 y1b ff3 fs0 fc1 sc0 ls0 ws0">35 Livery Street </div><div class="t m0 x2 h6 y1c ff3 fs0 fc1 sc0 ls0 ws0">Birmingham B3 2PB, UK.</div><div class="t m0 x2 h6 y1d ff3 fs0 fc1 sc0 ls0 ws0">ISBN 9<span class="_ _0"></span>78-1-8<span class="_ _5"></span>495<span class="_ _3"></span>1-9<span class="_ _3"></span>12-0</div><div class="t m0 x2 h8 y1e ff5 fs4 fc1 sc0 ls0 ws0">www.packtpub.com</div><div class="t m0 x2 h6 y1f ff3 fs0 fc1 sc0 ls0 ws0">Cov<span class="_ _3"></span>er Image by iStockPhot<span class="_ _3"></span>o</div><div class="t m0 x1 h2 y1 ff1 fs0 fc0 sc0 ls0 ws0">www.it-ebooks.info</div><a class="l" rel='nofollow' onclick='return false;'><div class="d m1"></div></a></div><div class="pi" data-data='{"ctm":[1.568627,0.000000,0.000000,1.568627,0.000000,0.000000]}'></div></div>
<div id="pf4" class="pf w0 h0" data-page-no="4"><div class="pc pc4 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://csdnimg.cn/release/download_crawler_static/8852531/bg4.jpg"><div class="t m0 x3 h9 y20 ff2 fs5 fc1 sc0 ls0 ws0">Cr<span class="_ _5"></span>edits</div><div class="t m0 x4 ha y21 ff4 fs6 fc1 sc0 ls0 ws0">Authors</div><div class="t m0 x5 ha y22 ff3 fs6 fc1 sc0 ls0 ws0">Jonathan R. Owens</div><div class="t m0 x5 ha y23 ff3 fs6 fc1 sc0 ls0 ws0">Jon Lentz</div><div class="t m0 x5 ha y24 ff3 fs6 fc1 sc0 ls0 ws0">Brian Femiano</div><div class="t m0 x4 ha y25 ff4 fs6 fc1 sc0 ls1 ws1">Reviewer<span class="_ _5"></span>s</div><div class="t m0 x5 ha y26 ff3 fs6 fc1 sc0 ls0 ws0">Edwar<span class="_ _2"></span>d J. Cody</div><div class="t m0 x5 ha y27 ff3 fs6 fc1 sc0 ls0 ws0">Daniel Jue</div><div class="t m0 x5 ha y28 ff3 fs6 fc1 sc0 ls0 ws0">Bruce C. Miller</div><div class="t m0 x4 ha y29 ff4 fs6 fc1 sc0 ls0 ws0">Acquisition Editor</div><div class="t m0 x5 ha y2a ff3 fs6 fc1 sc0 ls0 ws0">Robin de Jongh</div><div class="t m0 x4 ha y2b ff4 fs6 fc1 sc0 ls0 ws0">Lead T<span class="_ _3"></span>echnical Editor</div><div class="t m0 x5 ha y2c ff3 fs6 fc1 sc0 ls0 ws0">Azharuddin Sheikh</div><div class="t m0 x4 ha y2d ff4 fs6 fc1 sc0 ls0 ws0">T<span class="_ _3"></span>echnical Editor</div><div class="t m0 x5 ha y2e ff3 fs6 fc1 sc0 ls0 ws0">Dennis John</div><div class="t m0 x4 ha y2f ff4 fs6 fc1 sc0 ls0 ws0">Copy Editors</div><div class="t m0 x5 ha y30 ff3 fs6 fc1 sc0 ls0 ws0">Brandt D'Mello</div><div class="t m0 x5 ha y31 ff3 fs6 fc1 sc0 ls0 ws0">Insiya Morbiwala</div><div class="t m0 x5 ha y32 ff3 fs6 fc1 sc0 ls0 ws0">Aditya Nair</div><div class="t m0 x5 ha y33 ff6 fs6 fc1 sc0 ls0 ws0">Alda Paiva</div><div class="t m0 x5 ha y34 ff3 fs6 fc1 sc0 ls0 ws0">Ruta W<span class="_ _3"></span>aghmare</div><div class="t m0 x6 ha y21 ff4 fs6 fc1 sc0 ls0 ws0">Project Coordinator</div><div class="t m0 x7 ha y22 ff3 fs6 fc1 sc0 ls0 ws0">Abhishek K<span class="_ _2"></span>ori</div><div class="t m0 x6 ha y35 ff4 fs6 fc1 sc0 ls0 ws0">Proofreader</div><div class="t m0 x7 ha y36 ff3 fs6 fc1 sc0 ls0 ws0">Stephen Silk</div><div class="t m0 x6 ha y37 ff4 fs6 fc1 sc0 ls0 ws0">Index<span class="_ _3"></span>er</div><div class="t m0 x7 ha y38 ff3 fs6 fc1 sc0 ls0 ws0">Monica Ajmera Mehta</div><div class="t m0 x6 ha y39 ff4 fs6 fc1 sc0 ls0 ws0">Graphics</div><div class="t m0 x7 ha y3a ff3 fs6 fc1 sc0 ls0 ws0">Conidon Miranda</div><div class="t m0 x6 ha y3b ff4 fs6 fc1 sc0 ls0 ws0">Layout Coordinator</div><div class="t m0 x7 ha y3c ff3 fs6 fc1 sc0 ls0 ws0">Conidon Miranda</div><div class="t m0 x6 ha y3d ff4 fs6 fc1 sc0 ls0 ws0">Cover W<span class="_ _3"></span>ork</div><div class="t m0 x7 ha y3e ff3 fs6 fc1 sc0 ls0 ws0">Conidon Miranda</div><div class="t m0 x1 h2 y1 ff1 fs0 fc0 sc0 ls0 ws0">www.it-ebooks.info</div><a class="l" rel='nofollow' onclick='return false;'><div class="d m1"></div></a></div><div class="pi" data-data='{"ctm":[1.568627,0.000000,0.000000,1.568627,0.000000,0.000000]}'></div></div>
<div id="pf5" class="pf w0 h0" data-page-no="5"><div class="pc pc5 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://csdnimg.cn/release/download_crawler_static/8852531/bg5.jpg"><div class="t m0 x8 h9 y20 ff2 fs5 fc1 sc0 ls0 ws0">About the A<span class="_ _2"></span>uthor<span class="_ _4"></span>s</div><div class="t m0 x2 hb y3f ff4 fs7 fc1 sc0 ls0 ws0">Jonathan R. Owens<span class="ff3 fs0"> has a background in Ja<span class="_ _3"></span>va and C++, and has work<span class="_ _3"></span>ed in both privat<span class="_ _2"></span>e </span></div><div class="t m0 x2 h6 y40 ff3 fs0 fc1 sc0 ls0 ws0">and public sectors as a software engineer<span class="_ _3"></span>. Most recently<span class="_ _3"></span>, he has been working with Hadoop </div><div class="t m0 x2 h6 y41 ff3 fs0 fc1 sc0 ls0 ws0">and related distribut<span class="_ _3"></span>ed processing technologies.</div><div class="t m0 x2 h6 y42 ff3 fs0 fc1 sc0 ls0 ws0">Currently<span class="_ _3"></span>, he works f<span class="_ _3"></span>or comScore, Inc., a widely regarded digital measurement and analytics </div><div class="t m0 x2 h6 y43 ff3 fs0 fc1 sc0 ls0 ws0">compan<span class="_ _3"></span>y. A<span class="_ _3"></span>t comScore, he is a member of the core processing team, which uses Hadoop </div><div class="t m0 x2 h6 y44 ff3 fs0 fc1 sc0 ls0 ws0">and other cust<span class="_ _2"></span>om distributed syst<span class="_ _3"></span>ems to aggregate, analyze, and manage o<span class="_ _3"></span>ver 40 billion </div><div class="t m0 x2 h6 y45 ff3 fs0 fc1 sc0 ls0 ws0">transactions per day<span class="_ _0"></span>.</div><div class="t m0 x9 h6 y46 ff3 fs0 fc1 sc0 ls0 ws0">I would lik<span class="_ _3"></span>e to thank my parents James and Patricia Ow<span class="_ _3"></span>ens, for their support </div><div class="t m0 x9 h6 y47 ff3 fs0 fc1 sc0 ls0 ws0">and introducing me t<span class="_ _3"></span>o technology at a young age.</div><div class="t m0 x2 hb y48 ff4 fs7 fc1 sc0 ls0 ws0">Jon Lentz<span class="ff3 fs0"> is a Sof<span class="_ _5"></span>tware Engineer on the core pr<span class="_ _3"></span>ocessing team at comScore, Inc., an online </span></div><div class="t m0 x2 h6 y49 ff3 fs0 fc1 sc0 ls0 ws0">audience measurement and analytics compan<span class="_ _3"></span>y. He pref<span class="_ _3"></span>ers to do most of his coding in Pig. </div><div class="t m0 x2 h6 y4a ff3 fs0 fc1 sc0 ls0 ws0">Before w<span class="_ _3"></span>orking at comScore, he developed software t<span class="_ _2"></span>o optimize supply chains and allocate </div><div class="t m0 x2 h6 y4b ff6 fs0 fc1 sc0 ls0 ws0">xed-income securities.</div><div class="t m0 x9 h6 y4c ff3 fs0 fc1 sc0 ls0 ws0">T<span class="_ _0"></span>o my daughter<span class="_ _0"></span>, Emma, born during the writing of this book. Thanks for the </div><div class="t m0 x9 h6 y4d ff3 fs0 fc1 sc0 ls0 ws0">compan<span class="_ _3"></span>y on late nights.</div><div class="t m0 x1 h2 y1 ff1 fs0 fc0 sc0 ls0 ws0">www.it-ebooks.info</div><a class="l" rel='nofollow' onclick='return false;'><div class="d m1"></div></a></div><div class="pi" data-data='{"ctm":[1.568627,0.000000,0.000000,1.568627,0.000000,0.000000]}'></div></div>