crawler

所属分类人工智能/神经网络/深度学习
开发工具:Java
文件大小:9871KB
下载次数:37
上传日期:2013-07-02 23:10:47
上 传 者蔷薇花
说明:  爬虫分布式版本实现,基于Map-Reduce进行了实现,非常有用
(Reptile distributed version achieved, based on Map-Reduce was realized very useful)

文件列表:[举报垃圾]
crawler
.......\bin
.......\...\crawler.sh,276,2010-02-01
.......\build.xml,4891,2010-02-01
.......\conf
.......\....\configuration.xsl,1311,2010-01-13
.......\....\joycrawler-csb.xml,2132,2010-02-01
.......\....\joycrawler-default.xml,2248,2010-02-01
.......\....\joycrawler-hadoop.xml,1971,2010-02-01
.......\....\joycrawler-hadoopcn.xml,1998,2010-02-01
.......\....\joycrawler-hi.xml,391,2010-02-01
.......\....\joycrawler-localhost.xml,1969,2010-02-01
.......\....\joycrawler-nyt.xml,2179,2010-02-01
.......\....\joycrawler-scst.xml,1965,2010-02-01
.......\....\joycrawler-wiki.xml,2153,2010-02-01
.......\....\log4j.properties,2955,2010-01-13
.......\lib
.......\...\commons-cli-2.0-SNAPSHOT.jar,258337,2010-01-13
.......\...\commons-httpclient-3.1.jar,305001,2010-01-13
.......\...\commons-logging-1.0.4.jar,38015,2010-01-13
.......\...\db.jar,632527,2010-01-13
.......\...\hadoop-0.20.1-core.jar,2682112,2010-01-13
.......\...\log4j-1.2.15.jar,391834,2010-01-13
.......\...\lucene-core-3.0.0.jar,1021623,2010-01-13
.......\...\lucene-smartcn-3.0.0.jar,3590447,2010-02-01
.......\...\lucene-snowball-3.0.0.jar,115093,2010-02-01
.......\...\native
.......\...\......\libdb_java48.dll,131072,2010-01-13
.......\...\nekohtml.jar,121981,2010-01-13
.......\...\xercesImpl.jar,1229289,2010-01-13
.......\...\xercesMinimal.jar,41531,2010-01-13
.......\...\xml-apis.jar,194354,2010-01-13
.......\LISENCE,17775,2010-01-13
.......\README.txt,1420,2010-02-01
.......\seeds-csb.txt,25,2010-02-01
.......\seeds-hadoop.txt,25,2010-01-13
.......\seeds-hadoopcn.txt,48,2010-01-15
.......\seeds-hi.txt,21,2010-02-01
.......\seeds-localhost.txt,17,2010-02-01
.......\seeds-nyt.txt,23,2010-01-15
.......\seeds-scst.txt,24,2010-01-13
.......\seeds-wiki.txt,42,2010-01-13
.......\src
.......\...\contrib
.......\...\.......\java
.......\...\.......\....\org
.......\...\.......\....\...\joy
.......\...\.......\....\...\...\analyzer
.......\...\.......\....\...\...\........\Analyzer.java,1657,2010-01-13
.......\...\.......\....\...\...\........\Document.java,2458,2010-01-13
.......\...\.......\....\...\...\........\DocumentCreationException.java,248,2010-01-13
.......\...\.......\....\...\...\........\DocumentFactory.java,3730,2010-01-13
.......\...\.......\....\...\...\........\Hit.java,2210,2010-01-13
.......\...\.......\....\...\...\........\HitAnalyzer.java,2529,2010-01-29
.......\...\.......\....\...\...\........\html
.......\...\.......\....\...\...\........\....\Anchor.java,1381,2010-01-13
.......\...\.......\....\...\...\........\....\HTMLDocument.java,4756,2010-01-13
.......\...\.......\....\...\...\........\....\Main.form,10002,2010-01-13
.......\...\.......\....\...\...\........\....\Main.java,13200,2010-01-29
.......\...\.......\....\...\...\........\....\ParagraphSplitter.java,3308,2010-01-13
.......\...\.......\....\...\...\........\....\ParseException.java,281,2010-01-13
.......\...\.......\....\...\...\........\....\Parser.java,2489,2010-01-13
.......\...\.......\....\...\...\........\....\TagWindow.java,7124,2010-01-13
.......\...\.......\....\...\...\........\....\TextExtractor.java,6355,2010-01-13
.......\...\.......\....\...\...\........\....\Utility.java,9769,2010-01-13
.......\...\.......\....\...\...\........\Main.java,2331,2010-01-29
.......\...\.......\....\...\...\........\Paragraph.java,1600,2010-01-13
.......\...\.......\....\...\...\........\PipelineAnalyzer.java,771,2010-01-13
.......\...\.......\....\...\...\........\scoring
.......\...\.......\....\...\...\........\.......\FrequencyScorer.java,682,2010-01-13
.......\...\.......\....\...\...\........\.......\PWFScorer.java,1406,2010-01-23
.......\...\.......\....\...\...\........\.......\Scorer.java,1588,2010-01-13
.......\...\.......\....\...\...\........\.......\ZeroScorer.java,483,2010-01-13
.......\...\.......\....\...\...\........\terms
.......\...\.......\....\...\...\........\.....\SimpleTermExtractor.java,808,2010-01-29
.......\...\.......\....\...\...\........\.....\TermExtractor.java,587,2010-01-29
.......\...\.......\....\...\...\........\TokenAnalyzer.java,797,2010-01-29
.......\...\.......\....\...\...\db
.......\...\.......\....\...\...\..\DB.java,3004,2010-02-01
.......\...\.......\....\...\...\..\DBCursor.java,1019,2010-01-13
.......\...\.......\....\...\...\..\DocHit.java,626,2010-02-01
.......\...\.......\....\...\...\..\DocumentDB.java,626,2010-02-01
.......\...\.......\....\...\...\..\DocumentEntry.java,2094,2010-02-01
.......\...\.......\....\...\...\..\Entry.java,275,2010-01-13
.......\...\.......\....\...\...\..\Env.java,2701,2010-01-13
.......\...\.......\....\...\...\..\IndexDB.java,923,2010-01-29
.......\...\.......\....\...\...\..\IndexEntry.java,2469,2010-02-01
.......\...\.......\....\...\...\..\MergedDocHits.java,748,2010-01-13
.......\...\.......\....\...\...\..\Proximity.java,2667,2010-01-13
.......\...\.......\....\...\...\..\query
.......\...\.......\....\...\...\..\.....\Query.java,934,2010-01-13
.......\...\.......\....\...\...\..\.....\SocketClient.java,6150,2010-01-29
.......\...\.......\....\...\...\..\.....\SocketServer.java,2294,2010-01-13
.......\...\.......\....\...\...\..\QueryServer.java,4527,2010-01-29
.......\...\.......\....\...\...\..\ResultEntry.java,1718,2010-01-23
.......\...\.......\....\...\...\..\SearchEntry.java,1494,2010-02-01
.......\...\.......\....\...\...\..\Searcher.java,1330,2010-02-01
.......\...\.......\....\...\...\nlp
.......\...\.......\....\...\...\...\ChineseTokenizer.java,5712,2010-02-01
.......\...\.......\....\...\...\...\LuceneTokenizer.java,3027,2010-02-01

近期下载者

相关文件

评论我要评论

收藏者