qcontent

Category: Clustering algorithms
Development tool: C++
File size: 28625KB
Downloads: 0
Upload date: 2012-08-23 07:16:14
Uploader: sh-1993
Description: A focused crawler system, including a crawler, a main-content extractor, a word segmenter, a classifier, etc.

File list:
INSTALL (1192, 2012-08-23)
LICENSE (0, 2012-08-23)
base (0, 2012-08-23)
base\basictypes.h (3643, 2012-08-23)
base\logging.cc (12965, 2012-08-23)
base\logging.h (19272, 2012-08-23)
base\scoped_ptr.h (6945, 2012-08-23)
base\string16.cc (2990, 2012-08-23)
base\string16.h (7494, 2012-08-23)
base\utf8 (0, 2012-08-23)
base\utf8\utf8_decode.c (4753, 2012-08-23)
base\utf8\utf8_decode.h (226, 2012-08-23)
base\utf8\utf8_decode_loose.c (4564, 2012-08-23)
base\utf8\utf8_to_utf16.c (1829, 2012-08-23)
base\utf8\utf8_to_utf16.h (94, 2012-08-23)
etc (0, 2012-08-23)
etc\qextractor.conf (1873, 2012-08-23)
etc\qfetcher.conf (730, 2012-08-23)
googleurl (0, 2012-08-23)
googleurl\googleurl.pro (1391, 2012-08-23)
googleurl\src (0, 2012-08-23)
googleurl\src\gurl.cc (14903, 2012-08-23)
googleurl\src\gurl.h (15752, 2012-08-23)
googleurl\src\url_canon.h (36768, 2012-08-23)
googleurl\src\url_canon_etc.cc (15890, 2012-08-23)
googleurl\src\url_canon_fileurl.cc (8961, 2012-08-23)
googleurl\src\url_canon_host.cc (17765, 2012-08-23)
googleurl\src\url_canon_icu.cc (7797, 2012-08-23)
googleurl\src\url_canon_icu.h (2589, 2012-08-23)
googleurl\src\url_canon_internal.cc (18101, 2012-08-23)
googleurl\src\url_canon_internal.h (20191, 2012-08-23)
googleurl\src\url_canon_internal_file.h (7045, 2012-08-23)
googleurl\src\url_canon_ip.cc (26558, 2012-08-23)
googleurl\src\url_canon_ip.h (4961, 2012-08-23)
googleurl\src\url_canon_mailtourl.cc (5373, 2012-08-23)
googleurl\src\url_canon_path.cc (17857, 2012-08-23)
googleurl\src\url_canon_pathurl.cc (5218, 2012-08-23)
... ...

These files contain some shared code. You can define your own assertion macros to eliminate the dependency on logging.h.
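
As a rough illustration of the note above, a stand-alone assertion header might look like the sketch below. The macro names QC_CHECK and QC_DCHECK and the helper last_index are hypothetical and do not come from the package; the sketch only assumes that the shared code needs a check that aborts with a message, plus a debug-only variant.

```cpp
#include <cstdio>
#include <cstdlib>

// Hypothetical stand-ins for the assertion macros that logging.h would
// otherwise provide. The names are illustrative, not taken from the package.

// Always-on check: print the failing expression and its location, then abort.
#define QC_CHECK(cond) \
  do { \
    if (!(cond)) { \
      std::fprintf(stderr, "%s:%d: check failed: %s\n", __FILE__, __LINE__, #cond); \
      std::abort(); \
    } \
  } while (0)

// Debug-only check: active unless NDEBUG is defined. In release builds the
// condition is not evaluated, but it is still type-checked via sizeof.
#ifdef NDEBUG
#define QC_DCHECK(cond) do { (void)sizeof(cond); } while (0)
#else
#define QC_DCHECK(cond) QC_CHECK(cond)
#endif

// Example use: return the last valid index of a non-empty sequence.
inline int last_index(int size) {
  QC_DCHECK(size > 0);
  return size - 1;
}
```

With a header along these lines, code that only needs assertions could include it instead of logging.h, provided the call sites are switched to whichever macro names are chosen.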
