SearchEngine4SeuNews
所属分类:搜索引擎
开发工具:C#
文件大小:425KB
下载次数:0
上传日期:2012-10-08 09:18:09
上 传 者:
sh-1993
说明: 东南大学新闻网站搜索引擎
(Search Engine For SEU News Website)
文件列表:
Analysis (0, 2012-10-08)
Analysis\Analysis.csproj (2908, 2012-10-08)
Analysis\MongodbAccess.cs (4803, 2012-10-08)
Analysis\Program.cs (3439, 2012-10-08)
Analysis\Properties (0, 2012-10-08)
Analysis\Properties\AssemblyInfo.cs (1348, 2012-10-08)
Analysis\TextExtractor.cs (6969, 2012-10-08)
Analysis\Util.cs (1164, 2012-10-08)
Crawler (0, 2012-10-08)
Crawler\Crawler.csproj (2744, 2012-10-08)
Crawler\MongodbAccess.cs (1347, 2012-10-08)
Crawler\Program.cs (6601, 2012-10-08)
Crawler\Properties (0, 2012-10-08)
Crawler\Properties\AssemblyInfo.cs (1346, 2012-10-08)
Crawler\Util.cs (648, 2012-10-08)
SearchEngine4SeuNews.sln (5602, 2012-10-08)
SearchServiceDemon (0, 2012-10-08)
SearchServiceDemon\MongodbAccess.cs (7302, 2012-10-08)
SearchServiceDemon\Program.cs (6873, 2012-10-08)
SearchServiceDemon\Properties (0, 2012-10-08)
SearchServiceDemon\Properties\AssemblyInfo.cs (1368, 2012-10-08)
SearchServiceDemon\SearchServiceDemon.csproj (2885, 2012-10-08)
SearchServiceDemon\Util.cs (1164, 2012-10-08)
TextExtract (0, 2012-10-08)
TextExtract\MongodbAccess.cs (2375, 2012-10-08)
TextExtract\Program.cs (830, 2012-10-08)
TextExtract\Properties (0, 2012-10-08)
TextExtract\Properties\AssemblyInfo.cs (1354, 2012-10-08)
TextExtract\TextExtract.csproj (2795, 2012-10-08)
TextExtract\TextExtractor.cs (6969, 2012-10-08)
TextExtract\Util.cs (648, 2012-10-08)
TextExtract\html.txt (5966, 2012-10-08)
WebApplication1 (0, 2012-10-08)
WebApplication1\Default.aspx (3189, 2012-10-08)
WebApplication1\Default.aspx.cs (948, 2012-10-08)
WebApplication1\Default.aspx.designer.cs (2886, 2012-10-08)
... ...
This is a simple search engine for SEU news website(http://news.seu.edu.cn).
Crawler
crawler the news pages and stroe to MongoDB.
Analysis
wordsegment
create forward index
create inverted index
calculate idf of each word
SearchServiceDemon
a simple http server that processes every query.
WebApplication1
the web interface for this search engine.
近期下载者:
相关文件:
收藏者: