SearchEngine4SeuNews

所属分类:搜索引擎
开发工具:C#
文件大小:425KB
下载次数:0
上传日期:2012-10-08 09:18:09
上 传 者sh-1993
说明:  东南大学新闻网站搜索引擎
(Search Engine For SEU News Website)

文件列表:
Analysis (0, 2012-10-08)
Analysis\Analysis.csproj (2908, 2012-10-08)
Analysis\MongodbAccess.cs (4803, 2012-10-08)
Analysis\Program.cs (3439, 2012-10-08)
Analysis\Properties (0, 2012-10-08)
Analysis\Properties\AssemblyInfo.cs (1348, 2012-10-08)
Analysis\TextExtractor.cs (6969, 2012-10-08)
Analysis\Util.cs (1164, 2012-10-08)
Crawler (0, 2012-10-08)
Crawler\Crawler.csproj (2744, 2012-10-08)
Crawler\MongodbAccess.cs (1347, 2012-10-08)
Crawler\Program.cs (6601, 2012-10-08)
Crawler\Properties (0, 2012-10-08)
Crawler\Properties\AssemblyInfo.cs (1346, 2012-10-08)
Crawler\Util.cs (648, 2012-10-08)
SearchEngine4SeuNews.sln (5602, 2012-10-08)
SearchServiceDemon (0, 2012-10-08)
SearchServiceDemon\MongodbAccess.cs (7302, 2012-10-08)
SearchServiceDemon\Program.cs (6873, 2012-10-08)
SearchServiceDemon\Properties (0, 2012-10-08)
SearchServiceDemon\Properties\AssemblyInfo.cs (1368, 2012-10-08)
SearchServiceDemon\SearchServiceDemon.csproj (2885, 2012-10-08)
SearchServiceDemon\Util.cs (1164, 2012-10-08)
TextExtract (0, 2012-10-08)
TextExtract\MongodbAccess.cs (2375, 2012-10-08)
TextExtract\Program.cs (830, 2012-10-08)
TextExtract\Properties (0, 2012-10-08)
TextExtract\Properties\AssemblyInfo.cs (1354, 2012-10-08)
TextExtract\TextExtract.csproj (2795, 2012-10-08)
TextExtract\TextExtractor.cs (6969, 2012-10-08)
TextExtract\Util.cs (648, 2012-10-08)
TextExtract\html.txt (5966, 2012-10-08)
WebApplication1 (0, 2012-10-08)
WebApplication1\Default.aspx (3189, 2012-10-08)
WebApplication1\Default.aspx.cs (948, 2012-10-08)
WebApplication1\Default.aspx.designer.cs (2886, 2012-10-08)
... ...

This is a simple search engine for SEU news website(http://news.seu.edu.cn). Crawler crawler the news pages and stroe to MongoDB. Analysis wordsegment create forward index create inverted index calculate idf of each word SearchServiceDemon a simple http server that processes every query. WebApplication1 the web interface for this search engine.

近期下载者

相关文件


收藏者