数据挖掘-网页数据集

  • Z5_467229
    了解作者
  • 1.8MB
    文件大小
  • zip
    文件格式
  • 0
    收藏次数
  • VIP专享
    资源类型
  • 0
    下载次数
  • 2022-05-03 22:57
    上传日期
网页数据集是通过抓获网页数据形成的网页数据,用于数据挖掘的数据测试和数据训练。
数据集.zip
  • 数据集
  • 数据集.txt
    3.5MB
  • 数据集.doc
    10.8MB
内容介绍
<html xmlns="http://www.w3.org/1999/xhtml"> <head> <meta charset="utf-8"> <meta name="generator" content="pdf2htmlEX"> <meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1"> <link rel="stylesheet" href="https://static.pudn.com/base/css/base.min.css"> <link rel="stylesheet" href="https://static.pudn.com/base/css/fancy.min.css"> <link rel="stylesheet" href="https://static.pudn.com/prod/directory_preview_static/6271b354d973ef42a45af173/raw.css"> <script src="https://static.pudn.com/base/js/compatibility.min.js"></script> <script src="https://static.pudn.com/base/js/pdf2htmlEX.min.js"></script> <script> try{ pdf2htmlEX.defaultViewer = new pdf2htmlEX.Viewer({}); }catch(e){} </script> <title></title> </head> <body> <div id="sidebar" style="display: none"> <div id="outline"> </div> </div> <div id="pf1" class="pf w0 h0" data-page-no="1"><div class="pc pc1 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://static.pudn.com/prod/directory_preview_static/6271b354d973ef42a45af173/bg1.jpg"><div class="c x0 y1 w2 h2"><div class="t m0 x1 h3 y2 ff1 fs0 fc0 sc0 ls0 ws0">version: 1.0</div><div class="t m0 x1 h3 y3 ff1 fs0 fc0 sc0 ls0 ws0">url: http://ff<span class="_ _0"></span>ff.363.net/</div><div class="t m0 x1 h3 y4 ff1 fs0 fc0 sc0 ls0 ws0">date: Fri, 04 Jun 2004 14:47:03 GMT</div><div class="t m0 x1 h3 y5 ff1 fs0 fc0 sc0 ls0 ws0">ip: 202.102.16.24</div><div class="t m0 x1 h3 y6 ff1 fs0 fc0 sc0 ls0 ws0">length: 1666</div><div class="t m0 x1 h3 y7 ff1 fs0 fc0 sc0 ls0 ws0">HTTP/1.1 200 OK</div><div class="t m0 x1 h3 y8 ff1 fs0 fc0 sc0 ls0 ws0">Date: Fri, 04 Jun 2004 02:46:13 GMT</div><div class="t m0 x1 h3 y9 ff1 fs0 fc0 sc0 ls0 ws0">Server: <span class="_ _0"></span>A<span class="_ _0"></span>pache/1.3.27 (Unix) PHP/4.2.3</div><div class="t m0 x1 h3 ya ff1 fs0 fc0 sc0 ls0 ws0">Last-Modified: Sat, 12 Jul 2003 23:28:41 GMT</div><div class="t m0 x1 h3 yb ff1 fs0 fc0 sc0 ls0 ws0">ET<span class="_ _1"></span>ag: "27d19b-55b-3f1099a9"</div><div class="t m0 x1 h3 yc ff1 fs0 fc0 sc0 ls0 ws0">Accept-Ranges: bytes</div><div class="t m0 x1 h3 yd ff1 fs0 fc0 sc0 ls0 ws0">Content-Length: 1371</div><div class="t m0 x1 h3 ye ff1 fs0 fc0 sc0 ls0 ws0">Keep-Alive: timeout=4, max=100</div><div class="t m0 x1 h3 yf ff1 fs0 fc0 sc0 ls0 ws0">Connection: Keep-Alive</div><div class="t m0 x1 h3 y10 ff1 fs0 fc0 sc0 ls0 ws0">Content-T<span class="_ _1"></span>ype: text/html</div><div class="t m0 x1 h3 y11 ff1 fs0 fc0 sc0 ls0 ws0">&lt;html&gt;</div><div class="t m0 x1 h3 y12 ff1 fs0 fc0 sc0 ls0 ws0">&lt;head&gt;</div><div class="t m0 x1 h3 y13 ff1 fs0 fc0 sc0 ls0 ws0">&lt;meta http-equiv="Content-T<span class="_ _1"></span>ype" content="text/html; charset=gb2312"&gt;</div><div class="t m0 x1 h4 y14 ff1 fs0 fc0 sc0 ls0 ws0">&lt;title&gt;<span class="ff2">&#24778;&#22825;&#32477;&#25216;</span>&lt;/title&gt;</div><div class="t m0 x1 h3 y15 ff1 fs0 fc0 sc0 ls0 ws0">&lt;/head&gt;</div><div class="t m0 x1 h3 y16 ff1 fs0 fc0 sc0 ls0 ws0">&lt;body bgcolor="#77B7F7"&gt;</div><div class="t m0 x1 h3 y17 ff1 fs0 fc0 sc0 ls0 ws0">&lt;p<span class="_ _2"> </span> <span class="_ _2"> </span>align="center"&gt;&lt;script<span class="_ _2"> </span> <span class="_ _2"> </span>language="javascript"<span class="_ _2"> </span> <span class="_ _2"> </span>src="http://www<span class="_ _1"></span>.beyes.com/ad/img.php?</div><div class="t m0 x1 h3 y18 ff1 fs0 fc0 sc0 ls0 ws0">aid=1<span class="_ _0"></span>1&amp;amp;page=1"&gt; &lt;/script&gt;&lt;/p&gt;</div><div class="t m0 x1 h3 y19 ff1 fs0 fc0 sc0 ls0 ws0">&lt;div align="center"&gt;&lt;center&gt;</div><div class="t m0 x1 h3 y1a ff1 fs0 fc0 sc0 ls0 ws0">&lt;table border="0" width="424" height="20" cellspacing="1" cellpadding="0"&gt;</div><div class="t m0 x1 h3 y1b ff1 fs0 fc0 sc0 ls0 ws0"> &lt;tr&gt;</div><div class="t m0 x1 h3 y1c ff1 fs0 fc0 sc0 ls0 ws0"> <span class="_ _3"> </span> <span class="_ _4"> </span> <span class="_ _3"> </span> &lt;td<span class="_ _5"> </span> <span class="_ _6"> </span>width="424"<span class="_ _6"> </span> <span class="_ _6"> </span>height="20"&gt;&lt;p<span class="_ _5"> </span> <span class="_ _6"> </span>align="center"&gt;&lt;font<span class="_ _6"> </span> <span class="_ _5"> </span>color="#FF00FF"&gt;&lt;a</div><div class="t m0 x1 h4 y1d ff1 fs0 fc0 sc0 ls0 ws0">href="index2.htm"&gt;&lt;strong&gt;&lt;span<span class="_ _7"> </span> <span class="_ _7"> </span>style="font-size:<span class="_ _7"> </span> <span class="_ _8"> </span>25"&gt;<span class="_ _9"> </span><span class="ff2">&#12298;<span class="_ _a"> </span>&#24778;<span class="_ _a"> </span>&#22825;<span class="_ _a"> </span>&#32477;<span class="_ _a"> </span>&#25216;<span class="_ _a"> </span>&#12299;<span class="_ _a"> </span>&#27426;<span class="_ _a"> </span>&#36814;<span class="_ _a"> </span>&#24744;<span class="_ _a"> </span>&#30340;<span class="_ _a"> </span>&#21040;<span class="_ _a"> </span>&#26469;<span class="_ _9"> </span>&#65281;</span></div><div class="t m0 x1 h3 y1e ff1 fs0 fc0 sc0 ls0 ws0">&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;/font&gt;&lt;/td&gt;</div><div class="t m0 x1 h3 y1f ff1 fs0 fc0 sc0 ls0 ws0"> &lt;/tr&gt;</div><div class="t m0 x1 h3 y20 ff1 fs0 fc0 sc0 ls0 ws0">&lt;/table&gt;</div><div class="t m0 x1 h3 y21 ff1 fs0 fc0 sc0 ls0 ws0">&lt;/center&gt;&lt;/div&gt;</div><div class="t m0 x1 h4 y22 ff1 fs0 fc0 sc0 ls0 ws0">&lt;p&gt;<span class="ff2">&#12288;</span>&lt;/p&gt;</div><div class="t m0 x1 h3 y23 ff1 fs0 fc0 sc0 ls0 ws0">&lt;p<span class="_ _b"> </span> <span class="_ _b"> </span>align="center"&gt;&lt;strong&gt;&lt;font<span class="_ _b"> </span> <span class="_ _b"> </span>color="#f<span class="_ _0"></span>ff<span class="_ _0"></span>ff<span class="_ _0"></span>f"&gt;&lt;script<span class="_ _b"> </span> <span class="_ _b"> </span>language="javascript"</div><div class="t m0 x1 h3 y24 ff1 fs0 fc0 sc0 ls0 ws0">src="http://usms.tom.com/js/468_60.js?tomuserid=14330&amp;amp;tomusername=atomyu"&gt;&lt;/</div><div class="t m0 x1 h3 y25 ff1 fs0 fc0 sc0 ls0 ws0">script&gt;&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;</div></div></div><div class="pi" data-data='{"ctm":[1.611850,0.000000,0.000000,1.611850,0.000000,0.000000]}'></div></div> </body> </html>
评论
    相关推荐