WebCrawlerExample:PyQt + Scrapy + MongoDB

上传者: 42134094 | 上传时间: 2023-06-12 11:47:10 | 文件大小: 2.86MB | 文件类型: ZIP
README 残留的问题: 文档完成; 抓取准确率已经进一步提高; 抓取效率和时间,空间性能暂不考虑。 实际存在的不可克服的问题: 部分数据确实没有中标金额; 部分数据把供应商和金额放在单独的附件里; 不标注金额的名称,直接放在供应商名字的后面; 中标结果由多包构成。 程序运行须知: 安装PyQt,Scrapy,MongoDB,PyMongo; 命令行启动MongoDB服务; 命令行运行:python Scraper.py(即界面程序)。

文件下载

资源详情

[{"title":"( 34 个子文件 2.86MB ) WebCrawlerExample:PyQt + Scrapy + MongoDB","children":[{"title":"WebCrawlerExample-master","children":[{"title":"pylint_lab_1.Scraper.txt <span style='color:#111;'> 43.45KB </span>","children":null,"spread":false},{"title":"Scraper.py <span style='color:#111;'> 15.09KB </span>","children":null,"spread":false},{"title":"shzfcg","children":[{"title":"shzfcg","children":[{"title":"settings.py <span style='color:#111;'> 3.07KB </span>","children":null,"spread":false},{"title":"pipelines.py <span style='color:#111;'> 997B </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 0B </span>","children":null,"spread":false},{"title":"items.py <span style='color:#111;'> 411B </span>","children":null,"spread":false},{"title":"spiders","children":[{"title":"__init__.py <span style='color:#111;'> 161B </span>","children":null,"spread":false},{"title":"shzfcgSpider.py <span style='color:#111;'> 5.15KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"pylint_shzfcg.txt <span style='color:#111;'> 0B </span>","children":null,"spread":false},{"title":"pylint_shzfcg.settings.txt <span style='color:#111;'> 142B </span>","children":null,"spread":false},{"title":"pylint_global.txt <span style='color:#111;'> 4.95KB </span>","children":null,"spread":false},{"title":"scrapy.cfg <span style='color:#111;'> 256B </span>","children":null,"spread":false},{"title":"pylint_shzfcg.spiders.shzfcgSpider.txt <span style='color:#111;'> 3.63KB </span>","children":null,"spread":false},{"title":"pylint_shzfcg.pipelines.txt <span style='color:#111;'> 318B </span>","children":null,"spread":false},{"title":"pylint_shzfcg.spiders.__init__.txt <span style='color:#111;'> 0B </span>","children":null,"spread":false},{"title":"pylint_shzfcg.items.txt <span style='color:#111;'> 91B </span>","children":null,"spread":false}],"spread":true},{"title":"pylint_lab_1.txt <span style='color:#111;'> 0B </span>","children":null,"spread":false},{"title":"pylint_lab_1.Scraper_rc.txt <span style='color:#111;'> 834B </span>","children":null,"spread":false},{"title":"Scraper_rc.py <span style='color:#111;'> 72.17KB </span>","children":null,"spread":false},{"title":"images","children":[{"title":"爬取.png <span style='color:#111;'> 8.10KB </span>","children":null,"spread":false},{"title":"配置.png <span style='color:#111;'> 6.89KB </span>","children":null,"spread":false},{"title":"@C~UPI~(BXEMY~~6A0M0$U1.jpg <span style='color:#111;'> 543.65KB </span>","children":null,"spread":false},{"title":"查询.png <span style='color:#111;'> 2.22KB </span>","children":null,"spread":false}],"spread":true},{"title":"pylint_global.txt <span style='color:#111;'> 4.46KB </span>","children":null,"spread":false},{"title":"Scraper.qrc <span style='color:#111;'> 168B </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 0B </span>","children":null,"spread":false},{"title":"doc","children":[{"title":"系统设计.pdf <span style='color:#111;'> 444.82KB </span>","children":null,"spread":false},{"title":"需求规格说明书(最终版).pdf <span style='color:#111;'> 312.30KB </span>","children":null,"spread":false},{"title":"用户手册.pdf <span style='color:#111;'> 877.72KB </span>","children":null,"spread":false},{"title":"系统分析.pdf <span style='color:#111;'> 231.47KB </span>","children":null,"spread":false},{"title":"需求报告.pdf <span style='color:#111;'> 250.18KB </span>","children":null,"spread":false},{"title":"测试报告.pdf <span style='color:#111;'> 663.29KB </span>","children":null,"spread":false}],"spread":true},{"title":"README.md <span style='color:#111;'> 547B </span>","children":null,"spread":false},{"title":".gitignore <span style='color:#111;'> 702B </span>","children":null,"spread":false}],"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明