ajax采集数据,Ajax站点数据采集研究综述*

最新推荐文章于 2022-12-17 22:12:24 发布

weixin_39831902

最新推荐文章于 2022-12-17 22:12:24 发布

阅读量256

点赞数

文章标签： ajax采集数据

［1］ Garrett J. Ajax: A New Approach to Web Applications［EB/OL］. (2005-02-18).［2010-01-15］. http://www.adaptivepath.com/ideas/essays/archives/000385.php.

［2］ Mesbah A, Van Deursen A. An Architectural Style for Ajax［C］. In: Proceedings of the 6th Working IEEE/IFIP Conference on Software Architecture,Mumbai, India. Washington, DC, USA :IEEE Computer Society,2007: 44-53.

［3］ Bozdag E, Mesbah A, Van Deursen A. A Comparison of Push and Pull Techniques for Ajax［C］. In: Proceedings of the 9th IEEE International Symposium on Web Site Evolution,Paris, France.2007: 15-22.

［4］ Mesbah A, Van Deursen A. Exposing the Hidden-Web Induced by Ajax［R/OL］. ［2009-08-01］. http://swerl.tudelft.nl/twiki/pub/Main/TechnicalReports/TUD-SERG-2008-001.pdf.

［5］ Frey G. Indexing Ajax Web Applications［D］. Zurich: Swiss Federal Institute of Technology Zurich, 2007.

［6］ Matter R. Ajax Crawl: Making Ajax Applications Searchable［D］. Zurich: Swiss Federal Institute of Technology Zurich, 2008.

［7］ Mesbah A， Bozdag E, Van Deursen A.Crawling Ajax by Inferring User Interface State Changes［C］. In: Proceedings of the 8th International Conference on Web Engineering,Yorktown Heights, NJ. Washington, DC, USA: IEEE Computer Society,2008: 122-134.

［8］郭浩, 陆余良, 刘金红. 一种基于状态转换图的Ajax 爬行算法［J］. 计算机应用研究, 2009, 26(11): 4266-4269.

［9］ Duda C, Frey G, Kossmann D, et al. AjaxSearch: Crawling, Indexing and Searching Web 2.0 Applications［J］. Proceedings of the VLDB Endowment Archive, 2008, 1(2): 1440-1443.

［10］夏冰, 高军, 王腾蛟，等. 一种高效的动态脚本网站有效页面获取方法［J］. 软件学报, 2009, 20(z): 176-183.

［11］ Xia T. Extracting Structured Data from Ajax Site［C］. In: Proceedings of 2009 International IEEE Workshop on Database Technology and Applications,Wuhan, China.2009: 259-262.

［12］ Shah S. Crawling Ajax-driven Web 2.0 Applications［R/OL］. (2007-02-14). ［2010-01-15］.http://www.infosecwriters.com/text_resources/pdf/Crawling_AJAX_SShah.pdf.

［13］罗兵. 支持Ajax的互联网搜索引擎爬虫设计与实现［D］. 杭州: 浙江大学, 2007.

［14］肖卓磊. 基于Ajax技术的搜索引擎研究［D］. 武汉: 武汉理工大学, 2009.

［15］曾伟辉, 李淼. 基于JavaScript切片的Ajax框架网络爬虫技术研究［J］. 计算机系统应用, 2009, 18(7): 169-171.

［16］ Mozilla. Rhino: JavaScript for Java ［EB/OL］. ［2009-03-22］. http://www.mozilla.org/rhino/.

［17］ Cobra: Java HTML Renderer & Parser ［EB/OL］. ［2009-01-19］.http://lobobrowser.org/cobra.jsp.

［18］袁小节. 基于协议驱动与事件驱动的综合聚焦爬虫研究与实现［D］. 长沙: 国防科学技术大学, 2009.

［19］ Reis D C, Golgher P B, Silva A S, et al. Automatic Web News Extraction Using Tree Edit Distance［C］. In: Proceedings of the 13th International Conference on World Wide Web, New York. New York, NY, USA: ACM Press, 2004: 502-511.

［20］ Xia T. Extracting Multi-Records from Web Pages［C］. In: Proceedings of the 4th International Conference on Semantics, Knowledge and Grid, Beijing, China.2008: 396-399.

［21］ Marzal A, Vidal E. Computation of Normalized Edit Distance and Applications［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1993, 15(9): 926-932.

［22］ Buttler D. A Short Survey of Document Structure Similarity Algorithm［C］. In: Proceedings of the 5th International Conference on Internet Computing,Las Vegas, US.2004: 3-9.

［23］ Webrenderer ［EB/OL］. ［2010-02-16］. http://www.webrenderer.com/.

［24］ Webclient ［EB/OL］. (2007-09-23). ［2010-02-16］.http://www.mozilla.org/projects/blackwood/webclient/.

［25］ JRex-The Java Browser Component［EB/OL］. ［2009-06-21］. http://jrex.mozdev.org/.

［26］ JExplorer ［EB/OL］.［2010-01-29］. http://www.teamdev.com/jexplorer/.

［27］ Watij［EB/OL］. ［2009-11-16］. http://watij.com/.

［28］ Watir ［EB/OL］. ［2009-11-16］. http://watir.com/.

［29］ HtmlUnit［EB/OL］. ［2010-02-09］. http://htmlunit.sourceforge.net/.

［30］ XHTML Renderer Project［EB/OL］. ［2009-07-01］. https://xhtmlrenderer.dev.java.net/.

［31］ CSS Parser［EB/OL］. ［2009-11-16］. http://cssparser.sourceforge.net/.

［32］ Crowbar［EB/OL］. ［2010-01-16］. http://simile.mit.edu/wiki/Crowbar.

［33］ FireWatir［EB/OL］. ［2010-01-14］. http://code.google.com/p/firewatir/.

weixin_39831902

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
ajax采集数据,Ajax站点数据采集研究综述*

［1］ Garrett J. Ajax: A New Approach to Web Applications［EB/OL］. (2005-02-18).［2010-01-15］. http://www.adaptivepath.com/ideas/essays/archives/000385.php.［2］ Mesbah A, Van Deursen A. An Architectural St...
复制链接

扫一扫