Original: Bayesian Forum Spam-Post Filtering Demo System, Beta 1

 

  

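Introduction:

    One of a forum moderator's duties is to maintain the quality of posts: deleting ad posts, flood posts, spam posts, and so on.
    This is a demo system that identifies spam posts automatically; it was developed to reduce the moderator's workload.
    It is based on the naive Bayes principle.

    To use it:
    1. Register an account on 多么乐 and log in.
    2. Enter the raw training data, in two classes: spam posts and non-spam posts.
    3. Enter the post you want to check and view the percentage likelihood that it is spam.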
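
The train-then-check workflow above can be sketched in a few lines of Python. This is only an illustration of the general naive Bayes technique, not the demo system's actual code: the class name `NaiveBayesSpamFilter`, the `spam`/`ham` labels, the whitespace tokenizer, and the add-one (Laplace) smoothing are all assumptions made for the example. Chinese posts would need a word segmenter in place of `tokenize`.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: whitespace-separated, lowercased words.
    # Chinese text would need a proper word segmenter here.
    return text.lower().split()


class NaiveBayesSpamFilter:
    """Hypothetical two-class naive Bayes filter (spam vs. ham)."""

    LABELS = ("spam", "ham")

    def __init__(self):
        self.word_counts = {label: Counter() for label in self.LABELS}
        self.doc_counts = {label: 0 for label in self.LABELS}

    def train(self, text, label):
        # Step 2 of the workflow: record one labeled training post.
        self.word_counts[label].update(tokenize(text))
        self.doc_counts[label] += 1

    def spam_probability(self, text):
        # Step 3 of the workflow: estimate P(spam | post) as a value in [0, 1].
        if min(self.doc_counts.values()) == 0:
            raise ValueError("need at least one training post per class")
        vocab = set(self.word_counts["spam"]) | set(self.word_counts["ham"])
        total_docs = sum(self.doc_counts.values())
        # Words never seen in training carry no evidence, so skip them.
        words = [w for w in tokenize(text) if w in vocab]

        log_scores = {}
        for label in self.LABELS:
            # Log prior: share of training posts with this label.
            score = math.log(self.doc_counts[label] / total_docs)
            total_words = sum(self.word_counts[label].values())
            for word in words:
                # Add-one smoothing keeps unseen-in-class words from zeroing the product.
                smoothed = self.word_counts[label][word] + 1
                score += math.log(smoothed / (total_words + len(vocab)))
            log_scores[label] = score

        # Normalize in log space to avoid underflow on long posts.
        top = max(log_scores.values())
        weights = {k: math.exp(v - top) for k, v in log_scores.items()}
        return weights["spam"] / (weights["spam"] + weights["ham"])


spam_filter = NaiveBayesSpamFilter()
spam_filter.train("buy cheap pills now", "spam")
spam_filter.train("cheap pills discount buy", "spam")
spam_filter.train("how do I configure lucene index", "ham")
spam_filter.train("lucene index search question", "ham")

print(f"{spam_filter.spam_probability('cheap pills') * 100:.1f}% spam")
```

Multiplying the result by 100 gives the percentage shown in step 3. The more labeled posts of each class the filter sees, the more reliable that percentage becomes.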

 

You are welcome to join the discussion and help improve this program.
 

Microsoft Research Asia - Natural Language Computing Group

Papers
  1. Dependence Language Model for Information Retrieval
    Jianfeng Gao, Jian-Yun Nie, Guangyuan Wu and Guihong Cao, "Dependence language model for information retrieval", In SIGIR-2004. Sheffield, UK, July 25-29, 2004.
  2. A New Approach for English-Chinese Named Entity Alignment
    Dong-Hui Feng, Ya-Juan Lv, Ming Zhou, "A New Approach for English-Chinese Named Entity Alignment", 2004 Conference on Empirical Methods in Natural Language Processing, Barcelona, Spain, Jul. 2004.
  3. Collocation Translation Acquisition Using Monolingual Corpora
    Ya-Juan Lv, Ming Zhou, "Collocation Translation Acquisition Using Monolingual Corpora", 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, Jul. 2004.
  4. Adaptive Chinese Word Segmentation
    Jianfeng Gao, Andi Wu, Mu Li, Chang-Ning Huang, Hongqiao Li, Xinsong Xia and Haowei Qin, "Adaptive Chinese word segmentation", 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, Jul. 2004.
  5. The Use of SVM for Chinese New Word Identification
    Hongqiao Li, Chang-Ning Huang, Jianfeng Gao and Xiaozhong Fan, "The use of SVM for Chinese new word identification", In IJCNLP-04. Sanya City, Hainan Island, China, March 22-24, 2004.
  6. Capturing Long Distance Dependency for Language Modeling: An Empirical Study
    Jianfeng Gao and Hisami Suzuki, "Capturing long distance dependency for language modeling: an empirical study", In IJCNLP-04. Sanya City, Hainan Island, China, March 22-24, 2004.
  7. Word Translation Disambiguation Using Bilingual Bootstrapping
    Hang Li and Cong Li, "Word Translation Disambiguation Using Bilingual Bootstrapping", Computational Linguistics 30(1), 1-22, 2004.
  8. Text Classification Using Stochastic Keyword Generation
    Cong Li, Ji-Rong Wen, and Hang Li, "Text Classification Using Stochastic Keyword Generation", Proc. of ICML'03, 464-471.
  9. Uncertainty Reduction in Collaborative Bootstrapping: Measure and Algorithm
    Yunbo Cao, Hang Li, and Li Lian, "Uncertainty Reduction in Collaborative Bootstrapping: Measure and Algorithm", Proc. of ACL'03, 327-334.
  10. Improved Source-Channel Models for Chinese Word Segmentation
    Jianfeng Gao, Mu Li and Chang-Ning Huang, "Improved Source-Channel Models for Chinese Word Segmentation", 41st Annual Meeting of the Association for Computational Linguistics, Sapporo, Japan, July 7-12, 2003.
  11. Topic Analysis Using a Finite Mixture Model
    Hang Li and Kenji Yamanishi, "Topic Analysis Using a Finite Mixture Model", Information Processing & Management, 39(4), 521-541, (2003).
  12. Using Bilingual Web Data to Mine and Rank Translations
    Hang Li, Yunbo Cao, and Cong Li, "Using Bilingual Web Data to Mine and Rank Translations", IEEE Intelligent Systems, Vol. 18(4), 54-59, (2003).


   

Posted on 2005-03-14 23:44:00

Copyright © accesine960