从全拼反导出词典,不过有很多词组,我写了个程序,去分析,只把单个汉字的提取,就可以了,核心代码:
<script type="text/javascript"><!-- google_ad_client = "pub-4775661300876650"; /* 300x250, 创建于 08-8-31 */ google_ad_slot = "0379449264"; google_ad_width = 300; google_ad_height = 250; //--> </script><script src="http://pagead2.googlesyndication.com/pagead/show_ads.js" type="text/javascript"> </script>
- InputStreamReaderisr=newInputStreamReader(newFileInputStream(
- "c:/WINPY_ys.TXT"),"UTF-16");
- BufferedReaderbr=newBufferedReader(isr);
- Stringstr=br.readLine();
- while(str!=null){
- //判断是不是单个字
- if(((int)str.charAt(1))<128){
- //是单个汉字
- Stringhanzi=String.valueOf(str.charAt(0));
- Stringpinyin=str.substring(1);
- CnPycnPy=newCnPy();
- cnPy.setCn(hanzi);
- cnPy.setPy(pinyin);
- this.cnPyManager.merge(cnPy);
- }
- str=br.readLine();
- }
其它的大家去发挥吧,如果需要已经导入的数据脚本,请跟我联系:web@fangwei.name
<!--google链接-->
<script type="text/javascript"><!-- google_ad_client = "pub-4775661300876650"; /* 728x15, 创建于 08-8-31 */ google_ad_slot = "5036927009"; google_ad_width = 728; google_ad_height = 15; //--> </script><script src="http://pagead2.googlesyndication.com/pagead/show_ads.js" type="text/javascript"> </script>