- 博客(8)
- 资源 (4)
- 收藏
- 关注
原创 shell for循环
#!/bin/bashcleari=1while(($i < 9))doecho $icrf_test -m cv.all.wd_pos_score.model ${i}.0.txt >> ${i}.0.test.wd_pos_score.txt && ./stat_word.py ${i}.0.test.wd_pos_score.txt >> ${i}.0.kw.wd_pos_sc
2012-03-30 18:51:07 806
原创 计算信息增益(Information Gain),考虑交叉feature
package com.lexcotech.utils;import java.io.BufferedReader;import java.io.FileReader;import java.util.ArrayList;import java.util.Collections;import java.util.Comparator;import java.util.HashMap;
2012-03-30 18:05:21 1844
原创 计算熵
package com.lexcotech.utils;import java.util.Arrays;import java.util.HashMap;import java.util.List;import java.util.Map;import java.util.Map.Entry;public class InformationGain { /** * calc
2012-03-30 17:07:35 965
原创 文件分割,用于将一个文件产生k份文档(仿linux的split,考虑遇到空行再分开)
#!/usr/bin/pythonimport sys,os,commandsif __name__=='__main__': if len(sys.argv)==4 and sys.argv[1]=='help': print 'usage:*.py file2seg num_file des_folder' else: #get tota
2012-03-28 16:13:46 982
原创 产生K-folder交叉验证的代码
static void genKFolder(String CVFolder) { try { // String[] files = { "1.0.txt", "2.0.txt", "3.0.txt", "4.0.txt", // "5.0.txt", "6.0.txt", "7.0.txt", "8.0.txt", "9.0.txt", // "10.0.txt" };
2012-03-28 16:11:57 2165 2
原创 CRF测试语料中统计准确度(最后两列是正确label与预测label)
#!/usr/bin/pythonfrom __future__ import divisionimport sys,os,timeif __name__=='__main__': f=open(sys.argv[1],'r') total=0 right=0 for line in f: if len(line.strip()
2012-03-28 16:09:59 1258 2
转载 relang入门
http://www.blogjava.net/killme2008/archive/2007/06/13/123860.html读erlang.org上面的Erlang Course四天教程1.数字类型,需要注意两点1)B#Val表示以B进制存储的数字Val,比如7> 2#101.5二进制存储的101就是10进制的5了2)$Char表示字符Char的asc
2012-03-01 12:05:14 788
原创 emacs plus erlang
1.download erlang from its website2../configure & make & sudo make install3.quick tip for emacs:c+x c+s:save file;c+x c+c: exit from emacs4.program the first erlang(test.erl):-module(test).-e
2012-03-01 11:46:54 1121
p6spy改造去掉resultset和添加每日归档
2013-07-31
僵尸网络研究
2008-05-04
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人