java统计字符串单词的个数

版权声明:本文为博主原创文章,未经博主允许不得转载。 https://blog.csdn.net/li4562258fj155/article/details/51489886

在一些项目中可能需要对一段字符串中的单词进行统计,我在这里写了一个简单的demo,有需要的同学可以拿去看一下。

本人没怎么写个播客,如果有啥说的不对的地方,希望大家指正。

查询指定字符串出现次数可以直接使用:StringUtils.countMatches(str, sub)

统计所有单词出现次数代码:

/**
	 * 统计各个单词出现的次数
	 * @param text
	 */
	public static void findEnglishNum(String text){
		//找出所有的单词
		String[] array = {".", " ", "?", "!"};
		for (int i = 0; i < array.length; i++) {
			text = text.replace(array[i],",");
		}
		String[] textArray = text.split(",");
		
		//遍历 记录
		Map<String, Integer> map = new HashMap<String, Integer>();
		for (int i = 0; i < textArray.length; i++) {
			String key = textArray[i];
			//转为小写
			String key_l = key.toLowerCase();
			if(!"".equals(key_l)){
				Integer num = map.get(key_l);
				if(num == null || num == 0){
					map.put(key_l, 1);
				}else if(num > 0){
					map.put(key_l, num+1);
				}
			}
		}
		//输出到控制台
		System.out.println("各个单词出现的频率为:");
		Iterator<String> iter = map.keySet().iterator();
		while(iter.hasNext()){
			String key = iter.next();
			Integer num = map.get(key);
			System.out.println(key + "\n\t\t" + num + "次\n-------------------");
		}
	}


测试代码:

public static void main(String[] args) {
		String text = "Welcome welcome to ADempiere, a commons-based peer-production of Open Source ERP Applications. This Wiki is for the global community to contribute and share know-how and domain expertise. We hope you can find as much open information and participate in making it most usable for everyone. This project has a bazaar of Citizens with a Community Council Team which work in theFunctional Team and Technical Team along the Software Development Procedure supported and funded by the foundation ADempiere";
		
		findEnglishNum(text);
	}

运行结果:

后面还有一些没有全部截下来


本人代码功力不够深厚,如果代码中有什么不对或不好的地方,欢迎各位大神指点!可怜

阅读更多
换一批

没有更多推荐了,返回首页