35. What on earth is going on with character sets?

程序媛征男友

Category: Java

Article link: https://blog.csdn.net/zhaonanemail/article/details/24461695

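Character sets really are an annoying problem. System.out.println(System.getProperty("file.encoding")); clearly prints UTF-8, yet reading the file with InputStreamReader reader = new InputStreamReader(is, "UTF-8") produces garbled text. Only InputStreamReader reader = new InputStreamReader(is, "GB2312") or InputStreamReader reader = new InputStreamReader(is, "GBK") reads it correctly. What is going on here? I searched Baidu and still could not make sense of it. Can someone explain it to me?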

import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.util.Properties;

public static void main(String[] args) {
	String dir = "C:/Users/tao/config/nvshen-sensitive-words.properties";
	// Prints the JVM default charset; it says nothing about how the file on disk was saved.
	System.out.println(System.getProperty("file.encoding"));
	boolean found = false;
	// try-with-resources closes the reader and the underlying stream automatically.
	// GB2312 is used here because it is the charset that decodes this file correctly.
	try (Reader reader = new InputStreamReader(new FileInputStream(dir), "GB2312")) {
		Properties prop = new Properties();
		prop.load(reader);
		for (String word : prop.stringPropertyNames()) {
			// Each value holds several entries separated by the full stop "。".
			for (String str : prop.getProperty(word).split("。")) {
				if ("共产党".contains(str)) {
					found = true;
				}
			}
		}
	} catch (IOException e) {
		// Also covers UnsupportedEncodingException, which is a subclass of IOException.
		e.printStackTrace();
	}
	System.out.println(found ? "contain" : "not contain");
}
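
One likely explanation, offered as a sketch rather than a confirmed diagnosis: file.encoding only reports the JVM's default charset, which is what Java falls back to when no charset is given. It does not describe the bytes inside the .properties file. If that file was saved by a Windows editor in GBK (an assumption, since the post does not show how the file was created), then decoding it as UTF-8 yields garbage, while GBK or GB2312 works because GBK is a superset of GB2312. The class name CharsetDemo, the helper names, and the sample text below are made up for illustration; the text is written with Unicode escapes so the source compiles the same under any compiler encoding.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

public class CharsetDemo {
    // Decodes bytes the lenient way, like InputStreamReader: bad input becomes U+FFFD.
    public static String decode(byte[] bytes, Charset charset) {
        return new String(bytes, charset);
    }

    // Strict check: returns false if the bytes are not well-formed in the given charset.
    public static boolean isWellFormed(byte[] bytes, Charset charset) {
        try {
            charset.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        Charset gbk = Charset.forName("GBK");
        String original = "\u4e2d\u6587"; // two Chinese characters as sample content
        // Simulate a file saved in GBK (assumption about the original file).
        byte[] fileBytes = original.getBytes(gbk);

        System.out.println("JVM default charset: " + Charset.defaultCharset());
        System.out.println("GBK decode matches original:   "
                + decode(fileBytes, gbk).equals(original));
        System.out.println("UTF-8 decode matches original: "
                + decode(fileBytes, StandardCharsets.UTF_8).equals(original));
        System.out.println("bytes are well-formed UTF-8:   "
                + isWellFormed(fileBytes, StandardCharsets.UTF_8));
    }
}
```

Running the strict check against the bytes of the real file (for example, read with java.nio.file.Files.readAllBytes) would be one way to tell whether it is actually UTF-8. If it is not, either keep reading it as GBK or re-save the file as UTF-8 and read it as UTF-8.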