1. 主要是属性文件的字符转换问题.具体步骤和检测如下:
1) 用一般的编辑器输入所要显示的汉字
2) 使用Java JDK 下的native2ascii. 详细说明见
http://java.sun.com/j2se/1.4.2/docs/tooldocs/windows/native2ascii.html
native2ascii - Native-to-ASCII Converter
Converts a file with native-encoded characters (characters which are non-Latin 1 and non-Unicode) to one with Unicode-encoded characters.
SYNOPSIS
native2ascii [options] [inputfile [outputfile]]
DESCRIPTION
The Java compiler and other Java tools can only process files which contain Latin-1 and/or Unicode-encoded (/udddd notation) characters. native2ascii converts files which contain other character encodings into files containing Latin-1 and/or Unicode-encoded charaters.
If outputfile is omitted, standard output is used for output. If, in addition, inputfile is omitted, standard input is used for input.
OPTIONS
-reverse
Perform the reverse operation: convert a file with Latin-1 and/or Unicode encoded characters to one with native-encoded characters.
-encoding encoding_name
Specify the encoding name which is used by the conversion procedure. The default encoding is taken from System property file.encoding. The encoding_name string must be taken from the first column of the table of supported encodings in the Supported Encodings document.
-Joption
Pass option to the Java virtual machine, where option is one of the options described on the reference page for the java application launcher. For example, -J-Xms48m sets the startup memory to 48 megabytes.
For encoding name you can reference:
http://java.sun.com/j2se/1.4.2/docs/guide/intl/encoding.doc.html
For chinese use normally use: GBK, Big5. In Sun's encoding document, it is mentioned this conversion need: lib/rt.jar or lib/charsets.jar. These two file only can file Sun's JRE package. They cant find in Sun's JDK package.
但是我在转换的时候没有这两个文件似乎也没有问题.
Example: native2ascii -encoding GBK Inputfilename outputfilename
转换后的校验.
native2ascii 会因为你的系统不同,可能会有不同的转换结果. 如果发现问题,你可以去:
http://www.unicode.org/charts/ 查询,依据目前的版本,你可以找到
East Asian Scripts-> Han ideographs->Unified CJK ideographs(5M)
http://www.unicode.org/charts/PDF/U4E00.pdf 这个文件.
下载后,你用PDF reader 的search功能找到你要转换的汉字,比较转换后的
字符是否正确,比如:我,在转换后为 /u6211.
所有的JSP文件采用UTF-8格式遍写.并在文件头强制:
<%@ page contentType="text/html;charset=UTF-8" language="java" %>
这样你的问题就解决了.
1) 用一般的编辑器输入所要显示的汉字
2) 使用Java JDK 下的native2ascii. 详细说明见
http://java.sun.com/j2se/1.4.2/docs/tooldocs/windows/native2ascii.html
native2ascii - Native-to-ASCII Converter
Converts a file with native-encoded characters (characters which are non-Latin 1 and non-Unicode) to one with Unicode-encoded characters.
SYNOPSIS
native2ascii [options] [inputfile [outputfile]]
DESCRIPTION
The Java compiler and other Java tools can only process files which contain Latin-1 and/or Unicode-encoded (/udddd notation) characters. native2ascii converts files which contain other character encodings into files containing Latin-1 and/or Unicode-encoded charaters.
If outputfile is omitted, standard output is used for output. If, in addition, inputfile is omitted, standard input is used for input.
OPTIONS
-reverse
Perform the reverse operation: convert a file with Latin-1 and/or Unicode encoded characters to one with native-encoded characters.
-encoding encoding_name
Specify the encoding name which is used by the conversion procedure. The default encoding is taken from System property file.encoding. The encoding_name string must be taken from the first column of the table of supported encodings in the Supported Encodings document.
-Joption
Pass option to the Java virtual machine, where option is one of the options described on the reference page for the java application launcher. For example, -J-Xms48m sets the startup memory to 48 megabytes.
For encoding name you can reference:
http://java.sun.com/j2se/1.4.2/docs/guide/intl/encoding.doc.html
For chinese use normally use: GBK, Big5. In Sun's encoding document, it is mentioned this conversion need: lib/rt.jar or lib/charsets.jar. These two file only can file Sun's JRE package. They cant find in Sun's JDK package.
但是我在转换的时候没有这两个文件似乎也没有问题.
Example: native2ascii -encoding GBK Inputfilename outputfilename
转换后的校验.
native2ascii 会因为你的系统不同,可能会有不同的转换结果. 如果发现问题,你可以去:
http://www.unicode.org/charts/ 查询,依据目前的版本,你可以找到
East Asian Scripts-> Han ideographs->Unified CJK ideographs(5M)
http://www.unicode.org/charts/PDF/U4E00.pdf 这个文件.
下载后,你用PDF reader 的search功能找到你要转换的汉字,比较转换后的
字符是否正确,比如:我,在转换后为 /u6211.
所有的JSP文件采用UTF-8格式遍写.并在文件头强制:
<%@ page contentType="text/html;charset=UTF-8" language="java" %>
这样你的问题就解决了.