java native2ascii,native2ascii命令

将非Unicode编码字符转化为Unicode编码的字符,即国际化。

语法:native2ascii [options] [inputfile [outputfile]]

描述:如果outputfile没有指定,标准输出将用于结果输出;如果inputfile没有指定,标准输入设备将用于输入。

参数

-reverse

使用该参数将Unicode编码字符转换为本地编码字符

-encoding encoding_name

用于指定转化时使用的字符编码。默认编码从系统属性file.encoding获取。后面的表格式字符编码,指定encoding_name使用表格第一栏。

-Joption

该参数一般无需使用,用于指定java虚拟机的启动参数。例如:-J-Xms48m设置虚拟机启动时分配内存为48M 。

Example1:

native2ascii test.txt test_unicode.txt

test.txt文件内容:native2ascii测试

test_unicode.txt文件内容:native2asciiu6d4bu8bd5

Example2:

native2ascii test_unicode.txt test_gbk.txt -reverse

test_gbk.txt内容:native2ascii测试

Basic Encoding Set (contained in lib/rt.jar)

Supported by java.nio, java.io and java.lang APIs

Canonical Name for java.nio APICanonical Name for java.io and java.lang APIDescriptionUS-ASCIIASCIIAmerican Standard Code for Information Interchange

windows-1250Cp1250Windows Eastern European

windows-1251Cp1251Windows Cyrillic

windows-1252Cp1252Windows Latin-1

windows-1253Cp1253Windows Greek

windows-1254Cp1254Windows Turkish

windows-1257Cp1257Windows Baltic

ISO-8859-1ISO8859_1ISO 8859-1, Latin Alphabet No. 1

ISO-8859-2ISO8859_2Latin Alphabet No. 2

ISO-8859-4ISO8859_4Latin Alphabet No. 4

ISO-8859-5ISO8859_5Latin/Cyrillic Alphabet

ISO-8859-7ISO8859_7Latin/Greek Alphabet

ISO-8859-9ISO8859_9Latin Alphabet No. 5

ISO-8859-13ISO8859_13Latin Alphabet No. 7

ISO-8859-15ISO8859_15Latin Alphabet No. 9

KOI8-RKOI8_RKOI8-R, Russian

UTF-8UTF8Eight-bit UCS Transformation Format

UTF-16UTF-16Sixteen-bit UCS Transformation Format, byte order identified by an optional byte-order mark

UTF-16BEUnicodeBigUnmarkedSixteen-bit Unicode Transformation Format, big-endian byte order

UTF-16LEUnicodeLittleUnmarkedSixteen-bit Unicode Transformation Format, little-endian byte order

Not availableUnicodeBigSixteen-bit Unicode Transformation Format, big-endian byte order, with byte-order mark

Not availableUnicodeLittleSixteen-bit Unicode Transformation Format, little-endian byte order, with byte-order mark

Extended Encoding Set (contained in lib/charsets.jar)

Supported by java.nio, java.io and java.lang APIs

Canonical Name for java.nio APICanonical Name for java.io and java.lang APIDescriptionwindows-1255Cp1255Windows Hebrew

windows-1256Cp1256Windows Arabic

windows-1258Cp1258Windows Vietnamese

ISO-8859-3ISO8859_3Latin Alphabet No. 3

ISO-8859-6ISO8859_6Latin/Arabic Alphabet

ISO-8859-8ISO8859_8Latin/Hebrew Alphabet

windows-31jMS932Windows Japanese

EUC-JPEUC_JPJISX 0201, 0208 and 0212, EUC encoding Japanese

x-EUC-JP-LINUXEUC_JP_LINUXJISX 0201, 0208 , EUC encoding Japanese

Shift_JISSJISShift-JIS, Japanese

ISO-2022-JPISO2022JPJIS X 0201, 0208, in ISO 2022 form, Japanese

x-mswin-936MS936Windows Simplified Chinese

GB18030GB18030Simplified Chinese, PRC standard

x-EUC-CNEUC_CNGB2312, EUC encoding, Simplified Chinese

GBKGBKGBK, Simplified Chinese

ISCII91ISCII91ISCII91 encoding of Indic scripts

x-windows-949MS949Windows Korean

EUC-KREUC_KRKS C 5601, EUC encoding, Korean

ISO-2022-KRISO2022KRISO 2022 KR, Korean

x-windows-950MS950Windows Traditional Chinese

x-MS950-HKSCSMS950_HKSCSWindows Traditional Chinese with Hong Kong extensions

x-EUC-TWEUC_TWCNS11643 (Plane 1-3), EUC encoding, Traditional Chinese

Big5Big5Big5, Traditional Chinese

Big5-HKSCSBig5_HKSCSBig5 with Hong Kong extensions, Traditional Chinese

TIS-620TIS620TIS620, Thai

Extended Encoding Set (contained in lib/charsets.jar)

Supported by java.io and java.lang APIs

Canonical NameDescriptionBig5_SolarisBig5 with seven additional Hanzi ideograph character mappings for the Solaris zh_TW.BIG5 locale

Cp037USA, Canada (Bilingual, French), Netherlands, Portugal, Brazil, Australia

Cp273IBM Austria, Germany

Cp277IBM Denmark, Norway

Cp278IBM Finland, Sweden

Cp280IBM Italy

Cp284IBM Catalan/Spain, Spanish Latin America

Cp285IBM United Kingdom, Ireland

Cp297IBM France

Cp420IBM Arabic

Cp424IBM Hebrew

Cp437MS-DOS United States, Australia, New Zealand, South Africa

Cp500EBCDIC 500V1

Cp737PC Greek

Cp775PC Baltic

Cp838IBM Thailand extended SBCS

Cp850MS-DOS Latin-1

Cp852MS-DOS Latin-2

Cp855IBM Cyrillic

Cp856IBM Hebrew

Cp857IBM Turkish

Cp858Variant of Cp850 with Euro character

Cp860MS-DOS Portuguese

Cp861MS-DOS Icelandic

Cp862PC Hebrew

Cp863MS-DOS Canadian French

Cp864PC Arabic

Cp865MS-DOS Nordic

Cp866MS-DOS Russian

Cp868MS-DOS Pakistan

Cp869IBM Modern Greek

Cp870IBM Multilingual Latin-2

Cp871IBM Iceland

Cp874IBM Thai

Cp875IBM Greek

Cp918IBM Pakistan (Urdu)

Cp921IBM Latvia, Lithuania (AIX, DOS)

Cp922IBM Estonia (AIX, DOS)

Cp930Japanese Katakana-Kanji mixed with 4370 UDC, superset of 5026

Cp933Korean Mixed with 1880 UDC, superset of 5029

Cp935Simplified Chinese Host mixed with 1880 UDC, superset of 5031

Cp937Traditional Chinese Host miexed with 6204 UDC, superset of 5033

Cp939Japanese Latin Kanji mixed with 4370 UDC, superset of 5035

Cp942IBM OS/2 Japanese, superset of Cp932

Cp942CVariant of Cp942

Cp943IBM OS/2 Japanese, superset of Cp932 and Shift-JIS

Cp943CVariant of Cp943

Cp948OS/2 Chinese (Taiwan) superset of 938

Cp949PC Korean

Cp949CVariant of Cp949

Cp950PC Chinese (Hong Kong, Taiwan)

Cp964AIX Chinese (Taiwan)

Cp970AIX Korean

Cp1006IBM AIX Pakistan (Urdu)

Cp1025IBM Multilingual Cyrillic: Bulgaria, Bosnia, Herzegovinia, Macedonia (FYR)

Cp1026IBM Latin-5, Turkey

Cp1046IBM Arabic - Windows

Cp1097IBM Iran (Farsi)/Persian

Cp1098IBM Iran (Farsi)/Persian (PC)

Cp1112IBM Latvia, Lithuania

Cp1122IBM Estonia

Cp1123IBM Ukraine

Cp1124IBM AIX Ukraine

Cp1140Variant of Cp037 with Euro character

Cp1141Variant of Cp273 with Euro character

Cp1142Variant of Cp277 with Euro character

Cp1143Variant of Cp278 with Euro character

Cp1144Variant of Cp280 with Euro character

Cp1145Variant of Cp284 with Euro character

Cp1146Variant of Cp285 with Euro character

Cp1147Variant of Cp297 with Euro character

Cp1148Variant of Cp500 with Euro character

Cp1149Variant of Cp871 with Euro character

Cp1381IBM OS/2, DOS People's Republic of China (PRC)

Cp1383IBM AIX People's Republic of China (PRC)

Cp33722IBM-eucJP - Japanese (superset of 5050)

ISO2022_CN_CNSCNS11643 in ISO 2022 CN form, Traditional Chinese (conversion from Unicode only)

ISO2022_CN_GBGB2312 in ISO 2022 CN form, Simplified Chinese (conversion from Unicode only)

JISAutoDetectDetects and converts from Shift-JIS, EUC-JP, ISO 2022 JP (conversion to Unicode only)

MS874Windows Thai

MacArabicMacintosh Arabic

MacCentralEuropeMacintosh Latin-2

MacCroatianMacintosh Croatian

MacCyrillicMacintosh Cyrillic

MacDingbatMacintosh Dingbat

MacGreekMacintosh Greek

MacHebrewMacintosh Hebrew

MacIcelandMacintosh Iceland

MacRomanMacintosh Roman

MacRomaniaMacintosh Romania

MacSymbolMacintosh Symbol

MacThaiMacintosh Thai

MacTurkishMacintosh Turkish

MacUkraineMacintosh Ukraine

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值