Unicode
is a character encoding standard that is commonly used in IT in different areas. Unicode is an international standard that is created in 1987 as an alternative to the ASCII and other character sets. As of March 2020, the Unicode character set version is 13.0 and contains 143,859 characters from different languages and alphabets. Currently Unicode character set covers 154 modern alphabets with set and emoji symbols.
Unicode
是一种字符编码标准,在不同领域的IT中都很常用。 Unicode是国际标准,于1987年创建,是ASCII和其他字符集的替代方法。 截至2020年3月,Unicode字符集版本为13.0,其中包含来自不同语言和字母的143,859个字符。 当前,Unicode字符集包含154种带有设置和表情符号的现代字母。
Unicode版本和历史记录 (Unicode Versions and History)
Unicode is very popular and host encoding standard where the latest version is 13.0 which is released March 2020.
Unicode是非常流行的主机编码标准,最新版本是2020年3月发布的13.0。
- Even standardized in 1988 the version 1.0 is released in October 1991 which contains 7,129 characters and supports alphabets like Arabic, Bengali, Greek, Lao, Latin, Tibetan, etc. 甚至在1988年进行了标准化,该版本也于1991年10月发布,于1991年10月发布,它包含7129个字符,并支持阿拉伯语,孟加拉语,希腊语,老挝语,拉丁语,藏语等字母。
- Unicode 2.0 is released in July 1996 and contains 38,885 characters and some updates about existing alphabets like Hangul, Tibetan, etc. Unicode 2.0于1996年7月发布,包含38,885个字符以及有关现有字母(如韩文,藏文等)的一些更新。
- Unicode version 3.0 is released in September 1999 and contains 49,194 characters. This version added Cherokee, Ethiopic, Khmer, Mongolian, etc. alphabets. Unicode 3.0版于1999年9月发布,包含49,194个字符。 此版本添加了切诺基,埃塞俄比亚,高棉语,蒙古语等字母。
- Unicode version 4.0 is released in April 2003 and contains 96,382 characters. Unicode 4.0版于2003年4月发布,包含96382个字符。
- Unicode version 5.0 is released in July 2006. Unicode 5.0版于2006年7月发布。
- Unicode version 6.0 is released in October 2010. Unicode 6.0版于2010年10月发布。
- Unicode version 7.0 is released in June 2014. Unicode版本7.0于2014年6月发布。
- Unicode version 8.0 is released in June 2015. Unicode版本8.0于2015年6月发布。
- Unicode version 9.0 is released in June 2016. Unicode版本9.0于2016年6月发布。
- Unicode version 10.0 is released in June 2017. Unicode版本10.0于2017年6月发布。
- Unicode version 11.0 is released in June 2018. Unicode版本11.0于2018年6月发布。
- Unicode version 12.0 is released in March 2019. Unicode版本12.0于2019年3月发布。
- Unicode version 13.0 is released in March 2020. Unicode版本13.0于2020年3月发布。
Unicode编码标准 (Unicode Encoding Standard)
The Unicode standard is created in order to unify different character sets into single, standardized, and clear versions. Unicode character set is implemented in different technologies like operating systems, XML, Java programming language, PHP, Python, .Net, etc. Unicode can be implemented in different character encodings like UTF-8, UTF-16, UTF-32.
创建Unicode标准是为了将不同的字符集统一为单个,标准化和清晰的版本。 Unicode字符集以不同的技术(例如操作系统,XML,Java编程语言,PHP,Python,.Net等)实现。Unicode可以以不同的字符编码(例如UTF-8,UTF-16,UTF-32)实现。
Unicode标准编码格式 (Unicode Standard Encoding Formats)
Unicode Standard consist of multiple encoding formats with different sizes.
Unicode标准包含多种不同大小的编码格式。
UTF-8
is the smallest encoding format which uses from 1 byte to 4 bytes. UTF-8 is the most popular Unicode standard where 94% of web sites supports and uses it. First 128 characters represents ASCII characters.
UTF-8
是最小的编码格式,使用1字节至4字节。 UTF-8是最流行的Unicode标准,其中94%的网站都支持和使用它。 前128个字符表示ASCII字符。
UTF-16
and UTF-32
are other higher capacity encoding formats that use 4 bytes and a lot of different characters.
UTF-16
和UTF-32
是使用4字节和许多不同字符的其他更高容量的编码格式。
Unicode编码/字符集的用法和采用 (Unicode Encoding/Character Set Usage and Adoption)
Unicode standard is very popular and commonly adopted into different technologies.
Unicode标准非常流行,并且已广泛应用于各种技术中。
操作系统 (Operating Systems)
In order to use Unicode Encoding the operating system must support it. Windows NT operating systems like Windows 2000, Windows XP, Windows Vista, Windows 7, Windows 8, and Windows 10 support UTF-8 and UTF-16. Also modern Linux distributions and MacOSX support UTF-8 and UTF-16 too.
为了使用Unicode编码,操作系统必须支持它。 Windows NT操作系统(例如Windows 2000,Windows XP,Windows Vista,Windows 7,Windows 8和Windows 10)支持UTF-8和UTF-16。 现代Linux发行版和MacOSX也都支持UTF-8和UTF-16。
编程语言 (Programming Languages)
Programming languages like Java, Python, PHP, .NET support both UTF-8 and UTF-16 to read and write files.
Java,Python,PHP,.NET等编程语言支持UTF-8和UTF-16来读写文件。
网页 (Web)
The internet standardization consortium W3C recommends Unicode as their document character set since HTML version 4.0. Web browsers like Google Chrome, Mozilla Firefox, Microsoft Edge, Opera, Safari supports UTF-8 for many years.
自HTML版本4.0起,互联网标准化协会W3C建议将Unicode作为其文档字符集。 像Google Chrome,Mozilla Firefox,Microsoft Edge,Opera,Safari这样的网络浏览器多年来一直支持UTF-8。
Unicode对表情符号的支持 (Unicode Support For Emoji)
Ununicode standard support different emojies whose are popularly used in todays text messaging and chat applications. Also these unicode emojies also used for comments and normal text in websites and forums. Below you can see different emojies and related unicode value.
Ununicode标准支持不同的表情符号,它们在当今的文本消息和聊天应用程序中广泛使用。 这些unicode表情符号也用于网站和论坛中的评论和普通文本。 在下面,您可以看到不同的表情符号和相关的unicode值。

翻译自: https://www.poftut.com/what-is-unicode-encoding-standard/