In .NET why isn't it true that:
Encoding.UTF8.GetBytes(Encoding.UTF8.GetString(x))
returns the original byte array for an arbitrary byte array x?
It is mentioned in an answer to another question, but the responder doesn't explain why.
Solution
Two things break the round trip. First, an arbitrary byte array is usually not valid UTF-8 at all: when Encoding.UTF8.GetString encounters an invalid byte sequence, its default fallback substitutes the replacement character U+FFFD, so re-encoding the string produces different bytes. Second, character encodings (UTF-8 specifically) can represent the same character with different code point sequences, so even valid input may come back in a different (canonical) form after passing through string APIs that normalize.
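A minimal sketch of the first case, relying only on Encoding.UTF8's default replacement fallback (the byte values here are arbitrary examples):

```csharp
using System;
using System.Text;

class RoundTripDemo
{
    static void Main()
    {
        // 0xFF can never occur in valid UTF-8, so decoding replaces it
        // with U+FFFD (encoded as EF BF BD) and the round trip fails.
        byte[] x = { 0x61, 0xFF };
        byte[] roundTrip = Encoding.UTF8.GetBytes(Encoding.UTF8.GetString(x));

        Console.WriteLine(BitConverter.ToString(x));         // 61-FF
        Console.WriteLine(BitConverter.ToString(roundTrip)); // 61-EF-BF-BD
    }
}
```

If you need a lossless byte-to-string round trip, use an encoding designed for that (such as Base64 via Convert.ToBase64String) rather than treating raw bytes as UTF-8 text.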
See also String.Normalize(System.Text.NormalizationForm.FormD), and this note from its documentation:
Some Unicode sequences are considered equivalent because they represent the same character. For example, the following are considered equivalent because any of these can be used to represent "ắ":
"\u1EAF"
"\u0103\u0301"
"\u0061\u0306\u0301"
However, ordinal, that is, binary, comparisons consider these sequences different because they contain different Unicode code values. Before performing ordinal comparisons, applications must normalize these strings to decompose them into their basic components.
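The equivalence above can be checked directly with String.Normalize; this sketch uses the composed and fully decomposed forms of "ắ" from the list above:

```csharp
using System;
using System.Text;

class NormalizationDemo
{
    static void Main()
    {
        string composed   = "\u1EAF";             // "ắ" as one precomposed code point
        string decomposed = "\u0061\u0306\u0301"; // "a" + combining breve + combining acute

        // Ordinal (binary) comparison sees different code point sequences:
        Console.WriteLine(string.Equals(composed, decomposed,
            StringComparison.Ordinal));           // False

        // After decomposing both to Form D they compare equal:
        Console.WriteLine(
            composed.Normalize(NormalizationForm.FormD) ==
            decomposed.Normalize(NormalizationForm.FormD)); // True
    }
}
```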
That page also comes with a nice sample showing how strings compare before and after normalization.