2020/11/02随记 string类与wstring类的区别

最新推荐文章于 2023-08-02 23:51:12 发布

会飞的cc猫

最新推荐文章于 2023-08-02 23:51:12 发布

阅读量950

点赞数

本文链接：https://blog.csdn.net/qq_35429198/article/details/109444459

版权

I know the higher you climbing the harder fall, however, I never care about the future till I crash down.

string类与wstring类的区别

ANSI string类

美国制定了一套字符编码，对英语字符与二进制位之间的关系，做了统一规定。这被称为ASCII码。

ANSI是默认的编码方式。对于英文文件是ASCII编码，对于简体中文文件是GB2312编码。

注意英文占 1 个字节，汉字 2 个字节，以一个\0结尾。

string类的char中每个字符仅占据一个字节内存，没办法存储汉字

Unicode wstring类

字符串前面加L表示该字符串是Unicode字符串。

这是一种所有符号的编码，可以容纳100多万个符号。

注意以Unicode16为例每个字符(汉字、英文字母)都占 2 个字节，以 2 个连续的\0结尾。

wstring与string的存储方式有很大的不同,wstring所操作的wchar_t中每个字符占两个字节，可以存储一个汉字，特殊字符

如 L"我的字符串" 表示将ANSI字符串转换成unicode的字符串，就是每个字符占用两个字节。

strlen("asd") = 3;
strlen(L"asd") = 6;

UTF-8

UTF-8是互联网上使用最广的一种unicode的实现方式。

注意英文占 1 个字节，汉字占 3 个字节。

string CRoundHollowPierDlg::UnicodeToANSI(const wchar_t* str)
{
   char* pElementText;
   int iTextLen;
   // wide char to multi char
   iTextLen = WideCharToMultiByte(CP_ACP,
       0,
       str,
       -1,
       NULL,
       0,
       NULL,
       NULL);
   pElementText = new char[iTextLen + 1];
   memset((void*)pElementText, 0, sizeof(char) * (iTextLen + 1));
   ::WideCharToMultiByte(CP_ACP,
       0,
       str,
       -1,
       pElementText,
       iTextLen,
       NULL,
       NULL);
   string strText;
   strText = pElementText;
   delete[] pElementText;

return strText;
}

wstring CRoundHollowPierDlg::ANSIToUnicode(const string& str)
{
   int len = 0;
   len = str.length();
   int unicodeLen = ::MultiByteToWideChar(CP_ACP,
       0,
       str.c_str(),
       -1,
       NULL,
       0);
   wchar_t * pUnicode;
   pUnicode = new wchar_t[unicodeLen + 1];
   memset(pUnicode, 0, (unicodeLen + 1) * sizeof(wchar_t));
   ::MultiByteToWideChar(CP_ACP,
       0,
       str.c_str(),
       -1,
       (LPWSTR)pUnicode,
       unicodeLen);
   wstring rt;
   rt = (wchar_t*)pUnicode;
   delete pUnicode;