使用C++11 Locale转换文本编码

最新推荐文章于 2024-04-18 08:30:00 发布

weixin_34102807

最新推荐文章于 2024-04-18 08:30:00 发布

阅读量414

点赞数

文章标签： c/c++

原文链接：http://blog.51cto.com/wdx04/1022220

版权

C++11标准增加了一些新的类模板用于改进对国际化和本地化的支持。其中，std::wstring_convert、std::codecvt_utf8等类的出现解决了以往C++难以实现在Unicode到UTF-8以及CJK等本地多字节编码之间转换文本的问题，现在终于不用再去劳烦第三方库和MultiByteToWideChar/WideCharToMultiByte等繁琐的WindowsAPI了。

下面的代码演示了如何利用这些新类，结合原有的codecvt机制，将UTF-8编码的原始字符串转换为Unicode，然后再转换为中文GBK编码。

#include<tchar.h>
#include<locale>
#include<codecvt>
#include<iostream>

int_tmain(intargc,_TCHAR*argv[])
{
std::stringmystring("\xe4\xb8\xad\xe6\x96\x87");//UTF-8编码的“中文”字符串
std::wstring_convert<std::codecvt_utf8<wchar_t>>cvt_utf8;//UTF-8<->Unicode转换器
std::wstring_convert<std::codecvt<wchar_t,char,std::mbstate_t>>cvt_ansi(newstd::codecvt<wchar_t,char,std::mbstate_t>("CHS"));//GBK<->Unicode转换器
std::wstringws=cvt_utf8.from_bytes(mystring);//UTF-8转换为Unicode
std::stringmyansistr=cvt_ansi.to_bytes(ws);//Unicode转换为GBK
std::cout<<myansistr<<std::endl;
return0;
}

注：以上代码在VisualStudio2010SP1/2012环境下编译通过。

转载于:https://blog.51cto.com/wdx04/1022220

weixin_34102807

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
使用C++11 Locale转换文本编码

C++11标准增加了一些新的类模板用于改进对国际化和本地化的支持。其中，std::wstring_convert、std::codecvt_utf8等类的出现解决了以往C++难以实现在Unicode到UTF-8以及CJK等本地多字节编码之间转换文本的问题，现在终于不用再去劳烦第三方库和MultiByteToWideChar/WideCharToMultiByte等繁琐的Win...
复制链接

扫一扫