通过如下代码在剪贴板中获取含HTML格式的文本时发现当内容出现汉字的时候会有乱码
if
(Clipboard.ContainsText(TextDataFormat.Html))
textBox1.Text = Clipboard.GetText(TextDataFormat.Html);
textBox1.Text = Clipboard.GetText(TextDataFormat.Html);
通过剪贴板工具分析,HTML格式的文本是UTF8编码方式
看来Clipboard.GetText()是解码出了问题
那就自己来完成这个解码工作吧!
if
(Clipboard.ContainsText(TextDataFormat.Html))
{
MemoryStream vMemoryStream =
Clipboard.GetData( " Html Format " ) as MemoryStream;
vMemoryStream.Position = 0 ;
byte [] vBytes = new byte [vMemoryStream.Length];
vMemoryStream.Read(vBytes, 0 , ( int )vMemoryStream.Length);
textBox1.Text = Encoding.UTF8.GetString(vBytes);
}
{
MemoryStream vMemoryStream =
Clipboard.GetData( " Html Format " ) as MemoryStream;
vMemoryStream.Position = 0 ;
byte [] vBytes = new byte [vMemoryStream.Length];
vMemoryStream.Read(vBytes, 0 , ( int )vMemoryStream.Length);
textBox1.Text = Encoding.UTF8.GetString(vBytes);
}
转: http://www.csharpwin.com/csharpspace/9097r5853.shtml
WebBrowser 设置 编码 的原型代码:
IHTMLDocument2 m_hDoc = (IHTMLDocument2)axWebBrowser1.Document;
m_hDoc.charset = "gb2312";