解决采集UTF-8出现乱码的问题

' ==================================================
'
函数名:GetHttpPage
'
作  用:获取网页源码
'
参  数:HttpUrl ------网页地址
'
==================================================
Function  GetHttpPage(HttpUrl)
    
If   IsNull (HttpUrl) = True   Or   Len (HttpUrl) < 18   Or  HttpUrl = " $False$ "   Then
        GetHttpPage
= " $False$ "
        
Exit   Function  
    
End   If  
    
Dim  Http,TheContent,TheHead
    
Set  Http = server.createobject( " MSXML2.XMLHTTP " )
    Http.open 
" GET " ,HttpUrl, False
    Http.Send()
    
If  Http.Readystate <> 4   then
        
Set  Http = Nothing  
        GetHttpPage
= " $False$ "
        
Exit   Function  
    
End   If
    TheContent
= bytesToBSTR(Http.responseBody, " gb2312 " )     ' 用gb2312获得源程序
    TheHead = GetBody(TheContent, " charset= " , " > " , false , false )   ' 取头部分析
     If   InStr (TheHead, " UTF-8 " ) <> 0   Or   InStr (TheHead, " utf-8 " ) <> 0   Then
        GetHttpPage
= bytesToBSTR(Http.responseBody, " utf-8 " )
    
Else
        GetHttpPage
= bytesToBSTR(Http.responseBody, " gb2312 " )
    
End   If  
   
Set  Http = Nothing
   
If  Err.number <> 0   then
      Err.Clear
   
End   If
End Function  
 
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值