自动获取网址html代码网页,自动判断网页编码获取网页源代码内容的函数

自动判断网页编码获取网页源代码内容的函数

'页面下载的函数,url是网页地址,urlcode是网页编码,例子gethttppage("[url]http://www.sifulaoda.com/[/url]","GB2312")

function gethttppage(url,urlcode)

dim http

set http=Server.createobject("MSXML2.XMLHTTP")

Http.open "GET",url,false

Http.send()

if Http.readystate<>4 then

gethttppage=""

exit function

end if

gethttppage=readfile(Http.responseBody,urlcode)

set http=nothing

if err.number<>0 then err.Clear

end function

'字节流变换字符,url是字节流,urlcode网页编码,gb2312,utf-8例子readfile(Http.responseBody,"GB2312")

'20071119可自动判断编码方式

function readfile(url,urlcode)

dim srmobj

set srmobj=Server.CreateObject("ado" & "db.stre" & "am")

srmobj.type=1

srmobj.mode=3

srmobj.open

srmobj.write url

srmobj.position=0

srmobj.type=2

'判断编码方式开始

If AscB(MidB(url,1,1))=&HEF And AscB(MidB(url,2,1))=&HBB Then

srmobj.charset="utf-8"

ElseIf AscB(MidB(url,1,1))=&HFF And AscB(MidB(url,2,1))=&HFE Then

srmobj.charset="unicode"

Else

srmobj.charset="gb2312"

End If

'判断编码方式结束

'srmobj.charset=urlcode

readfile=srmobj.readtext()

set srmobj=nothing

end function

'剔除日文平甲字的函数:

function riwen(mm)

riwen=replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(replace(mm,"ゴ",""),"ガ",""),"ギ",""),"グ ",""),"ゲ",""),"ザ",""),"ジ",""),"ズ",""),"ヅ",""),"デ",""),"ド",""),"ポ",""),"ベ",""),"プ",""),"ビ",""),"パ",""),"ヴ",""),"ボ",""),"ペ",""),"ブ",""),"ピ",""),"バ",""),"ヂ",""),"ダ",""),"ゾ",""),"ゼ","")

end function

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值