本文实例讲述了C#获取网页HTML源码的方法,分享给大家供大家参考。具体方法如下:
关键代码如下:
///
/// 获取网页HTML源码
///
/// 链接 eg:http://www.baidu.com/
/// 编码 eg:Encoding.UTF8
/// HTML源码
public static string GetHtmlSource(string url, Encoding charset)
{
string _html = string.Empty;
try
{
HttpWebRequest _request = (HttpWebRequest)WebRequest.Create(url);
HttpWebResponse _response = (HttpWebResponse)_request.GetResponse();
using (Stream _stream = _response.GetResponseStream())
{
using (StreamReader _reader = new StreamReader(_stream, charset))
{
_html = _reader.ReadToEnd();
}
}
}
catch (WebException ex)
{
using (StreamReader sr = new StreamReader(ex.Response.GetResponseStream()))
{
_html = sr.ReadToEnd();
}
}
catch (Exception ex)
{
_html = ex.Message;
}
return _html;
}
测试代码如下:
public static void GetHtmlSourceTest()
{
string _url = "http://www.baidu.com/";
string _htmlSource = HttpWebRequestUtilsV2.GetHtmlSource(_url, Encoding.UTF8);
Console.WriteLine(_htmlSource);
}
测试效果如下图所示:
希望本文所述对大家的C#程序设计有所帮助