什么是URL
Uniform Resource Locator: 代表某个资源在互联网中的位置
URL的格式
协议标识://资源名称, 如:http://www.baidu.com
如何创建一个URL
使用java.net.URL类
public static void main(String[] args) {
try {
URL u1 = new URL("http://www.baidu.com");
URL u2 = new URL(u1, "pages/page1.html");
URL u3 = new URL("http", "www.baidu.com", "pages/page2.html");
URL u4 = new URL("http", "www.baidu.com", 80, "pages/page2.html");
} catch (MalformedURLException e) {
e.printStackTrace();
}
}
如何解析一个URL
public static void main(String[] args) throws Exception {
URL aURL = new URL("http://www.baidu.com:80/pages/"
+ "/index.html?param=abc#hahaha");
System.out.println("protocol = " + aURL.getProtocol());
System.out.println("authority = " + aURL.getAuthority());
System.out.println("host = " + aURL.getHost());
System.out.println("port = " + aURL.getPort());
System.out.println("path = " + aURL.getPath());
System.out.println("query = " + aURL.getQuery());
System.out.println("filename = " + aURL.getFile());
System.out.println("ref = " + aURL.getRef());
}
运行结果如下
protocol = http
authority = www.baidu.com:80
host = www.baidu.com
port = 80
path = /pages//index.html
query = param=abc
filename = /pages//index.html?param=abc
ref = hahaha
使用URL创建连接
使用openConnection方法来创建连接, 创建连接之后对这个URL代表的资源做相关操作
public static void main(String[] args) throws Exception {
URL url = new URL("http://www.baidu.com");
URLConnection conn = url.openConnection();
conn.connect();
}
在建立连接之前,可以设置请求连接的值。
如何读取URL代表的资源
- 创建一个URL对象
- 通过URL对象拿到URLConnection
- 通过设置URLConnection对象的属性来设置连接请求的参数
- 使用URLConnection类中的connect方法创建连接
- 使用连接创建一个输入流
- 从输入流中读取数据
- 关闭输入流
public static void main(String[] args) throws Exception {
URL url = new URL("http://www.baidu.com");
URLConnection conn = url.openConnection();
conn.connect();
try (Scanner scanner = new Scanner(new InputStreamReader(conn.getInputStream()))) {
while (scanner.hasNextLine()) {
System.out.println(scanner.nextLine());
}
}
}
运行结果如下
<!DOCTYPE html>
<!--STATUS OK--><html> <head><meta http-equiv=content-type content=text/html;charset=utf-8><meta http-equiv=X-UA-Compatible content=IE=Edge><meta content=always name=referrer><link rel=stylesheet type=text/css href=http://s1.bdstatic.com/r/www/cache/bdorz/baidu.min.css><title>百度一下,你就知道</title></head> <body link=#0000cc> <div id=wrapper> <div id=head> <div class=head_wrapper> <div class=s_form> <div class=s_form_wrapper> <div id=lg> <img hidefocus=true src=//www.baidu.com/img/bd_logo1.png width=270 height=129> </div> <form id=form name=f action=//www.baidu.com/s class=fm> <input type=hidden name=bdorz_come value=1> <input type=hidden name=ie value=utf-8> <input type=hidden name=f value=8> <input type=hidden name=rsv_bp value=1> <input type=hidden name=rsv_idx value=1> <input type=hidden name=tn value=baidu><span class="bg s_ipt_wr"><input id=kw name=wd class=s_ipt value maxlength=255 autocomplete=off autofocus></span><span class="bg s_btn_wr"><input type=submit id=su value=百度一下 class="bg s_btn"></span> </form> </div> </div> <div id=u1> <a href=http://news.baidu.com name=tj_trnews class=mnav>新闻</a> <a href=http://www.hao123.com name=tj_trhao123 class=mnav>hao123</a> <a href=http://map.baidu.com name=tj_trmap class=mnav>地图</a> <a href=http://v.baidu.com name=tj_trvideo class=mnav>视频</a> <a href=http://tieba.baidu.com name=tj_trtieba class=mnav>贴吧</a> <noscript> <a href=http://www.baidu.com/bdorz/login.gif?login&tpl=mn&u=http%3A%2F%2Fwww.baidu.com%2f%3fbdorz_come%3d1 name=tj_login class=lb>登录</a> </noscript> <script>document.write('<a href="http://www.baidu.com/bdorz/login.gif?login&tpl=mn&u='+ encodeURIComponent(window.location.href+ (window.location.search === "" ? "?" : "&")+ "bdorz_come=1")+ '" name="tj_login" class="lb">登录</a>');</script> <a href=//www.baidu.com/more/ name=tj_briicon class=bri style="display: block;">更多产品</a> </div> </div> </div> <div id=ftCon> <div id=ftConw> <p id=lh> <a href=http://home.baidu.com>关于百度</a> <a href=http://ir.baidu.com>About Baidu</a> </p> <p id=cp>©2017 Baidu <a href=http://www.baidu.com/duty/>使用百度前必读</a> <a href=http://jianyi.baidu.com/ class=cp-feedback>意见反馈</a> 京ICP证030173号 <img src=//www.baidu.com/img/gs.gif> </p> </div> </div> </div> </body> </html>
也可以直接使用URL类中的openStream方法,省去创建连接的代码
public static void main(String[] args) throws Exception {
URL url = new URL("http://www.baidu.com");
try (Scanner scanner = new Scanner(new InputStreamReader(url.openStream()))) {
while (scanner.hasNextLine()) {
System.out.println(scanner.nextLine());
}
}
}
实际上URL类中的openStream方法中会帮我们创建连接
public final InputStream openStream() throws java.io.IOException {
return openConnection().getInputStream();
}