Summary
1. Front-end basics
-
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<link rel="icon" type="image/jpeg" href="./img/jd.jpg" />
<title>Document</title>
</head>
<body>
<h1>Shocking! UC is down</h1>
<h2>Level 2 heading</h2>
<h3>Level 3 heading</h3>
<h4>Level 4 heading</h4>
<h5>Level 5 heading</h5>
<h6>Level 6 heading</h6>
<p>
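Live: farewell ceremony for Yuan Longping. The farewell ceremony for Academician Yuan Longping, the "Father of Hybrid Rice" and recipient of the Medal of the Republic, will be held at 10:00 a.m. on Monday, May 24, 2021, in Mingde Hall at the Mingyangshan Funeral Home in Changsha, Hunan Province. Academician Yuan died in Changsha, Hunan, on May 22, 2021, and the news shocked the world. Yuan Longping pioneered the research and development of hybrid rice in China and was the first scientist in the world to successfully exploit heterosis in rice, which earned him the title "Father of Hybrid Rice". His ancestral home is De'an, Jiangxi. On the morning of May 24, the Jiangxi Dushi 2 live-broadcast team will be in Changsha to report firsthand as people from all walks of life bid him a tearful farewell.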
</p>
<span>Published: 2014-09-19</span>
<span>Central News</span>
<font>Published: 2014-09-19</font>
<font>Central News</font>
<p>
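<b> Bright moonlight before my window,<br></b>
<b><i> I take it for frost on the ground.<br></i></b>
I raise my head to gaze at the bright moon,<br>
<i> I lower my head and think of home.</i>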
</p>
</body>
</html>
-
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Document</title>
</head>
<body>
<img src="./img/QQ图片20210524103817.png" title="Haha" alt="Failed to load!">
<img src="./img/QQ图片20210524103825.png" title="Luffy" alt="Luffy failed to load">
<img src="http://bos.pgzs.com/rbpiczy/Wallpaper/2013/1/16/5b1de69d655d45a6aa0c5fa06cd236c9-2.jpg" title="Luffy" alt="Luffy failed to load">
<a target="_parent" href="https://image.baidu.com/search/detail?ct=503316480&z=0&ipn=d&word=%E8%B7%AF%E9%A3%9E&step_word=&hs=2&pn=0&spn=0&di=10560&pi=0&rn=1&tn=baiduimagedetail&is=0%2C0&istype=0&ie=utf-8&oe=utf-8&in=&cl=2&lm=-1&st=undefined&cs=3817108849%2C962695613&os=1291579852%2C531077876&simid=24500331%2C960877262&adpicid=0&lpn=0&ln=1428&fr=&fmq=1621828196424_R&fm=&ic=undefined&s=undefined&hd=undefined&latest=undefined&amp;copyright=undefined&se=&sme=&tab=0&width=undefined&height=undefined&face=undefined&ist=&jit=&cg=&bdtype=0&oriquery=&objurl=https%3A%2F%2Fgimg2.baidu.com%2Fimage_search%2Fsrc%3Dhttp%3A%2F%2Fimg.biaoche.org%2F%3Fimg%3D06.imgmini.eastday.com%2Fmobile%2F20180307%2F20180307063319_b4cdc3e594bf1f6464632367303b6e2d_1.jpeg%26refer%3Dhttp%3A%2F%2Fimg.biaoche.org%26app%3D2002%26size%3Df9999%2C10000%26q%3Da80%26n%3D0%26g%3D0n%26fmt%3Djpeg%3Fsec%3D1624420240%26t%3Dbe414d7e0bc39409d29c86e7046ceb7e&fromurl=ippr_z2C%24qAzdH3FAzdH3Fooo_z%26e3Bu3fi7vit_z%26e3Bv54AzdH3FgjofAzdH3F314l4s4jll31bk4_z%26e3Bip4s&gsm=2&rpstart=0&rpnum=0&islist=&querylist=&force=undefined">
<img src="http://bos.pgzs.com/rbpiczy/Wallpaper/2013/1/16/5b1de69d655d45a6aa0c5fa06cd236c9-2.jpg" title="Luffy" alt="Luffy failed to load">
</a>
</body>
</html>
-
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<link rel="icon" type="image/jpeg" href="./img/jd.jpg">
<title>Document</title>
</head>
<body>
<form action="http://127.0.0.1:3000/userinfo" method="POST">
Name: <input type="text" name="name" placeholder="Enter your name" id="username"><br>
Home address: <input type="text" name="address"><br>
Phone number: <input type="tel" name="tel"><br>
<!-- Radio buttons only exclude each other when they share one name; each id must be unique. -->
<label>Male: <input checked type="radio" name="gender" value="male" id="male"></label>
<label>Female: <input type="radio" name="gender" value="female" id="female"></label><br>
<input type="submit" value="Submit"><br>
<input type="reset" value="Reset"><br>
</form>
<br>Plain text input: <input maxlength="10" type="text" name="txt">
<br>Password input: <input type="password" name="pwd">
<br>Radio buttons: <input type="radio" name="sex" id="gender1" checked value="man"/><label for="gender1">Male</label>
<input type="radio" name="sex" id="gender2" value="woman"/><label for="gender2">Female</label>
<br><select></select>
<br>Checkbox: <input type="checkbox">
<br>Plain button: <input type="button" value="Button">
<br>Reset button: <input type="reset" value="Reset">
<br>Color picker: <input type="color" name="colsec">
<br>File picker: <input type="file" multiple>
<br>Time picker: <input type="time">
<!-- type="datetime" is obsolete; current browsers fall back to a plain text input. -->
<br>Date and time picker (obsolete): <input type="datetime">
<br>Local date and time picker: <input type="datetime-local"><br>
Checkbox: <input type="checkbox" checked id="xxxx1"><label for="xxxx1">Table tennis</label><br>
Checkbox: <input type="checkbox" checked id="xxxx2"><label for="xxxx2">Badminton</label><br>
Checkbox: <input type="checkbox" checked id="xxxx3"><label for="xxxx3">Basketball</label><br>
Checkbox: <input type="checkbox" checked id="xxxx4"><label for="xxxx4">Football</label><br>
Reset button: <input type="reset" value="">
</body>
</html>
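The form above submits with method="POST" and no enctype, so the browser sends the fields as application/x-www-form-urlencoded. The following is a minimal Python sketch of what that request body might look like for the three text fields (name, address, tel). The field values are made up for illustration, and the server-side decoding is an assumption, since the notes do not show the handler behind /userinfo.

```python
from urllib.parse import urlencode, parse_qs

# Hypothetical values a user might type into the form's text fields.
fields = {"name": "Li Lei", "address": "Chengdu", "tel": "13800000000"}

# Encode the fields the way a browser does for a default POST form:
# key=value pairs joined by "&", with spaces turned into "+".
body = urlencode(fields)
print(body)

# A server handler could decode the body back into single field values.
decoded = {key: values[0] for key, values in parse_qs(body).items()}
print(decoded)
```

Each key in the body comes from an input's name attribute, which is why an input without a name is left out of the submission.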
-
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<link rel="icon" type="image/jpeg" href="./img/jd.jpg">
<title>Document</title>
</head>
<body>
<textarea style="resize: horizontal;" placeholder="Enter your message" id="txarea" cols="30" rows="10"></textarea>
<br><br>
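Cities in Sichuan:
<!-- <select> has no value attribute; the first option is selected by default. -->
<select name="sex" id="sex">
<option value="1">Chengdu, Sichuan</option>
<option value="2">Dazhou, Sichuan</option>
<option value="3">Ziyang, Sichuan</option>
<option value="4">Mianyang, Sichuan</option>
<option value="5">Nanchong, Sichuan</option>
<option value="6">Leshan, Sichuan</option>
<option value="7">Meishan, Sichuan</option>
</select>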
<ol>
<li>Python</li>
<li>Java</li>
<li>H5</li>
<li>UI</li>
<li>Big data</li>
<li>Internet of Things</li>
</ol>
<ul>
<li>Python</li>
<li>Java</li>
<li>H5</li>
<li>UI</li>
<li>Big data</li>
<li>Internet of Things</li>
</ul>
<div></div>
</body>
</html>
2. Web scraping
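import re

import requests

# Fetch the Sohu home page; the timeout keeps the script from hanging forever.
response = requests.get('https://www.sohu.com/', timeout=10)
print(response)
if response.status_code == 200:
    # The page content was fetched successfully.
    data = response.text
    # Capture href and title from each <a> tag that has both, in that order.
    # [^>] keeps a match inside one tag instead of running into the next.
    re_obj = re.compile(r'''<a[^>]*?href="(.*?)"[^>]*?title=["'](.*?)["'][^>]*>''')
    a_list = re_obj.findall(data)
    print(len(a_list))
    for x in a_list:
        print(x)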
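
The regular expression above only matches when href comes before title and href uses double quotes. As an alternative sketch (not the method used in these notes), the standard library's html.parser can collect the same (href, title) pairs whatever the attribute order or quoting. The class name TitledLinkParser, the helper extract_titled_links, and the sample markup are all made up for illustration.

```python
from html.parser import HTMLParser


class TitledLinkParser(HTMLParser):
    """Collect (href, title) pairs from <a> tags that carry both attributes."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) tuples for the tag being opened.
        if tag != "a":
            return
        attr_map = dict(attrs)
        href, title = attr_map.get("href"), attr_map.get("title")
        if href and title:
            self.links.append((href, title))


def extract_titled_links(html):
    """Return every (href, title) pair found in the given HTML text."""
    parser = TitledLinkParser()
    parser.feed(html)
    return parser.links


# Hypothetical markup standing in for response.text.
sample = '''
<a href="/news/1" title="First story">First</a>
<a class="x" title='Second story' href="/news/2">Second</a>
<a href="/no-title">Skipped</a>
'''
for link in extract_titled_links(sample):
    print(link)
```

In the scraping script, extract_titled_links(response.text) could stand in for re_obj.findall(data).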