Python: using split('"') and split("=") to extract link text from HTML

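import requests

# web_test_url (the directory listing URL) and web_test_list (a list holding the
# latest package URL) are assumed to be defined earlier in the script.
web_test_response = requests.get(web_test_url)
print("web_test_response11", web_test_response.text)
print("web_test_response22", web_test_response.text.split('"'))


if web_test_response.status_code != 404:
    # The last quoted string in the listing is the newest file's HREF,
    # which is the second-to-last element after splitting on '"'.
    web_url_test = "http://x.x.x.x:xx" + web_test_response.text.split('"')[-2]
    print("web_url_test", web_url_test)
    web_test_response1 = requests.get(web_url_test)
    print("web_test_response1", web_test_response1)
    # The package URL is whatever follows the last "="; strip the trailing whitespace.
    web_test_packurl = web_test_response1.text.split("=")[-1].strip()
    print("web_test_response2", web_test_response1.text)
    print("web_test_response3", web_test_response1.text.split("="))
    print("web_test_response4", web_test_response1.text.split("=")[-1])
    if web_test_packurl not in web_test_list:
        web_test_list.clear()
        web_test_list.append(web_test_packurl)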
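
The same flow can also be sketched as a small function that receives the fetching step as a parameter, so the two splits can be tried without network access. The names `latest_package_url`, `fetch`, and `fake_pages`, as well as the `example.test` host and the sample page contents, are hypothetical and only serve the illustration; this is a sketch under those assumptions, not the original script.

```python
from typing import Callable


def latest_package_url(fetch: Callable[[str], str], listing_url: str, host: str) -> str:
    """Follow the newest link in a directory listing and return the text after the last '='."""
    listing_html = fetch(listing_url)
    # The last quoted string in the listing is the newest file's HREF,
    # so it sits at index -2 after splitting on the double quote.
    newest_path = listing_html.split('"')[-2]
    info_text = fetch(host + newest_path)
    # Everything after the last "=" is the package URL; strip surrounding whitespace.
    return info_text.split("=")[-1].strip()


# Hypothetical stand-in for requests.get(url).text, keyed by URL.
fake_pages = {
    "http://example.test/list/": (
        '<pre><A HREF="/list/a.txt">a.txt</A><br>'
        '<A HREF="/list/b.txt">b.txt</A><br></pre>'
    ),
    "http://example.test/list/b.txt": (
        "\n  url: https://one.example/index.html"
        "\n  global_url: url=https://two.example/index.html\n  \n"
    ),
}

package_url = latest_package_url(
    fake_pages.__getitem__, "http://example.test/list/", "http://example.test"
)
print(package_url)
```

With the real server, `fetch` could be something like `lambda url: requests.get(url, timeout=10).text`.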


web_test_response11: the raw response text, which is then split on the double-quote character (")

<html><head><title>X.X.X.X - /agora_ad/cloudclass/web/test/release_2.7.1/</title></head><body><H1>X.X.X.X - /agora_ad/cloudclass/web/test/release_2.7.1/</H1><hr>

<pre><A HREF="/agora_ad/cloudclass/web/test/">[转到父目录]</A><br><br> 2022/8/19    12:11          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4329.txt">20220819_4329.txt</A><br> 2022/8/19    13:34          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4331.txt">20220819_4331.txt</A><br> 2022/8/19    17:52          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4335.txt">20220819_4335.txt</A><br> 2022/8/22    10:34          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220822_4340.txt">20220822_4340.txt</A><br> 2022/8/23    17:17          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220823_4351.txt">20220823_4351.txt</A><br> 2022/8/24    11:10          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220824_4365.txt">20220824_4365.txt</A><br> 2022/8/30    10:36          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4378.txt">20220830_4378.txt</A><br> 2022/8/30    17:26          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4384.txt">20220830_4384.txt</A><br> 2022/8/30    17:54          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4387.txt">20220830_4387.txt</A><br> 2022/8/30    17:54          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4389.txt">20220830_4389.txt</A><br> 2022/8/31    10:26          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220831_4393.txt">20220831_4393.txt</A><br></pre><hr></body></html>

web_test_response22: the list produced by the split

['<html><head><title>X.X.X.X - /agora_ad/cloudclass/web/test/release_2.7.1/</title></head><body><H1>X.X.X.X - /agora_ad/cloudclass/web/test/release_2.7.1/</H1><hr>\r\n\r\n<pre><A HREF=', '/agora_ad/cloudclass/web/test/', '>[转到父目录]</A><br><br> 2022/8/19    12:11          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4329.txt', '>20220819_4329.txt</A><br> 2022/8/19    13:34          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4331.txt', '>20220819_4331.txt</A><br> 2022/8/19    17:52          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4335.txt', '>20220819_4335.txt</A><br> 2022/8/22    10:34          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220822_4340.txt', '>20220822_4340.txt</A><br> 2022/8/23    17:17          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220823_4351.txt', '>20220823_4351.txt</A><br> 2022/8/24    11:10          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220824_4365.txt', '>20220824_4365.txt</A><br> 2022/8/30    10:36          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4378.txt', '>20220830_4378.txt</A><br> 2022/8/30    17:26          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4384.txt', '>20220830_4384.txt</A><br> 2022/8/30    17:54          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4387.txt', '>20220830_4387.txt</A><br> 2022/8/30    17:54          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4389.txt', '>20220830_4389.txt</A><br> 2022/8/31    10:26          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220831_4393.txt', '>20220831_4393.txt</A><br></pre><hr></body></html>
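
Why index -2 works can be checked on a shortened listing. Splitting on the double quote alternates between text outside quotes and text inside quotes, so the odd indices hold the HREF values and the last of them is the second-to-last element. The sample string below is a trimmed, made-up version of the listing, and the sketch assumes that every double quote on the page belongs to an HREF attribute and that the server lists the oldest file first.

```python
# Shortened sample of the directory listing (hypothetical paths, same shape as the real page).
listing_html = (
    '<pre><A HREF="/web/test/">[To Parent Directory]</A><br><br>'
    ' <A HREF="/web/test/release_2.7.1/20220830_4389.txt">20220830_4389.txt</A><br>'
    ' <A HREF="/web/test/release_2.7.1/20220831_4393.txt">20220831_4393.txt</A><br></pre>'
)

parts = listing_html.split('"')

# Six double quotes produce seven pieces; pieces 1, 3 and 5 were inside quotes.
hrefs = parts[1::2]

# The newest file is the last quoted value, i.e. the second-to-last piece overall.
newest = parts[-2]

print(hrefs)
print(newest)
```

If the page ever contains a quoted attribute other than HREF, the odd-index pattern no longer maps one-to-one to links, which is the main limitation of this approach.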

web_url_test: the second-to-last element of the web_test_response22 list, appended to the host address

http://x.x.x.x:xx/agora_ad/cloudclass/web/test/release_2.7.1/20220831_4393.txt
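
As an alternative to splitting on quotes, the links can be collected with the standard library's `html.parser` and joined to the host with `urllib.parse.urljoin`. This is a different technique from the one used in this post, shown only for comparison. The class name `HrefCollector`, the sample HTML, and the `example.test:8080` host are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class HrefCollector(HTMLParser):
    """Collect the href value of every <a> tag, in document order."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names,
        # so <A HREF="..."> arrives here as tag "a" with attribute "href".
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


# Trimmed, made-up sample with the same shape as the directory listing.
sample_html = (
    '<pre><A HREF="/web/test/">[To Parent Directory]</A><br>'
    ' <A HREF="/web/test/release_2.7.1/20220831_4393.txt">20220831_4393.txt</A><br></pre>'
)

collector = HrefCollector()
collector.feed(sample_html)

# The last link is assumed to be the newest file; join it to a placeholder host.
newest_url = urljoin("http://example.test:8080", collector.hrefs[-1])
print(newest_url)
```

Unlike the quote split, this keeps working if other quoted attributes appear in the page, at the cost of a few more lines.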

Next, process web_test_response2 (the text of the file fetched from web_url_test)

web_test_response2:

url: https://agora-adc-artifacts.s3.cn-north-1.amazonaws.com.cn/apaas/app/test/release_2.7.1/20220831_4393/index.html
            global_url: url=https://solutions-apaas.agora.io/apaas/app/test/release_2.7.1/20220831_4393/index.html

web_test_response3: the list produced by splitting on "="

 ['\n            url: https://x.x.x.x/apaas/app/test/release_2.7.1/20220831_4393/index.html\n            global_url: url', 'https://solutions-apaas.agora.io/apaas/app/test/release_2.7.1/20220831_4393/index.html\n            \n']

web_test_response4: the last element of the web_test_response3 list

https://solutions-apaas.agora.io/apaas/app/test/release_2.7.1/20220831_4393/index.html
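
The "=" split can be checked the same way on a small sample. Note that the last element still carries the trailing newline and spaces visible in the web_test_response3 list, so a `strip()` is useful before comparing or storing the URL. The sample text and the `example` hosts below are made up, and the `partition` variant is only a suggestion for URLs that themselves contain "=".

```python
# Made-up sample with the same layout as the fetched text file.
info_text = (
    "\n            url: https://one.example/app/index.html"
    "\n            global_url: url=https://two.example/app/index.html\n            \n"
)

# The piece after the last "=" still ends with whitespace from the file.
last_piece = info_text.split("=")[-1]
package_url = last_piece.strip()
print(repr(last_piece))
print(package_url)

# If the URL had a query string, split("=")[-1] would keep only the text after
# the final "=". str.partition cuts at the first "=" only and keeps the rest intact.
line_with_query = "global_url: url=https://two.example/app/index.html?v=1"
_, _, after_first = line_with_query.partition("=")
print(after_first)
```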


 
