中国裁判文书下载:selenium路线

成功现状:
在这里插入图片描述

一、requests路线

requests路线需要对js进行解密,对js解密时遇到三个参数

docid
cipher
__RequestVerificationToken

这三个参数主要是针对文书列表页面的
在这里插入图片描述
试图在这个页面获取相应的链接,
js解密后,其中ciphertext参数需要感谢大神:越学越害怕
后面的docid和__RequestVerificationToken都非常简单

但是将这些参数传入后,请求仍然出现状态码202或者状态码200但是内容为None
在这里插入图片描述

最后放弃了requests路线

二、selenium路线

问题一:namespace error 命名空间错误

参见《selenium的namespace错误与无法定位:NamespaceError: An attempt was made to create…andUnable to locate elemen》,链接: link.

虽然原因不明,不知道为什么报这个错误,但是这个错误与标签有关,只要不是特殊标签即可
如果存在特殊标签,比如裁判文书网中的下面这个元素的初始标签:diy:lawyee

<diy:lawyee id="_view_1545184311000" var-name="_var_name_1545184311000" style="display: inline;"> <div class="LM_con clearfix" style="background: none;"> <div class="fl con_left clearfix" style="display: none;"> <a href="../181217BMTKHNT2W0/index.html"> </a> <a id="chartListBtn" href="javascript:;"> </a> </div> <div class="fr con_right">共检索到 <span>1559125</span> 篇文书,显示前600条</div> </div> <div class="LM_tool clearfix"> <div class="fl tool_PX tool_On" data-value="s50"> <a href="javascript:;">法院层级</a> </div> <div class="fl tool_PX " data-value="s51"> <a href="javascript:;">裁判日期</a> </div> <div class="fl tool_PX " data-value="s52"> <a href="javascript:;">审判程序</a> </div> <!-- <div class="fl tool_PX " data-value="s52"> <a href="javascript:;">审判程序</a> </div> --> <div class="fr tool_All"> <a class="AllSelect" href="javascript:;"><input type="checkbox" id="AllSelect"><label for="AllSelect">全选</label></a> <a class="AllKeep" href="javascript:;">批量收藏</a> <a class="AllDownload" href="javascript:void(0);">批量下载</a> </div> </div>   <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 刑事复核 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="3dc5113e6389402aafbeab200113faf6"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=3dc5113e6389402aafbeab200113faf6" class="caseName" target="_blank">刘永权抢劫在法定刑以下量刑复核刑事裁定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">中华人民共和国最高人民法院</span> <span class="ah">(2019)最高法刑核87677387号</span> <span class="cprq">2019-10-29</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,原审被告人刘永权以<span style="color:red">非法占有</span>为目的,采取暴力手段强行劫取他人财物,其行为已构成抢劫罪,且属于入户抢劫。刘永权虽不具有法定减轻处罚情节,但其犯罪情节较轻,并能如实供述罪行,认罪态度较好,且已将...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="3dc5113e6389402aafbeab200113faf6" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="3dc5113e6389402aafbeab200113faf6" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 刑事审判监督 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="21e9f0002d8c4d29baf6aabf00c11a3c"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=21e9f0002d8c4d29baf6aabf00c11a3c" class="caseName" target="_blank">徐国庆 贪污罪 驳回申诉通知书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">(2019)最高法刑申230号</span> <span class="cprq">2019-08-14</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院经审查认为,你利用担任河南省宁陵县国有林场场长的职务便利,隐瞒已用林场债权顶抵部分应付款项的事实,多列支出予以报销,<span style="color:red">非法占有</span>公款25200元。原审认定你犯贪污罪的事实清楚,证据确实、充分,定罪准...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="21e9f0002d8c4d29baf6aabf00c11a3c" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="21e9f0002d8c4d29baf6aabf00c11a3c" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 刑事审判监督 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="06d6f4b4fa2644708563aa8601126387"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=06d6f4b4fa2644708563aa8601126387" class="caseName" target="_blank">抢劫刑事决定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">(2017)最高法刑申407号</span> <span class="cprq">2019-07-05</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,原审法院认定申诉人刘十民因成某的丈夫冉某勇盗窃其财物,使用暴力拉走成某所收废品的事实清楚,证据确实充分。刘十民所提成某同意其拉走废品和没有使用暴力的申诉理由不成立。鉴于:(1)冉某勇盗窃刘十民的财物,负有返还财产、赔偿损失的义务。成某所收购的废品,属...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="06d6f4b4fa2644708563aa8601126387" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="06d6f4b4fa2644708563aa8601126387" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="143b5d3f9ba64acfae8eaa9a00beb973"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=143b5d3f9ba64acfae8eaa9a00beb973" class="caseName" target="_blank">赵志红故意杀人、强奸案死刑复核裁定</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">无</span> <span class="cprq">2019-07-02</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,被告人赵志红故意非法剥夺他人生命,其行为已构成故意杀人罪;违背妇女意志,采用暴力、胁迫等手段强奸妇女,其行为已构成强奸罪;以<span style="color:red">非法占有</span>为目的,采用暴力、胁迫手段劫取他人财物,其行为又构成抢劫...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="143b5d3f9ba64acfae8eaa9a00beb973" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="143b5d3f9ba64acfae8eaa9a00beb973" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="3e068515464d4e03a68eab2700dc9b5e"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=3e068515464d4e03a68eab2700dc9b5e" class="caseName" target="_blank">王洪喜抢劫死刑复核刑事裁定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">无</span> <span class="cprq">2019-06-18</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,被告人王洪喜以<span style="color:red">非法占有</span>为目的,采用暴力手段劫取他人财物,并致人死亡,其行为已构成抢劫罪。犯罪性质恶劣,情节、后果严重,社会危害性大,应依法惩处。第一审判决、第二审裁定认定的事实清楚,证据确...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="3e068515464d4e03a68eab2700dc9b5e" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="3e068515464d4e03a68eab2700dc9b5e" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="b8f6b6fe813945109411ab1b00a5d177"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=b8f6b6fe813945109411ab1b00a5d177" class="caseName" target="_blank">刘乐抢劫、故意杀人死刑复核刑事裁定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">无</span> <span class="cprq">2019-06-03</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,被告人刘乐以<span style="color:red">非法占有</span>为目的,采取杀人的暴力手段劫取他人财物,其行为已构成抢劫罪;刘乐故意非法剥夺他人生命,其行为构成故意杀人罪,应依法并罚。刘乐沉迷赌博,杀害妻子,情节恶劣,后果严重;其杀...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="b8f6b6fe813945109411ab1b00a5d177" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="b8f6b6fe813945109411ab1b00a5d177" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="8051cc11053748f7b85cab0901177455"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=8051cc11053748f7b85cab0901177455" class="caseName" target="_blank">于京平抢劫、强奸死刑复核刑事裁定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">无</span> <span class="cprq">2019-04-09</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,被告人于京平以<span style="color:red">非法占有</span>为目的,采用暴力手段劫取他人财物,又强行与被害人发生性关系,其行为已构成抢劫罪和强奸罪,应依法数罪并罚。于京平在抢劫过程中为制服被害人反抗,用随身携带的斧子多次击打被...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="8051cc11053748f7b85cab0901177455" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="8051cc11053748f7b85cab0901177455" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="341ef75432434ac29b87ab0901177549"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=341ef75432434ac29b87ab0901177549" class="caseName" target="_blank">贾谦龙抢劫死刑复核刑事裁定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">无</span> <span class="cprq">2019-04-01</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,被告人贾谦龙以<span style="color:red">非法占有</span>为目的,采用暴力手段劫取他人财物,其行为已构成抢劫罪。贾谦龙预谋抢劫,当场杀害一名被害人,犯罪手段残忍,情节、后果严重,罪行极其严重,应依法惩处。第一审判决、第二审裁...</p> </div>  <div class="list_Association">  <h4>[关联文书]</h4>   <div class="guanLian"> <span>本篇</span><b></b> <a href="../181107ANFZ0BXSK4/index.html?docId=341ef75432434ac29b87ab0901177549" target="_blank" title=""><i class="guanlianAnyou">其他</i><i>最高人民法院 </i><i>无</i><i>2019-04-01</i><i></i></a> </div>  <div class="guanLian"> <span></span><b></b> <a href="../181107ANFZ0BXSK4/index.html?docId=97fe25f949ee4d5fb136a9d300113d78" target="_blank" title=""><i class="guanlianAnyou">刑事二审</i><i>河北省高级人民法院 </i><i>(2018)冀刑终249号</i><i>2018-09-13</i><i></i></a> </div>  <div class="guanLian"> <span></span><b></b> <a href="../181107ANFZ0BXSK4/index.html?docId=fe7964f3f00a42c7ac04ab2700b109e2" target="_blank" title=""><i class="guanlianAnyou">刑事一审</i><i>河北省邯郸市中级人民法院 </i><i>(2018)冀04刑初2号</i><i>2018-03-28</i><i>判决</i></a> </div>  </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="341ef75432434ac29b87ab0901177549" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="341ef75432434ac29b87ab0901177549" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="fb5ecd6b76fd4709b0f4ab0e00c35e20"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=fb5ecd6b76fd4709b0f4ab0e00c35e20" class="caseName" target="_blank">张治刚抢劫死刑复核刑事裁定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">无</span> <span class="cprq">2019-04-01</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,被告人张治刚以<span style="color:red">非法占有</span>为目的,伙同他人采用暴力手段劫取财物,其行为已构成抢劫罪。张治刚伙同他人经预谋后抢劫无证营运出租车,不顾司机求饶将其杀害,并为掩盖罪证抛尸枯井,犯罪情节恶劣,手段残忍...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="fb5ecd6b76fd4709b0f4ab0e00c35e20" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="fb5ecd6b76fd4709b0f4ab0e00c35e20" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="5b12fee3ad674bb189d7aaf501095b44"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=5b12fee3ad674bb189d7aaf501095b44" class="caseName" target="_blank">张斌抢劫死刑复核刑事裁定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">无</span> <span class="cprq">2019-03-31</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,被告人张斌以<span style="color:red">非法占有</span>为目的,采取暴力手段劫取他人财物,其行为已构成抢劫罪。张斌持械入户抢劫并致人死亡,犯罪情节特别恶劣,后果严重,实属罪行极其严重,应依法惩处。第一审判决、第二审裁定认定的...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="5b12fee3ad674bb189d7aaf501095b44" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="5b12fee3ad674bb189d7aaf501095b44" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="76646deadbd2432e8f0eaaf501095bd8"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=76646deadbd2432e8f0eaaf501095bd8" class="caseName" target="_blank">赵晏飞抢劫死刑复核刑事裁定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">无</span> <span class="cprq">2019-03-22</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,被告人赵晏飞以<span style="color:red">非法占有</span>为目的,以暴力手段入户劫取他人财物,致一人死亡,其行为已构成抢劫罪。赵晏飞经预谋,深夜入户抢劫,并持刀捅刺被害人颈部致被害人死亡,犯罪手段残忍,后果严重,情节特别恶劣...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="76646deadbd2432e8f0eaaf501095bd8" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="76646deadbd2432e8f0eaaf501095bd8" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="becc9a11b4bd4696a424aaf501095bc1"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=becc9a11b4bd4696a424aaf501095bc1" class="caseName" target="_blank">谷宪武抢劫死刑复核刑事裁定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">无</span> <span class="cprq">2019-03-12</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,被告人谷宪武以<span style="color:red">非法占有</span>为目的,使用暴力劫取他人财物,其行为已构成抢劫罪。谷宪武因赌博输钱而实施抢劫,并杀害被害人,犯罪动机卑劣,情节特别恶劣,实属罪行极其严重,应依法惩处。原审判决、高级人...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="becc9a11b4bd4696a424aaf501095bc1" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="becc9a11b4bd4696a424aaf501095bc1" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="88b891c7c7c74f0d954caaf701657b09"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=88b891c7c7c74f0d954caaf701657b09" class="caseName" target="_blank">高中强抢劫死刑复核刑事裁定书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">无</span> <span class="cprq">2019-01-30</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,被告人高中强以<span style="color:red">非法占有</span>为目的,采用暴力手段劫取他人财物,其行为已构成抢劫罪。高中强抢劫并致人死亡,犯罪手段残忍,犯罪情节特别恶劣,实属罪行极其严重,应依法惩处。第一审判决、第二审裁定认定的...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="88b891c7c7c74f0d954caaf701657b09" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="88b891c7c7c74f0d954caaf701657b09" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下载</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 刑事再审 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推荐案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="971d62400160436aaefda9df0112219c"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=971d62400160436aaefda9df0112219c" class="caseName" target="_blank">赵明利诈骗再审刑事判决书</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">(2018)最高法刑再6号</span> <span class="cprq">2019-01-03</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院认为,原审被告人赵明利在与东北风冷轧板公司的冷轧板购销交易过程中,主观上没有<span style="color:red">非法占有</span>的目的,客观上亦未实施虚构事实、隐瞒真相的行为,其行为不符合诈骗罪的构成要件,不构成诈骗罪。理由如下:.......... </diy:lawyee>

只要出现这种特殊标签就会报namespace error

问题二:元素的动态变化

主要是下一页这个元素,这个元素会动态变化
第一页的下一页元素为:
在这里插入图片描述

//*[@id="_view_1545184311000"]/div[18]/a[8]

第二页的元素为:
在这里插入图片描述

//*[@id="_view_1545184311000"]/div[18]/a[9]

由a[8]变为a[9]

一开始试图通过,对于翻页的次数的变化进行更新,但是失败了,失败原因在于只显示前600份裁判文书,但是由于限定条件的不同,查找的裁判文书数量就不同,导致不好判断
思路是这样的:

# 下一页xpath:
            '''
             if 7<=index <36:
                try:
                    button1 = wait.until(EC.presence_of_element_located((By.XPATH, '//div[@class="left_7_3"]/a[14]')))#问题并不是全为a8
                    time.sleep(1)
                    button1.click()
                except:
                    pass

            elif 36 <= index < 40:
                try:
                    button1 = wait.until(EC.presence_of_element_located((By.XPATH, '//div[@class="left_7_3"]/a[%d]' % int(
                        49-index))))
                    time.sleep(1)
                    button1.click()
                except:
                    pass
            elif index ==40:
                pass
            '''

然后试图通过分段函数进行解决,就是提取查找到的文书数量计算要翻的页数:

在这里插入图片描述

"""目的:减少遍历次数"""
        time.sleep(1)
        condition = browser.find_element_by_xpath('//div[@class="LM_con clearfix"]/div[@class="fr con_right"]/span')
        print(condition.text)  # 不能直接//text()原因不明
        conditions = math.ceil(int(condition.text) / 15)  # 最长12,最短6
        print(conditions)

但是还是不成功!!!!

最后突然想到,下一页 这个元素正好是最后一个元素可以直接使用xpath语法选中最后一个

button_ = wait.until(EC.presence_of_element_located((By.XPATH, '//div[@class="left_7_3"]/a[last()]')))
button_.click()

问题三、只显示前600份裁判文书

需要爬所有,但是只显示前600份
,没有办法直接爬取所有,因为裁判文书网限定了只显示前600份

解决方法:通过限定搜索条件如:
在这里插入图片描述
这个方法是某个博主(暂时想不起来)给的思路,这个博主是通过北京市的律师来限定的,但是我没有找到律师的名字,所有我是通过限定区域,如xx区县
在这里插入图片描述

准确说这样,还是有问题,某些区县仍然超过600,但是相对较少了,即使超过了也不多,通过设置筛选可以进一步等待后续处理,如:

"""目的:减少遍历次数"""
        time.sleep(1)
        condition = browser.find_element_by_xpath('//div[@class="LM_con clearfix"]/div[@class="fr con_right"]/span')
        print(condition.text)  # 不能直接//text()原因不明
        conditions = math.ceil(int(condition.text) / 15)  # 最长12,最短6
        print(conditions)
        if int(condition.text) > 600:
            with open('超过600页.txt','a+',encoding='utf-8')as file:
                file.write('出现超过600条的裁判文书,其所在区域为:'+ str(i.strip()) +',其数量为:'+str(condition.text) + '\n')
            logging.warning('出现超过600条的裁判文书,其所在区域为:'+ str(i.strip()) +',其数量为:'+str(condition.text))

问题四:弹出框的处理

参见《selenium弹窗之windows下载文件弹窗点击方法》 ,使用pyautogui进行键鼠自动化

但是据说还有selenium的内部方法

"""下载无弹窗
profile = webdriver.FirefoxProfile()
profile.set_preference('browser.download.dir', '‪I:\\360下载\\firefox')
profile.set_preference('browser.download.folderList', 2)
profile.set_preference('browser.download.manager.showWhenStarting', False)
profile.set_preference('browser.helperApps.neverAsk.saveToDisk', 'application/zip')
"""

但是不知道为什么我的不行,所以注释掉了

三、selenium路线的缺陷

有两个缺陷,一个是速度慢,另一个是弹出框暂时没有办法解决
后续可能需要使用scrapy+selenium,以及使弹出框不在弹出

  • 3
    点赞
  • 20
    收藏
    觉得还不错? 一键收藏
  • 7
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论 7
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值