跑代码的时候遇到这个问题:
(base) root@cef5910f82ad:~/workspace/unsupervised_captioning# python3 preprocessing/crawl_descriptions.py
None
Traceback (most recent call last):
File "preprocessing/crawl_descriptions.py", line 101, in <module>
app.run(main)
File "/root/anaconda3/lib/python3.7/site-packages/absl/app.py", line 274, in run
_run_main(main, argv)
File "/root/anaconda3/lib/python3.7/site-packages/absl/app.py", line 238, in _run_main
sys.exit(main(argv))
File "preprocessing/crawl_descriptions.py", line 97, in main
download(FLAGS.data_dir, FLAGS.num_pages, i, c)
File "preprocessing/crawl_descriptions.py", line 77, in download
all_pages = get_num_pages(label)
File "preprocessing/crawl_descriptions.py", line 61, in get_num_pages
num_pages = int(obj.group(1))
AttributeError: 'NoneType' object has no attribute 'group'
把preprocessing/crawl_descriptions.py中的,
obj = re.search('data-max="(\d*)"', page)
改为
obj = re.search('max="(\d*)"', page)
(base) root@cef5910f82ad:~/workspace/unsupervised_captioning# vim preprocessing/crawl_descriptions.py
(base) root@cef5910f82ad:~/workspace/unsupervised_captioning# python3 preprocessing/crawl_descriptions.py
<re.Match object; span=(232380, 232392), match='max="434847"'>
person 434847 pages available.