I am trying to download a PDF file using requests.get(). It works for most test PDF files I found but for this case it does not and the file is corrupted. If I open the URL with a browser and save the file it is working just fine. I have tried to download it in chunks using 'Stream' but with the same result. Could you please explain to me what am I missing?
import requests
file_url = 'http://medianet.edmond-de-rothschild.fr/edram/pdf/kiid_fr0010172767_en_20200120_20200128_1954.pdf'
headers = {'Content-type': 'application/pdf'}
r = requests.get(file_url, headers=headers)
with open("python.pdf", "wb") as pdf:
pdf.write(r.content)
pdf.close()
解决方案
Fixing the header information makes it work.
import requests
file_url =