Python Projects: Download A PDF 🐍

This repo contains python code that downloads PDF files from a link.
Run the code.

Python

import requests
from bs4 import BeautifulSoup

# URL from which pdfs to be downloaded
url = "https://www."

# Requests URL and get response object
response = requests.get(url)

# Parse text obtained
soup = BeautifulSoup(response.text, 'html.parser')

# Find all hyperlinks present on webpage
links = soup.find_all('a')

i = 0

# From all links check for pdf link and
# if present download file
for link in links:
    if ('.pdf' in link.get('href', [])):
        i += 1
        print("Downloading file: ", i)

        # Get response object for link
        response = requests.get(link.get('href'))

        # Write content in pdf file
        pdf = open("pdf" + str(i) + ".pdf", 'wb')
        pdf.write(response.content)
        pdf.close()
        print("File ", i, " downloaded")

print("All PDF files downloaded")

Output

All PDF files downloaded

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
download a file.py		download a file.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Python Projects: Download A PDF 🐍

About

Releases

Packages

Languages

natnew/Python-Projects-Download-A-PDF

Folders and files

Latest commit

History

Repository files navigation

Python Projects: Download A PDF 🐍

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages