Skip to content

natnew/Python-Projects-Download-A-PDF

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

3 Commits
ย 
ย 
ย 
ย 

Repository files navigation

Python Projects: Download A PDF ๐Ÿ

This repo contains python code that downloads PDF files from a link.
Run the code.

Python

import requests
from bs4 import BeautifulSoup

# URL from which pdfs to be downloaded
url = "https://www."

# Requests URL and get response object
response = requests.get(url)

# Parse text obtained
soup = BeautifulSoup(response.text, 'html.parser')

# Find all hyperlinks present on webpage
links = soup.find_all('a')

i = 0

# From all links check for pdf link and
# if present download file
for link in links:
    if ('.pdf' in link.get('href', [])):
        i += 1
        print("Downloading file: ", i)

        # Get response object for link
        response = requests.get(link.get('href'))

        # Write content in pdf file
        pdf = open("pdf" + str(i) + ".pdf", 'wb')
        pdf.write(response.content)
        pdf.close()
        print("File ", i, " downloaded")

print("All PDF files downloaded")

Output

All PDF files downloaded

About

Python Projects Download A PDF

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages