GitHub - jpwiig/simpleScraper: a simple web scraper by terminal

Simple web scraper

This is a simple webscraper to be able to scrape html content from website as a bare mininum to either save it to a file, or to print it to a terminal, you can think of the first version of the script as a very simple version of Curl with some extra spice

How to use:

there are no dependcies right now (only OS and requests)

you simply:

clone the repo git clone https://github.com/jpwiig/simpleScraper && cd simpleScraper
run the main.py with the website you want to curl.

example: `main.py https://www.nrk.no`, the program will help you with the rest

be responsible and everything.

Features i want to add:

a way to get images and videos from websites (wget like)
propper installation
more detailed error messages
able to show headers

If you see any crazy thing that shouldnt be there, please let me know, i will check it out! PRs and issues are open!

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
README.md		README.md
main.py		main.py
scraper.py		scraper.py
todo.md		todo.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simple web scraper

How to use:

example: `main.py https://www.nrk.no`, the program will help you with the rest

Features i want to add:

About

Releases

Packages

Languages

jpwiig/simpleScraper

Folders and files

Latest commit

History

Repository files navigation

Simple web scraper

How to use:

example: main.py https://www.nrk.no, the program will help you with the rest

Features i want to add:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

example: `main.py https://www.nrk.no`, the program will help you with the rest

Packages