SLOR: read off-line sites

This is a set of scripts for downloading a web site and its images and stylesheets. The web site is manipulated in such a way that

links point to the right position on the internet
whereas links to images and stylesheets point to the downloaded version
scripts and iframes are removed without trace

Great, ha?

This is done using an awkward awk script slor.awk which has an ugly interface. That's why there is a script wrapper.sh that allows the following usage

sh wrappper.sh www.abc.de

Somewhat more practical is listener.sh which runs wrapper.sh in a loop, constantly prompting for URLs.

The downloaded files are saved in directories with such memorable names as cfd140df628db7480213704ae76d85a5; the html file is saved in cfd140df628db7480213704ae76d85a5/cfd140df628db7480213704ae76d85a5.html.

Requirements

POSIX shell
POSIX awk
curl

Note that the sh and awk of busybox fulfill the requirements. With some little changes in wrapper.sh and slor.awk, wget can be used instead (even the wget of busybox).

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
README.md		README.md
busybox		busybox
do-escape.awk		do-escape.awk
do-unescape.awk		do-unescape.awk
environment.sh		environment.sh
escape-nonascii.sh		escape-nonascii.sh
listener.sh		listener.sh
multiplier.sh		multiplier.sh
retry.sh		retry.sh
slor.awk		slor.awk
unescape-nonascii.sh		unescape-nonascii.sh
wrapper.sh		wrapper.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SLOR: read off-line sites

Requirements

About

Releases

Packages

Languages

kedorlaomer/slor

Folders and files

Latest commit

History

Repository files navigation

SLOR: read off-line sites

Requirements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages