![Python 3.10][python_badge] ![License: AGPL v3][agpl3_badge] ![Code Style: Black][black_badge] ![Code Coverage][codecov_badge]
Initialises a Selenium browser controller for a specific firefox profile on an
Ubuntu 22.10 system for an apt
installation of Firefox.
Put into a separate pip package to remove boiler-plate code from other repositories that control the browser.
First install this pip package with:
pip install browsercontroller
Then run:
from browsercontroller.get_controller import (
get_ubuntu_apt_firefox_controller,
)
get_ubuntu_apt_firefox_controller(url="https://www.startpagina.nl")
Some entities use checks to verify you are human. at 20223-09-19 those can be evaded using either:
- playwright
- puppeteer
pip install playwright
playwright install
npm install puppeteer
npm install puppeteer-extra-plugin-stealth
npm install puppeteer puppeteer-extra puppeteer-extra-plugin-stealth
# Then run the example script with:
node puppeteer_stealth.js
An example of each of those two options is included. One drawback of the way
playwright is used in this repository, is that the entire sequence of actions
must occur within a single method, because the playwright browsercontroller is
created within a Python with
statement, and returning the Page
object from
the initialisation function: initialise_playwright_browsercontroller()
yields
"loop closed error."
It is implemented in javascript. Pypeteer is an archived Python approximation of puppeteer. However, that is limited in its functionalities and not supported anymore.
Warning:
Checks whether a snap
version of Firefox is installed, and if yes, removes it
and installs an apt
version of Firefox instead. You'll lose browser history,
logins and bookmarks if you don't have an apt
version of Firefox.
Build the pip package with:
pip install --upgrade pip setuptools wheel
pip install twine
Install the pip package locally with:
pip install -e .
Upload the pip package to the world with:
rm -r dist
rm -r build
python3 setup.py sdist bdist_wheel
python3 -m twine upload dist/\*