Page Size Check is an utility to check the size of pages from a sitemap and its resources parsering the HAR file of the request using Selenium and haralyzer. The execution of this utility produces some files to allow the user to make an analysis of the number of requests and its size. The execution use ThreadPoolExecutor to launch the browsers in parallel.
- Git.
- Python 3.5 or higher, pip and virtualenvwrapper.
- System requeriments in requirements-sys.txt
Installation
pip install page-size-check
Download
- Firefox webdriver for selenium: geckodriver - Browsermob-proxy
Execution
page_size_check --sitemap_url="sitemap.url" [--help]
Output
- Summary tables: tables with summary info for each url and the total amount of results - Resume urls file: a resume of the urls with the number of entries, the page size and the page load times - Resources list file: a list of the resources on every page with its mimetype, size and load time - Mimetype resources: a resume of the resources grouped by mimetype in each url of the sitemap
If you want to use Docker and avoid to install external dependencies just type:
docker run --rm apsl/page-size-check --sitemap_url=http://www.example.com/sitemap.xml
As was showned before, you can execute the script doing python page_size_check.py --help
to see all the
parameters that you can set:
--browsermob_server_path TEXT | |
Browsermob server path. | |
--browsermob_server_port INTEGER | |
Browsermob server port. | |
--firefox_driver_path TEXT | |
Firefox driver path. | |
--sitemap_url TEXT | |
Sitemap to get urls. | |
--threads INTEGER | |
Number of threads. | |
--display_summary BOOLEAN | |
If true displays the results summary to the stdout. | |
--generate_extra_csv BOOLEAN | |
If true generates extra csv with resume information | |
--help | Show this message and exit. |
Contributions are very welcome. Please open a pull request or file an issue. Tests will be ready as soon as posible, please ensure the coverage at least stays the same before you submit a pull request.
Distributed under the terms of the MIT license, "page-size-check" is free and open source software
If you encounter any problems, please file an issue along with a detailed description.