edspdf-poppler provides a Poppler-based PDF parser component for EDS-PDF. We only provide prebuilt binaries for linux and macos. If you are on windows, you will need to build this lib from source.
Beware, Poppler is GPL-licensed: edspdf-poppler is therefore also GPL-licensed, and any model depending on this component must be too.
Install the library with pip:
$ pip install edspdf-poppler
from edspdf import Pipeline
model = Pipeline()
model.add_pipe('poppler-extractor')
model(pdf_bytes)
Visit the documentation for more information!
We would like to thank Assistance Publique – Hôpitaux de Paris and AP-HP Foundation for funding this project.