Release History

master

0.11.0 (2023-02-26)

  • Replace PdfFileReader with PdfReader and require PyPDF >=3.0.0. #307 by Martin Thoma.

0.10.1 (2021-07-11)

  • Change extra requirements from cv to base. You can use pip install "camelot-py[base]" to install everything required to run camelot.

0.10.0 (2021-07-11)

Improvements

  • Add support for multiple image conversion backends. #198 and #253 by Vinayak Mehta.
  • Add markdown export format. #222 by Lucas Cimon.
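A minimal sketch of how the markdown format might be selected when exporting. It assumes `TableList.export` accepts `f="markdown"` for the format added in this release; the `format_for` helper, the `export_tables` wrapper, and the extension mapping are hypothetical and not part of Camelot.

```python
from pathlib import Path

# Hypothetical mapping from file extension to the `f` argument of
# TableList.export(). This helper is illustrative, not part of Camelot.
EXPORT_FORMATS = {
    ".csv": "csv",
    ".json": "json",
    ".html": "html",
    ".md": "markdown",
}


def format_for(out_path):
    """Return the export format name implied by the file extension."""
    suffix = Path(out_path).suffix.lower()
    if suffix not in EXPORT_FORMATS:
        raise ValueError(f"unsupported export extension: {suffix!r}")
    return EXPORT_FORMATS[suffix]


def export_tables(pdf_path, out_path):
    """Extract tables from pdf_path and write them to out_path."""
    import camelot  # imported lazily so format_for() works without Camelot

    tables = camelot.read_pdf(pdf_path)
    tables.export(out_path, f=format_for(out_path))
```

Calling `export_tables("foo.pdf", "foo.md")` would then request the markdown format, provided Camelot and its dependencies are installed.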

Documentation

0.9.0 (2021-06-15)

Bugfixes

Improvements

Documentation

0.8.2 (2020-07-27)

  • Revert the changes in 0.8.1.

0.8.1 (2020-07-21)

Bugfixes

  • #169 Fix import error caused by pdfminer.six==20200720. #171 by Vinayak Mehta.

0.8.0 (2020-05-24)

Improvements

  • Drop Python 2 support!
    • Remove Python 2.7 and 3.5 support.
    • Replace all instances of .format with f-strings.
    • Remove all __future__ imports.
    • Fix HTTP 403 forbidden exception in read_pdf(url) and remove Python 2 urllib support.
    • Fix test data.
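To illustrate the `.format` to f-string change listed above, here is a small sketch. The message and variable names are made up for the example and do not come from the Camelot codebase.

```python
name = "foo.pdf"
pages = 3

# Older style: str.format(), which also runs on Python 2.
old_style = "Processing {} ({} pages)".format(name, pages)

# Newer style: an f-string, which requires Python 3.6 or later.
new_style = f"Processing {name} ({pages} pages)"

# Both spellings produce the same string.
assert old_style == new_style
```

Because f-strings are a syntax error on Python 2.7 and 3.5, this change goes hand in hand with dropping those versions.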

Bugfixes

  • Fix library discovery on Windows. #32 by KOLANICH.
  • Fix calling convention of callback functions. #34 by KOLANICH.

0.7.3 (2019-07-07)

Improvements

  • Camelot now follows the Black code style! #1 and #3.

Bugfixes

  • Fix Click.HelpFormatter monkey-patch. #5 by Dimiter Naydenov.
  • Fix strip_text argument getting ignored. #4 by Dimiter Naydenov.
  • #25 edge_tol skipped in read_pdf. #26 by Vinayak Mehta.
  • Fix pytest deprecation warning. #2 by Vinayak Mehta.
  • #293 Split text ignores all text to the right of last cut. #294 by Vinayak Mehta.
  • #277 Sort TableList by order of tables in PDF. #283 by Sym Roe.
  • #312 table_regions throws ValueError when flavor='stream'. #332 by Vinayak Mehta.

0.7.2 (2019-01-10)

Bugfixes

  • #245 Fix AttributeError for encrypted files. #251 by Yatin Taluja.

0.7.1 (2019-01-06)

Bugfixes

  • Move ghostscript import to inside the function so Anaconda builds don't fail.

0.7.0 (2019-01-05)

Improvements

  • #240 Add support to analyze only certain page regions to look for tables. #243 by Vinayak Mehta.
    • You can use table_regions in read_pdf() to specify approximate page regions which may contain tables.
    • Kwarg line_size_scaling is now called line_scale.
  • #212 Add support to export as sqlite database. #244 by Vinayak Mehta.
  • #239 Raise warning if PDF is image-based. #240 by Vinayak Mehta.
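The `table_regions` and `line_scale` options and the sqlite export above could be combined roughly as follows. This is a sketch under assumptions: regions are taken to be `"x1,y1,x2,y2"` strings in PDF coordinate space, `line_scale` is treated as a Lattice option, and the `region` and `read_regions` helpers and all coordinate values are hypothetical.

```python
def region(x1, y1, x2, y2):
    """Format a page region as an "x1,y1,x2,y2" string (assumed format)."""
    return f"{x1},{y1},{x2},{y2}"


def read_regions(pdf_path, regions, line_scale=15, sqlite_path=None):
    """Look for tables only inside the given approximate page regions.

    regions is an iterable of (x1, y1, x2, y2) tuples. If sqlite_path is
    given, the extracted tables are also exported as a sqlite database.
    """
    import camelot  # imported lazily so region() works without Camelot

    tables = camelot.read_pdf(
        pdf_path,
        flavor="lattice",
        table_regions=[region(*r) for r in regions],
        line_scale=line_scale,  # formerly line_size_scaling
    )
    if sqlite_path is not None:
        tables.export(sqlite_path, f="sqlite")
    return tables
```

For example, `read_regions("foo.pdf", [(170, 370, 560, 270)], sqlite_path="foo.sqlite")` would restrict detection to one region and write the result to a database file.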

Documentation

  • Remove mention of old mesh kwarg from docs. #241 by fte10kso.

Note: The Python wrapper to Ghostscript's C API is now vendored under the ext module, because the ghostscript package is unavailable on Anaconda. The code should be removed after we submit a recipe for it to conda-forge. With this release, users no longer need to ensure that the Ghostscript executable is available on the PATH variable.

0.6.0 (2018-12-24)

Improvements

  • #91 Add support to read from url. #236 by Vinayak Mehta.
  • #229, #230 and #233 New configuration parameters. #234 by Vinayak Mehta.
    • strip_text: To define characters that should be stripped from each string.
    • edge_tol: Tolerance parameter for extending textedges vertically.
    • resolution: Resolution used for PDF to PNG conversion.
    • Check out the advanced docs for usage details.
  • #170 Add option to pass pdfminer layout kwargs. #232 by Vinayak Mehta.
    • Keyword arguments for pdfminer.layout.LAParams can now be passed using layout_kwargs in read_pdf().
    • The margins keyword argument in read_pdf() is now deprecated.
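A rough sketch of how the new parameters might be passed to `read_pdf()`. It assumes that `edge_tol` applies to the Stream parser and `resolution` to the Lattice parser, so the two are kept in separate option sets. The specific values, the `read_from_url` helper, and the choice of `detect_vertical` as an example `LAParams` argument are illustrative only.

```python
# Options for the Stream parser. edge_tol is assumed to be Stream-only.
STREAM_OPTIONS = {
    "flavor": "stream",
    "strip_text": " .\n",  # characters stripped from each extracted string
    "edge_tol": 500,       # tolerance for extending textedges vertically
    # Forwarded to pdfminer.layout.LAParams.
    "layout_kwargs": {"detect_vertical": False},
}

# Options for the Lattice parser. resolution is assumed to be Lattice-only.
LATTICE_OPTIONS = {
    "flavor": "lattice",
    "strip_text": "\n",
    "resolution": 300,     # resolution used for PDF to PNG conversion
}


def read_from_url(url, options):
    """Download a PDF from url and extract its tables with the given options."""
    import camelot  # imported lazily so the option sets load without Camelot

    return camelot.read_pdf(url, **options)
```

Keeping parser-specific options apart avoids passing a Stream option to Lattice or the reverse, which Camelot may reject.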

0.5.0 (2018-12-13)

Improvements

  • #207 Add a plot type for Stream text edges and detected table areas. #224 by Vinayak Mehta.
  • #204 suppress_warnings is now called suppress_stdout. #225 by Vinayak Mehta.
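Several keyword arguments have been renamed across releases in this changelog. The sketch below shows one way calling code might translate old names to new ones. `migrate_kwargs` is a hypothetical helper, not something Camelot provides, and it only renames keys without checking whether the accepted values changed.

```python
import warnings

# Renames recorded in this changelog: old keyword -> new keyword.
RENAMED_KWARGS = {
    "table_area": "table_areas",             # renamed in 0.3.0
    "suppress_warnings": "suppress_stdout",  # renamed in 0.5.0
    "line_size_scaling": "line_scale",       # renamed in 0.7.0
}


def migrate_kwargs(kwargs):
    """Return a copy of kwargs with renamed keywords replaced by new names."""
    migrated = {}
    for key, value in kwargs.items():
        new_key = RENAMED_KWARGS.get(key, key)
        if new_key != key:
            warnings.warn(
                f"{key!r} is now called {new_key!r}",
                DeprecationWarning,
                stacklevel=2,
            )
        migrated[new_key] = value
    return migrated
```

The result can be passed on as `camelot.read_pdf(path, **migrate_kwargs(old_kwargs))` in code that still uses the older names.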

Bugfixes

Documentation

  • Add pdfplumber comparison and update Tabula (stream) comparison. Check out the wiki page.

0.4.1 (2018-12-05)

Bugfixes

0.4.0 (2018-11-23)

Improvements

  • #102 Detect tables automatically when Stream is used, by adding an implementation of Anssi Nurminen's table detection algorithm. #206 by Vinayak Mehta.

0.3.2 (2018-11-04)

Improvements

  • #186 Add _bbox attribute to table. #193 by Vinayak Mehta.
    • You can use table._bbox to get coordinates of the detected table.
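As a small usage sketch, the coordinates in `table._bbox` could be turned into a width and height. This assumes `_bbox` is a 4-tuple of the form `(x1, y1, x2, y2)`; the `bbox_size` and `table_sizes` helpers are hypothetical.

```python
def bbox_size(bbox):
    """Return (width, height) of an (x1, y1, x2, y2) bounding box."""
    x1, y1, x2, y2 = bbox
    return abs(x2 - x1), abs(y2 - y1)


def table_sizes(pdf_path):
    """Return the size of every table detected in pdf_path."""
    import camelot  # imported lazily so bbox_size() works without Camelot

    tables = camelot.read_pdf(pdf_path)
    # _bbox is a private attribute, so its shape may change between releases.
    return [bbox_size(table._bbox) for table in tables]
```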

0.3.1 (2018-11-02)

Improvements

  • Matplotlib is now an optional requirement. #190 by Vinayak Mehta.
    • You can install it using pip install "camelot-py[plot]".
  • #127 Add tests for plotting. Coverage is now at 87%! #179 by Suyash Behera.

0.3.0 (2018-10-28)

Improvements

  • #162 Add password keyword argument. #180 by rbares.
    • An encrypted PDF can now be decrypted by passing password='<PASSWORD>' to read_pdf or --password <PASSWORD> to the command-line interface. (Limited encryption algorithm support from PyPDF2.)
  • #139 Add suppress_warnings keyword argument. #155 by Jonathan Lloyd.
    • Warnings raised by Camelot can now be suppressed by passing suppress_warnings=True to read_pdf or --quiet to the command-line interface.
  • #154 The CLI can now be run using python -m. Try python -m camelot --help. #159 by Parth P Panchal.
  • #165 Rename table_area to table_areas. #171 by Parth P Panchal.

Bugfixes

  • Raise error if the ghostscript executable is not on the PATH variable. #166 by Vinayak Mehta.
  • Convert filename to lowercase to check for PDF extension. #169 by Vinicius Mesel.
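The lowercase check described in the last fix can be sketched as below. This is an illustration of the idea, not Camelot's actual code, and `looks_like_pdf` is a hypothetical name.

```python
def looks_like_pdf(filename):
    """Return True if filename ends in .pdf, ignoring letter case."""
    # Lowercasing first means "REPORT.PDF" and "report.pdf" are treated alike.
    return filename.lower().endswith(".pdf")
```

Without the `lower()` call, a file named `REPORT.PDF` would fail the extension check even though it is a valid PDF name.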

Files

  • #114 Add Makefile and make codecov run only once. #132 by Vaibhav Mule.
  • Add .editorconfig. #151 by KOLANICH.
  • Downgrade numpy version from 1.15.2 to 1.13.3.
  • Add requirements.txt for readthedocs.

Documentation

  • Add "Using conda" section to installation instructions.
  • Add readthedocs badge.

0.2.3 (2018-10-08)

  • Remove hard dependencies on requirements versions.

0.2.2 (2018-10-08)

Bugfixes

  • Move opencv-python to extra_requires. #134 by Vinayak Mehta.

0.2.1 (2018-10-05)

Bugfixes

  • #121 Fix ghostscript subprocess call for Windows. #124 by Vinayak Mehta.

Improvements

  • #123 Make PEP8 compatible. #125 by Oshawk.
  • #110 Add more tests. Coverage is now at 84%!
    • Add tests for __repr__. #128 by Vaibhav Mule.
    • Add tests for CLI. #122 by Vaibhav Mule and #117 by Vinayak Mehta.
    • Add tests for errors/warnings. #113 by Vinayak Mehta.
    • Add tests for output formats and parser kwargs. #126 by Vinayak Mehta.
  • Add Python 3.5 and 3.7 support. #119 by Vinayak Mehta.
  • Add logging and warnings.

Documentation

Note: This release also bumps the version for numpy from 1.13.3 to 1.15.2 and adds a MANIFEST.in. Also, openpyxl==2.5.8 is a new requirement and pytest-cov==2.6.0 is a new dev requirement.

0.2.0 (2018-09-28)

Improvements

  • #81 Add Python 3.6 support. #109 by Vinayak Mehta.

0.1.2 (2018-09-25)

Improvements

  • #85 Add Travis and Codecov.

0.1.1 (2018-09-24)

Documentation

  • Add documentation fixes.

0.1.0 (2018-09-24)

  • Rebirth!