Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ENH: Extend cmap compatibilty to GBK_EUC_H/V #1812

Merged
merged 1 commit into from
May 1, 2023

Conversation

pubpub-zz
Copy link
Collaborator

a new encoding to be considered.
fixes #1809

@codecov
Copy link

codecov bot commented Apr 25, 2023

Codecov Report

Patch and project coverage have no change.

Comparison is base (6053167) 93.39% compared to head (0974e7a) 93.39%.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #1812   +/-   ##
=======================================
  Coverage   93.39%   93.39%           
=======================================
  Files          34       34           
  Lines        6614     6614           
  Branches     1301     1301           
=======================================
  Hits         6177     6177           
  Misses        285      285           
  Partials      152      152           
Impacted Files Coverage Δ
pypdf/_cmap.py 95.29% <ø> (ø)

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

@pubpub-zz
Copy link
Collaborator Author

@MartinThoma
All yours. maybe we should wait for feedback from @Ontheroad123

@Ontheroad123
Copy link

@MartinThoma All yours. maybe we should wait for feedback from @Ontheroad123

It's ok when add encoding in '_cmap.py'

@MartinThoma
Copy link
Member

I would rewrite the text to use parametrization, but besides that it looks good to me :-)

I'll merge it once I'm back from vacation :-)

@MartinThoma MartinThoma changed the title ENH : extend cmap compatibilty to GBK_EUC_H/V ENH: Extend cmap compatibilty to GBK_EUC_H/V May 1, 2023
@MartinThoma MartinThoma merged commit dde4c79 into py-pdf:main May 1, 2023
MartinThoma added a commit that referenced this pull request May 21, 2023
New Features (ENH)
-  Simplify metadata input (Document Information Dictionary) (#1851)
-  Extend cmap compatibilty to GBK_EUC_H/V (#1812)

Bug Fixes (BUG)
-  Prevent infinite loop when no character follows after a comment (#1828)
-  get_contents does not return ContentStream (#1847)
-  Accept XYZ destination with zoom missing (default to zoom=0.0) (#1844)
-  Cope with 1 Bit images (#1815)

Robustness (ROB)
-  Handle missing /Type entry in Page tree (#1845)

Documentation (DOC)
-  Expand file size explanations (#1835)
-  Add comparison with pdfplumber (#1837)
-  Clarify that PyPDF2 is dead (#1827)
-  Add Hunter King as Contributor for #1806

Maintenance (MAINT)
-  Refactor internal Encryption class (#1821)
-  Add R parameter to generate_values (#1820)
-  Make encryption_key parameter of write_to_stream optional (#1819)
-  Prepare for adding AES enryption support (#1818)

Code Style (STY):
-  Iterate directly over the list instead of using range (#1839)
-  Minor refactorings in _encryption.py (#1822)

[Full Changelog](3.8.1...3.9.0)
@pubpub-zz pubpub-zz deleted the iss1809 branch June 24, 2023 08:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Parsing pdf garbled characters
3 participants