Bump tika-core from 1.26 to 2.1.0 #61

Merged


dependabot[bot]
Contributor

@dependabot dependabot bot commented on behalf of github Sep 30, 2021

Bumps tika-core from 1.26 to 2.1.0.

Changelog

Sourced from tika-core's changelog.

Release 2.1.1 - ???

  • Improve robustness and features of the httpfetcher (TIKA-3543)

  • Add optional fetch ranges to FetchEmitTuple to allow range fetching from, e.g. http or s3 (TIKA-3542).

  • Exclude dependencies on jsoup and ehcache in ucar grib/cdm (TIKA-3003).
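For readers unfamiliar with range fetching over HTTP, the sketch below shows how a byte range is usually expressed as a `Range` request header. It does not use Tika's `FetchEmitTuple` or the httpfetcher API; the class and method names are hypothetical and only illustrate the standard `bytes=start-end` form, where both offsets are inclusive.

```java
public class RangeHeaderSketch {

    // Hypothetical helper: builds the value of an HTTP Range header
    // for an inclusive byte range, e.g. bytes=0-1023 for the first 1024 bytes.
    static String rangeHeader(long start, long end) {
        if (start < 0 || end < start) {
            throw new IllegalArgumentException("invalid range: " + start + "-" + end);
        }
        return "bytes=" + start + "-" + end;
    }

    public static void main(String[] args) {
        // Prints the header line a client would send to request the first 1024 bytes.
        System.out.println("Range: " + rangeHeader(0, 1023));
    }
}
```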

Release 2.1.0 - 08/18/2021

MAJOR CHANGES in 2.1.0:

  • Improved packaging for tika-parsers-extended. Use the tika-parser-scientific-package and tika-parser-sqlite3-package artifacts if you want fat jars with dependencies. (TIKA-3510)

  • Tika app writes UTF-8 when an encoding is not specified; the legacy behavior was UTF-8 on Mac OS, but System default on other OSs (TIKA-3515).

  • Change the default rendering strategy for PDFs from NO_TEXT to ALL (TIKA-3520).
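The encoding change (TIKA-3515) matters for anyone who compared output bytes across operating systems. The following is a minimal sketch of the "UTF-8 unless told otherwise" rule in plain Java; it is not Tika's code, and `EncodingDefaultSketch`, `resolve`, and `encode` are hypothetical names used only for illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingDefaultSketch {

    // Hypothetical helper: fall back to UTF-8 instead of the platform
    // default when the caller does not name an encoding.
    static Charset resolve(String requestedEncoding) {
        return requestedEncoding == null
                ? StandardCharsets.UTF_8
                : Charset.forName(requestedEncoding);
    }

    // Encodes text with the resolved charset.
    static byte[] encode(String text, String requestedEncoding) {
        return text.getBytes(resolve(requestedEncoding));
    }

    public static void main(String[] args) {
        String text = "caf\u00e9"; // "café"
        System.out.println(encode(text, null).length);         // UTF-8: é takes two bytes
        System.out.println(encode(text, "ISO-8859-1").length); // Latin-1: é takes one byte
    }
}
```

Code that relied on the old system-default behavior on non-Mac platforms may need to pass an explicit encoding after the upgrade.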

Other changes:

  • Fixed bug that pointed to the wrong tessdata directory if the user specified a tesseract path but not also a tessdata path (TIKA-3518).

  • Fixed bug in Icu4j's encoding detector where it would return non-standard names for charsets, e.g. IBM424_rtl is now returned as IBM424 (TIKA-3516).

  • Add a simple UrlFetcher in tika-core as a basic alternative to tika-fetcher-http (TIKA-3527).

  • Add tika-pipes support for Google Cloud Storage (TIKA-3524).

  • Fix markup ordering errors in xhtml output for ODT files (TIKA-2242).

  • Fix serialization of embedded docs in OpenSearch emitter and fix embedded documents not being indexed in some use cases in the Solr emitter (TIKA-3490).

  • Add pipesClientId system property to PipesServer so that each forked process can log to its own logger (TIKA-3480).

  • Add DateNormalizingMetadataFilter to let users ensure that all dates emitted to Solr/OpenSearch are in UTC. Users can configure which timezone they'd like to use in cases where the file format does not store a timezone (TIKA-3496).

  • Breaking change in the Solr and OpenSearch emitters. To achieve

... (truncated)
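The date-normalization entry above (TIKA-3496) describes converting dates that carry no timezone into UTC by assuming a configured zone. The sketch below shows that idea with `java.time` only; it is not the `DateNormalizingMetadataFilter` implementation, and the class name, method name, and the example zone are assumptions made for illustration.

```java
import java.time.LocalDateTime;
import java.time.ZoneId;

public class UtcDateSketch {

    // Hypothetical helper: interprets a zone-less ISO-8601 local date-time
    // in the assumed zone and returns the same instant expressed in UTC.
    static String toUtc(String localIso, String assumedZone) {
        LocalDateTime local = LocalDateTime.parse(localIso);
        return local.atZone(ZoneId.of(assumedZone)).toInstant().toString();
    }

    public static void main(String[] args) {
        // A file that stores "10:00" without a zone, assumed to be New York time.
        System.out.println(toUtc("2021-08-18T10:00:00", "America/New_York"));
    }
}
```

The choice of assumed zone changes the emitted instant, which is why the changelog notes that it is configurable.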

Commits


Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot will merge this PR once CI passes on it, as requested by @AntonOellerer.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [gson](https://github.com/google/gson) from 2.8.7 to 2.8.8.
- [Release notes](https://github.com/google/gson/releases)
- [Changelog](https://github.com/google/gson/blob/master/CHANGELOG.md)
- [Commits](google/gson@gson-parent-2.8.7...gson-parent-2.8.8)

---
updated-dependencies:
- dependency-name: com.google.code.gson:gson
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Bumps [tika-core](https://github.com/apache/tika) from 1.26 to 2.1.0.
- [Release notes](https://github.com/apache/tika/releases)
- [Changelog](https://github.com/apache/tika/blob/main/CHANGES.txt)
- [Commits](https://github.com/apache/tika/commits)

---
updated-dependencies:
- dependency-name: org.apache.tika:tika-core
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot bot added dependencies Pull requests that update a dependency file java Pull requests that update Java code labels Sep 30, 2021
@AntonOellerer
Contributor

@dependabot merge

@github-actions

github-actions bot commented Oct 4, 2021

Unit Test Results

  2 files  ±0    2 suites  ±0   4s ⏱️ ±0s
12 tests ±0  12 ✔️ ±0  0 💤 ±0  0 ❌ ±0 

Results for commit 47e4c8b. ± Comparison against base commit 47e4c8b.

♻️ This comment has been updated with latest results.

@AntonOellerer AntonOellerer merged commit 47e4c8b into master Oct 4, 2021
@dependabot dependabot bot deleted the dependabot/gradle/org.apache.tika-tika-core-2.1.0 branch October 4, 2021 09:17