Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Paragraph content missing for 'Funding' (annex) section #895

Closed
mdparkin opened this issue Feb 28, 2022 · 0 comments · Fixed by #959
Closed

Paragraph content missing for 'Funding' (annex) section #895

mdparkin opened this issue Feb 28, 2022 · 0 comments · Fixed by #959
Assignees
Labels
bug From Hemiptera and especially its suborder Heteroptera enhancement

Comments

@mdparkin
Copy link

I am using GROBID to convert bioRxiv preprints PDFs to XML and finding that paragraph content with the heading 'Funding' is not being captured in the TEI XML output (version 0.7.0, lightweight Docker image).

Example PDF:
2021.09.27.461862v1.full.pdf

The 'Funding' section in this document appears immediately after the 'Acknowledgements' on Page 19 of the PDF. It is captured in the TEI XML as @type="annex", but the <p> element and text content is missing:

<div type="annex">
  <div 
    xmlns="http://www.tei-c.org/ns/1.0">
    <head>Funding</head>
   </div>
</div>

Although recognition of this section with some @type="funding" attribute would be the ideal scenario, for my use case I simply need the associated <p> content (i.e. "This work was funded by...") to be output in the TEI XML.

Many thanks in advance for looking into this issue.

@kermitt2 kermitt2 added bug From Hemiptera and especially its suborder Heteroptera enhancement labels Feb 28, 2022
@kermitt2 kermitt2 self-assigned this Feb 28, 2022
@lfoppiano lfoppiano assigned lfoppiano and unassigned kermitt2 Oct 12, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug From Hemiptera and especially its suborder Heteroptera enhancement
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants