Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add warning about confabulations to "New entry from plain text" #11825

Closed
ThiloteE opened this issue Sep 24, 2024 · 4 comments · Fixed by #11831
Closed

Add warning about confabulations to "New entry from plain text" #11825

ThiloteE opened this issue Sep 24, 2024 · 4 comments · Fixed by #11831
Assignees
Labels
AI Related to AI Chat/Summarization good first issue An issue intended for project-newcomers. Varies in difficulty.

Comments

@ThiloteE
Copy link
Member

Description of Problem:

"New entry from plain text" is based on the Grobid project, which uses machine learning techniques. Results are probability based, therefore not always 100% accurate.

We have had users that are confused by this and think the resulting data is or always should be true.

Solution:

Add a warning about confabulations or hallucinations in the GUI. It could be phrased similar to the warning that is shown to users that make use of the AI Chat or AI Summarization feature.

Additional context

@ThiloteE ThiloteE added the good first issue An issue intended for project-newcomers. Varies in difficulty. label Sep 24, 2024
@github-project-automation github-project-automation bot moved this to Free to take in Good First Issues Sep 24, 2024
@koppor
Copy link
Member

koppor commented Sep 24, 2024

Follow-up: Use the openAI API to create BibTeX. Has much better results. @InAnYan Can you "unzip" this feature wish? - Which code to use in JabRef to interact with openAI etc. Prompt is like "Please convert the following bibliography entry to a well-formatted high-quality BibTeX entry".

@ThiloteE
Copy link
Member Author

ThiloteE commented Sep 25, 2024

are there statistics about "much better results"?

@koppor
Copy link
Member

koppor commented Sep 25, 2024

are there statistics about "much better results"?

I tried with the example of our homepage - https://docs.jabref.org/collect/newentryfromplaintext

In: O. Kopp, A. Armbruster, und O. Zimmermann, "Markdown Architectural Decision Records: Format and Tool Support", in 10th ZEUS Workshop, 2018.

Out 1:

@Article{Kopp_2018abc,
  author       = {Kopp, O and Armbruster, A and Zimmermann, O},
  date         = {2018},
  journaltitle = {Communications},
  title        = {Le gigantisme architectural en Union soviétique},
  doi          = {10.3406/comm.1985.1625},
  number       = {1},
  pages        = {45-67},
  volume       = {42},
  publisher    = {PERSEE Program},
}

Out 2:

@inproceedings{kopp2018markdown,
  author    = {Oliver Kopp and Andreas Armbruster and Olaf Zimmermann},
  title     = {Markdown Architectural Decision Records: Format and Tool Support},
  booktitle = {Proceedings of the 10th ZEUS Workshop},
  year      = {2018},
}

Guess which output is from which tool!

I know that both are wrong. - Anita will go crazy.

Refined prompt:

Please convert the following bibliography entry to a well-formatted high-quality BibTeX entry. Do not invent additional information (such as full first names). Keep as close as possible to the input.

@inproceedings{kopp2018markdown,
  author    = {O. Kopp and A. Armbruster and O. Zimmermann},
  title     = {Markdown Architectural Decision Records: Format and Tool Support},
  booktitle = {Proceedings of the 10th ZEUS Workshop},
  year      = {2018}
}

@InAnYan
Copy link
Collaborator

InAnYan commented Sep 25, 2024

Thi Lo's right, I would like to see a paper for overview of plain citation parsers with comparisons LLM/GROBID

@ThiloteE ThiloteE moved this from Free to take to Assigned in Good First Issues Oct 5, 2024
@ThiloteE ThiloteE moved this from Assigned to In Progress in Good First Issues Oct 5, 2024
@ThiloteE ThiloteE added the AI Related to AI Chat/Summarization label Oct 5, 2024
@github-project-automation github-project-automation bot moved this from In Progress to Done in Good First Issues Oct 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
AI Related to AI Chat/Summarization good first issue An issue intended for project-newcomers. Varies in difficulty.
Projects
Status: Done
Development

Successfully merging a pull request may close this issue.

3 participants