Add test cases for "UnicodeToLatex" and "LatexToUnicode" #11061

AbdAlRahmanGad · 2024-03-20T21:24:33Z

I added the test case mentioned in #5547 and other test cases to ensure reliability.

The problem with the test case was that ı̄ is not one character it's a combination of ı + ̄ unlike ā. What was happennig is that ı was being converted to \i and then ̄ was being converted to {\={}} so the result would be {\i{\={}}}.

I made it that we deal with such characters that cause conflict after combining accents. They are stored in a new variable called UNICODE_LATEX_CONVERSION_MAP_AFTER_COMBINING_ACCENTS I think this will be the base to deal with such cases.

I also added some of the underdot characters as the previous implementation didn't handle them.

Mandatory checks

Change in CHANGELOG.md described in a way that is understandable for the average user (if applicable)
Tests created for changes (if applicable)
Manually tested changed features in running JabRef (always required)
Screenshots added in PR description (for UI changes)
Checked developer's documentation: Is the information available and up to date? If not, I outlined it in this pull request.
Checked documentation: Is the information available and up to date? If not, I created an issue at https://github.com/JabRef/user-documentation/issues or, even better, I submitted a pull request to the documentation repository.

calixtus · 2024-03-20T22:06:58Z

Maybe this could help: https://www.unicode.org/faq/char_combmark.html

koppor · 2024-03-20T23:13:00Z

I think, all the unicode should be normalized before conversion. Use the formatter introduced at #11056.

src/test/java/org/jabref/logic/layout/format/LatexToUnicodeFormatterTest.java

koppor · 2024-03-20T23:15:31Z

I made it that we deal with such characters that cause conflict after combining accents.

My comment above in other words: We need to rely on the normal form NFC. Base our internal maps on that. And do not introduce some other maps.

Normalize unicode before conversion, remove the new mapping, add one new test case

koppor

Thank you for the quick action taken. Just minor comments.

Future work: The part at " // Combining accents" will never be touched? One should check with "code coverage"

src/main/java/org/jabref/logic/formatter/bibtexfields/UnicodeToLatexFormatter.java

src/main/java/org/jabref/logic/util/strings/HTMLUnicodeConversionMaps.java

src/test/java/org/jabref/logic/layout/format/LatexToUnicodeFormatterTest.java

- Removed unnecessary line - Renamed the `normalizer` variable to `UNICODE_NORMALIZER` - Added link to the issue

Add test cases for "UnicodeToLatex" and "LatexToUnicode"

1893e5a

koppor reviewed Mar 20, 2024

View reviewed changes

src/test/java/org/jabref/logic/layout/format/LatexToUnicodeFormatterTest.java Outdated Show resolved Hide resolved

Normalize unicode before conversion

c49aef7

Normalize unicode before conversion, remove the new mapping, add one new test case

koppor requested changes Mar 21, 2024

View reviewed changes

Refactor: Remove whitespace and update variable name

7b8e2bc

- Removed unnecessary line - Renamed the `normalizer` variable to `UNICODE_NORMALIZER` - Added link to the issue

koppor approved these changes Mar 21, 2024

View reviewed changes

koppor added this pull request to the merge queue Mar 21, 2024

Merged via the queue into JabRef:main with commit 7bb9339 Mar 21, 2024
20 checks passed

AbdAlRahmanGad deleted the test_case branch April 13, 2024 07:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add test cases for "UnicodeToLatex" and "LatexToUnicode" #11061

Add test cases for "UnicodeToLatex" and "LatexToUnicode" #11061

AbdAlRahmanGad commented Mar 20, 2024

calixtus commented Mar 20, 2024

koppor commented Mar 20, 2024

koppor commented Mar 20, 2024

koppor left a comment

Add test cases for "UnicodeToLatex" and "LatexToUnicode" #11061

Add test cases for "UnicodeToLatex" and "LatexToUnicode" #11061

Conversation

AbdAlRahmanGad commented Mar 20, 2024

Mandatory checks

calixtus commented Mar 20, 2024

koppor commented Mar 20, 2024

koppor commented Mar 20, 2024

koppor left a comment

Choose a reason for hiding this comment