-
-
Notifications
You must be signed in to change notification settings - Fork 61
Suppression of name parsing using "
does not work
#151
Comments
Actually, the “protecting” double quotes should be kept when converting between biblio formats. I edited the “Expected:” bit above accordingly, assuming that When formatting a bibliography, however, they should of course be removed:
Actual:
Expected:
|
Instead of protecting names in yaml databases from being parsed, we could of course also adopt a convention that expects all names in yaml databases to be fully parsed already (unlike json databases). pandoc-citeproc then would not try to parse any names from yaml databases (unless the If we adopted this solution – which I’d favour – we’d also want to protect family names containing spaces upon export from yaml to json, or simply set the |
I'd prefer to allow unparsed names in YAML databases; it's easier for users not to have to worry about the various sorts of particles, etc. |
OK, I can see that parsing names by default might seem more convenient, but the CSL specs expect fully parsed names throughout. I also doubt that very many users actually write CSL JSON or CSL YAML databases by hand. Still, those who wish to have names parsed by pandoc currently can include a per-entry It was citeproc-js that introduced name parsing (listed in its specs under “Dirty Tricks”, http://gsl-nagoya-u.net/http/pub/citeproc-doc.html#input-data-rescue), with these specs saying names are only parsed if the As it seems, Zotero is currently being reworked to do the parsing itself, and using fully parsed names for data interchange with citeproc-js, thus strictly following the CSL specs again (https://forums.zotero.org/discussion/30974/any-idea-why-an-a-author-comes-last-in-the-bibliography/#Item_46). I feel the best solution for pandoc would be to introduce a metadata variable (
After that, anyone who wishes to have names from CSL JSON or CSL YAML databases parsed by pandoc would have to state this explicitly, either per-entry or globally. Note that this in no way affects bibtex/biblatex name parsing which seems perfectly ok (and outputs fully parsed names, which again is perfectly ok). |
Update: Zotero, starting with at least 4.0.28.1, is now exporting fully parsed names, see zotero/zotero@1cbd7f7 Thus, pandoc-citeproc should stop parsing names by default straight away, to avoid parsing names once too often. |
We have now stopped parsing names by default. Do we need special handling of double quotes? |
Well, if you do want to keep the option to parse two-field names (and I guess this makes sense), I feel this missing bit should be added to make it work correctly overall, and follow the inofficial standard introduced by citeproc-js. The parsing rules for two-field names do yield some false positives, i.e., elements that look like particles but in fact aren’t, so an option for protecting names is needed. I’d say it’s not one of the top priorities, though – #47 and #94 would be much higher on my list … :-) |
We now handle these double quotes -- effectively just ignoring the quotes. |
Well, the output from my original example now is
Note the smart quotes – but with a default of |
Are you sure you're using the latest development (not released)
version? I get
…---
references:
- id: item1
type: article-journal
author:
- family: de Man
given: Al
issued:
- year: '2019'
title: Test
...
+++ Nick Bart [Dec 22 17 21:40 ]:
Well, the output from my original example now is
---
references:
- id: item1
type: article-journal
author:
- family: “de Man”
given: Al
issued:
- year: '2019'
title: Test
...
Note the smart quotes – but with a default of parse-names: false there
shouldn’t be any quotes now.
—
You are receiving this because you modified the open/close state.
Reply to this email directly, [1]view it on GitHub, or [2]mute the
thread.
References
1. #151 (comment)
2. https://github.com/notifications/unsubscribe-auth/AAAL5GGno6t6Tu-Ky_Dr_MDfxhHtQkUbks5tDCHXgaJpZM4Ff9CR
|
I thought I was using the latest development version, but cannot reproduce this now. Sorry for the false alarm. |
In citeproc-js, parsing of particles can be suppressed to treat particles as part of the family name field by enclosing the family name field content in double quotes (http://gsl-nagoya-u.net/http/pub/citeproc-doc.html#particles-as-part-of-the-last-name). pandoc-citeproc should be fixed to allow this, and to remove the extra (smart) double quotes that currently appear in the output.
Example:
Expected:
Actual:
Note that, in addition, we see the effect of #130, too.
The text was updated successfully, but these errors were encountered: