-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Take 064 into account for transformation to RDF #266
Comments
We might consider aligning the RDF for pre-RDA and RDA records by removing "Formschlagwörter" from the subject array for pre-RDA records. See also hbz/lobid-rdf-to-json#23 (comment). |
As nobody asked for this, I'd say it is sufficient to do this in API 2.0. Thus, adding the label. |
Here is the core list of GND Formschlagwörter: http://access.rdatoolkit.org/document.php?id=nlgpschp7&target=nlgps07-27 Here is the extended list with all GND Formschlagörter (PDF): https://wiki.dnb.de/download/attachments/106042227/AH-007.pdf |
There is redundant information in MAB/Aleph fields
Some examples to take a closer look at: http://lobid.org/hbz01/HT019025947, http://lobid.org/hbz01/HT019025943, http://lobid.org/hbz01/HT018814546, http://lobid.org/hbz01/HT018913029, http://lobid.org/hbz01/HT018909174 |
By testing I found out that fields 051/052 aren't automatically generated from 064. For the core Formschlagwörter I took the first five hits of lobid.org/resource and looked at them at lobid.org/hbz01. For Autobiografie, Bibliografie, Biografie, Comic, Festschrift, Hochschulschrift, Hörbuch, Schulbuch, Website and Zeitschrift I found no file with a field 064. |
@ChristophEwertowski We have to check the RDA titles to see whether the 051/052 are automatically generated from 064. RDA are those with creation date after 2015-10-01. You can limit a query to those using the Elasticsearch query DSL, see https://www.elastic.co/guide/en/elasticsearch/reference/2.3/search-uri-request.html |
@dr0i showed me how to limit the queries to those created after a specific point in time using the URL. E.g. http://lobid.org/resources?q=describedby.dateCreated:%3E20151001 |
I confined my search to October 2015 and onwards and looked at it again. There are still cases where 064 doesn't exist but 051 does, so for these cases 051 isn't automatically generated from 064. Example: http://lobid.org/hbz01/HT018979011 In other cases both fields exist but contain different information. Example: http://lobid.org/hbz01/HT018976920
Since a biography could also exist in other forms, e.g. books, for this case 051 couldn't be generated from 064. |
And if you look closer at the first example you can see that in the hbz01 file it's described as a newspaper (http://lobid.org/hbz01/HT017458093, field 064) and in the lobid-resource as a journal (http://lobid.org/resource/HT017458093, type:bibo/Journal) which are two different publication types. |
The example you point to has Source data: <controlfield tag="052">pag||||aw||||||</controlfield> From the MAB documentation:
|
To get back, I sum up which points are open:
Are the fields 051/052 derived from 064 for RDA? (Probably not.) @acka47 which person would be the right contact person?
I'm going to tackle the first question by looking which and how much Formschlagwörter are already represented by mapping of 050-052. |
|
R.D. (Edoweb) just asked for the 064 in an email:
Here is a link to the example from the screenshot: http://lobid.org/resources/HT019149667 |
I think it will be hard to align 064 ("Nature of Content"/"Art des Inhalts", see ) with the information we already have about a resource from other fields (inlcuding Formschlagwörter). Thus, it might be the easiest way to just add |
As there are GND URIs given (I already linked to the PDF above that also lists the GND URIs), we will just use these along with the label given in subfield {
"@context":"http://lobid.org/resources/context.jsonld",
"id":"http://lobid.org/resources/HT019149667#!",
"natureOfContent":[
{
"id":"http://d-nb.info/gnd/4048476-2",
"label":"Ratgeber"
},
{
"id":"http://d-nb.info/gnd/4142300-8",
"label":"Amtliche Publikation"
}
]
} |
NatureOfContent is added. Example (production) / example (test). |
Looks good.+1 |
Deplyoed to prodcution, closing. |
Sub-issue of #161. The "Formschlagwörter" are in field 064 in RDA instead of being listed with the other subject headings.
Examples
http://lobid.org/resource/HT017458093 which has Formschlagwort ""Zeitung" but isn't typed as such yet:
http://lobid.org/resource/HT018781721 (snippet) which has Formschlagwort "Zeitschrift" and is already typed as
bibo:Journal
:http://lobid.org/resource/HT018772904 (Formschlagwort "Bibliographische Reihe" and already typed as
bibo:Series
):The text was updated successfully, but these errors were encountered: