Adjust transformation rules to RDA changes #161
Comments
Here is a list of RDA records in lobid, courtesy of @donboern: rda-ids.txt |
Most of the resources listed in rda-ids.txt seem to be periodicals. Here is a print book: http://lobid.org/resource/HT018779822 |
What seems important to me for a start is field 419, which holds the publisher and the place and date of publication. Snippet from http://lobid.org/resource/HT018779822: <datafield ind2="1" ind1="-" tag="419">
<subfield code="a">New York</subfield>
<subfield code="b">Routledge</subfield>
<subfield code="c">2014</subfield>
</datafield> Snippet from http://lobid.org/resource/HT018772912: <datafield ind2="1" ind1="-" tag="419">
<subfield code="a">Sundern</subfield>
<subfield code="b">Baulmann Leuchten GmbH</subfield>
<subfield code="c">2011-</subfield>
<subfield code="A">3</subfield>
</datafield> |
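To make the two snippets above concrete, here is a minimal Python sketch that reads a single 419 datafield into a small dictionary. The mapping of subfield codes a/b/c to place, publisher and date is only inferred from the two examples, the function name `parse_publication_statement` is hypothetical, and the code assumes the element carries no XML namespace, as in the snippets shown. Subfields with other codes (such as "A" in the second snippet) are skipped because their meaning is not stated here.

```python
import xml.etree.ElementTree as ET

# Assumed reading of the subfield codes, inferred from the two snippets only.
SUBFIELD_NAMES = {"a": "place", "b": "publisher", "c": "date"}


def parse_publication_statement(datafield_xml):
    """Return place/publisher/date from one <datafield tag="419"> element."""
    field = ET.fromstring(datafield_xml)
    if field.get("tag") != "419":
        raise ValueError("expected a datafield with tag 419")
    result = {}
    for subfield in field.findall("subfield"):
        name = SUBFIELD_NAMES.get(subfield.get("code"))
        if name is not None:
            # A repeated code would overwrite the earlier value in this sketch.
            result[name] = (subfield.text or "").strip()
    return result


# Second snippet from above (HT018772912), copied verbatim.
SAMPLE_419 = """<datafield ind2="1" ind1="-" tag="419">
<subfield code="a">Sundern</subfield>
<subfield code="b">Baulmann Leuchten GmbH</subfield>
<subfield code="c">2011-</subfield>
<subfield code="A">3</subfield>
</datafield>"""
```

Calling `parse_publication_statement(SAMPLE_419)` gives a dictionary with the keys `place`, `publisher` and `date`. The open-ended date "2011-" is kept as a plain string, since how to model such values in RDF is exactly what needs to be decided.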
Does this mean that fields are ambiguous (i.e. e.g. |
@DRoI Could you please get the Aleph XML source of all files in rda-ids.txt and put them in one file so that I can search for specific fields? |
|
The publisso stakeholders I spoke to want to work with the roles of persons/corporations from RDA. We will have to consider these in the transformation. Note to self: Take a look at this and open a separate issue. |
Updated rda-ids.alephMabXmlPretty.xml. I took DE-605-aleph-baseline-marcxchange-2016011515.tar.gz as the base, which identifies 16k resources as RDA. Hope this suffices. |
@dr0i Could you please update rda-ids.alephMabXmlPretty.xml once more? |
Provides a list of RDA records residing in hbz01 catalogue. See hbz/lobid#161.
Around 180k docs, concatenated in one big bzipped xml file: http://lobid.org/download/rda-ids.alephMabXmlPretty.xml.bz2 |
Thanks. As unwieldy as the file is getting, I won't ask for it to be created again. Now thinking about how to work with a 1.5 GB XML file... |
Depending on what you want, you can always use the friendly stream tools like |
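One possible way to search a file of this size for specific fields without loading it into memory is to stream the bzip2 archive line by line. The Python sketch below counts how often each datafield tag occurs. It assumes that the pretty-printed file keeps every `<datafield ...>` opening tag on a single line, as in the snippets earlier in this thread, and it deliberately avoids a full XML parser, so it would also work if the concatenated records do not form one well-formed document. The function name `count_field_tags` is hypothetical.

```python
import bz2
import re
from collections import Counter

# Matches the tag attribute of a datafield opening tag on a single line.
TAG_PATTERN = re.compile(r'<datafield\b[^>]*\btag="([^"]+)"')


def count_field_tags(path):
    """Count datafield tags in a bzip2-compressed, pretty-printed XML file."""
    counts = Counter()
    # Text mode decompresses lazily, so only one line is held at a time.
    with bz2.open(path, "rt", encoding="utf-8") as lines:
        for line in lines:
            for tag in TAG_PATTERN.findall(line):
                counts[tag] += 1
    return counts
```

For example, `count_field_tags("rda-ids.alephMabXmlPretty.xml.bz2")["419"]` would report how many 419 fields the downloaded dump contains, which gives a quick impression of how widely a field is used before any transformation rules are written.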
There seems to be a problem with the rda-ids.alephMabXmlPretty.xml. When I do for example
|
As a finger exercise, I looked at morph-hbz01-to-lobid.xml to check the fields which are now omitted and how they are transformed, and to document the results here.
|
Closing this super-issue as the two remaining sub-issues are sufficient for future orientation (and don't need to be implemented for the launch). |
From 1 October 2015, people will be cataloging in the hbz union catalog according to the RDA rules as documented here. We will have to adjust the transformation, i.e. the hbz01-to-lobid morph file, accordingly.
After a first cursory look at the documents, I suggest the following approach:
Identifying RDA records
RDA is only applied to newly catalogued resources, which get an RDA marker "r" in field 030, indicator=blank, position 4 of the Aleph sequentials (aseq); see the documentation. Thus, we will have to add the RDA transformation rules only for these records.
Checking fields that will be omitted
Several fields won't be used anymore with RDA cataloging. You can see the list here. We will check whether and how we currently transform these to RDF.
Find out how to transform the new data to lobid's current RDF data model
After having identified the data fields that change with RDA, we will have to find out how to integrate the new RDA data into the current lobid RDF.
Discuss how to handle breaks in the cataloging practice
While we will be able to make the transformation seamless for some parts of RDA cataloging, so that lobid customers won't even notice that things have changed, this may not be possible for all of the changes. E.g., regarding IMD (Inhaltstyp, Medientyp, Datenträgertyp)/CMC (content type, media type, carrier type), we will get better and more coherent information (see here for details).
In cases where cataloging practice breaks significantly, we will have to decide whether to do both: map the data to the old/current data model and also map it according to RDA.