-
Notifications
You must be signed in to change notification settings - Fork 9
Home
The Thoth User Manual, for publishers and other creators of metadata records in Thoth, can be found here.
In the digital realm, a Work usually consists of two constituent parts: data and metadata. The data comprise the contents of the publications, the information contained in it targeted at human readers, machine readers, or both. The metadata comprise all the data about the publication, such as its author, title, and subject classification.
Metadata are frequently also part of the Work. For example, the title and author are often mentioned on the opening pages, and the ISBN numbers are usually listed in the colophon. Despite this partial overlap, it is useful to distinguish between data and metadata, as they are handled in distinct manners in the Open Access book supply chain. There are several international Metadata Standards setting baseline quality criteria for metadata.
There are specific digital Data Formats and Metadata Formats that are supported by Thoth. An important subset of metadata is formed by Persistent Identifiers.
Thoth operates at several level in the Open Access book supply chain. We employ here the categorization of key stakeholders and intermediaries proposed in Michael Clarke and Laura Ricci's 2021 report OA Books Supply Chain Mapping (Clark & Ricci 2021).
Work records in Thoth allow Content Creators to add information about Funding by referencing an Institution by means of Persistent Identifiers as well as further grant program and project information. Content Funders are able to harvest these data through one of the Thoth Metadata Formats or our Open API.
Libraries, both University Libraries and National Libraries, have become increasingly important Content Funders in the OA Book Supply Chain. Thoth is partially funded by library subscriptions through the Open Book Collective and in return Thoth provides high-quality metadata in a range of Metadata Formats including MARC 21 that libraries can ingest into their Library Management Systems.
Thoth is primarily designed as a platform for Content Creators, in particular Open Access Publishers. Thoth provides integrated services for the maintenance, management, and dissemination of metadata records in a wide variety of Metadata Formats to a large selection of Content Platforms, Distributors, and Catalogs & Indices.
Publishers may use one of the available Publishing Platforms, which allow authors, editors, and publishers to collaborate in a digital, in-browser environment. Thoth is currently collaborating collaborating with Open Monograph Press and PubPub to improve integration with their in-platform metadata management functionalities.
Individuals authors are not a targeted user group of Thoth. They may manage their private bibliographic metadata on one of the available commercial or open source Bibliographic Reference Management Platforms and upload their research directly to one of the Green OA Repositories.
Thoth currently supports the export of metadata to all available Bibliographic Reference Management Platforms via BibTeX.
Ebook Aggregators "license and consolidate titles from many publishers into one combined database, [… and] often combine OA and paid-access titles for greater discoverability and convenience" (Clark & Ricci 2021).
Thoth currently supports the export of metadata to Baobab ebooks, EBSCO eBooks, JSTOR, Project MUSE, and ProQuest Ebook Central.
OA Platforms and Repositories "have no underlying infrastructure for the buying and selling of books, and are intended to host exclusively free or OA content" (Clark & Ricci 2021).
Thoth currently supports the export of metadata to OAPEN.
Consumer Ebook Platforms "offer titles for an individual’s use and access, and do not actively support institutional or library integration" (Clark & Ricci 2021).
Thoth currently supports the export of metadata to Google Play Books.
Shadow Libraries are online databases of readily available content that is normally obscured or otherwise not readily accessible. Such content may be inaccessible for a number of reasons, including the use of paywalls, copyright controls, or other barriers to accessibility placed upon the content by its original owners" (Wikipedia).
Thoth currently does not support export of metadata to any of the shadow libraries.
Ebook Distributors differ from Digital Libraries in the sense that they do not claim to offer a scholarly function, be that to research institutions or to the general public. Distributors repackage and normalize ebook metadata. Most ebook distributors operate some form of monetization scheme, which may not be hospitable to OA books.
Thoth currently supports the export of metadata to OverDrive and RNIB Bookshare.
Library-agnostic, global content indices.
- Central KnowledgeBase (ProQuest ExLibris Alma)
- BDSLive*
- EBSCO Knowledge Base
- JISC Knowledge Base+ (KB+)
- ProQuest 360 Core
- WikiData+
- WorldCat KnowledgeBase (OCLC)
- Diamond Discovery Hub (OpenAIRE/EOSC/DIAMAS/CRAFT-OA)
Search engines built on top of publication databases tailored toward researches. See Jeroen Bosman's excellent Scholarly search engine comparison for useful information regarding availability of publication types, etc. See also the recent comparative study by Visser, Van Eck, and Waltman.
- BASE (Bielefeld Academic Search Engine)
- Connected Papers
- Dimensions
- FAIRsFAIR
- FatCat (Internet Archive)
- FederatedFAIR
- IEEE Xplore
- Inciteful
- Internet Archive Scholar (Internet Archive)
- ISIDORE (huma-num)
- Lens
- LexisNexis
- Library Hub Discover* (Jisc)
- MathSciNet
- OpenAIRE Research Graph
- OpenAlex
- Open Research Knowledge graph
- OpenTexts.World
- Orion Search
- PubMed
- ResearchGraph
- S2ORC - Semantic Scholar Open Research Corpus
- ScienceOpen+
- SciFinder
- Scopus (RELX)
- Semantic Scholar
- SHARE (ARL/COS)
- Summon (ProQuest)
- Triple
- Web of Science (Clarivate)
- WorldCat Discovery* (OCLC)
Bibliographies managed by scholarly organizations related to a specific field of inquiry.
- MLA Bibliography
- Database of Medieval Digital Resources (Medieval Academy of America)
Print Book Distributors allow both authors and publisher to publish print-on-demand hardcopies. They require the manual input of metadata upload of print-ready PDF files without apparent batch or automated upload options.
Open Educational Resources (OER) focus mainly on textbooks rather than scholarly publications.
Name | Governance | Membership | OA | non-OA | Ingest |
---|---|---|---|---|---|
BCcampus OpenEd | Public | N | Y | N | Push |
Merlot | Public | Y | Y | N | Push |
OER Commons | Non-Profit | Y | Y | N | Push |
Open Textbook Library | Non-Profit | N | Y | N | Push |
For open archiving there is an Open Archiving Information System (OAIS) ISO definition.
See also list: https://coptr.digipres.org/Category:Preservation_System
- Archive-It
- Archivematica
- Arkivum Perpetua
- APTrust
- CINES
- CLOCKSS
- Conifer
- Digital Bedrock
- Emulation as a Service Infrastructure (EaaSI)
- Fulcrum
- GitHub Archive Programme
- HathiTrust
- Internet Archive
- LIBSAFE
- LOCKSS
- MetaArchive
- Perma.cc
- Portico
- Preservica
- ReplayWeb.Page
- RODA
- Samvera
- Wayback Machine
- WebCite
- Zenodo
- Unicode OCR (??)
The Thoth Wiki has been developed in the context of the COPIM (Community-led Open Publication Infrastructures for Monographs) project. Individual contributions to the wiki have been made by Tim Elfenbein, Rupert Gatti, Ross Higman, Brendan O'Connell, Vincent W.J. van Gerven Oei, Tobias Steiner and Hannah Hillen under the general editorship of Van Gerven Oei. All data are available under a CC-BY 4.0 license.