deutschestextarchiv/dingler

Dingler Online

This repository contains all data created and compiled by the DFG project “Digitalisierung des Polytechnischen Journals”.

NB: Facsimile images and the generated content needed to run a website are not included.

Building a website with minimal requirements

Prerequisites to generate a static website

  • GNU make (other make implementations may work as well)
  • saxonb-xslt (available in Debian-like Linux distributions via libsaxonb-java)
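
Before building, it can help to confirm that both tools are on the PATH. The following is a minimal sketch, not part of the repository: `check_tools` is a hypothetical helper name, and the install hint assumes a Debian-like system where the `make` and `libsaxonb-java` packages provide the two commands.

```shell
#!/bin/sh
# Report which of the given tools are missing from PATH.
# Prints one line per missing tool; returns non-zero if anything is missing.
# (check_tools is a hypothetical helper, not shipped with this repository.)
check_tools() {
  missing=0
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing: $tool"
      missing=1
    fi
  done
  return "$missing"
}

# The two tools named in the prerequisites list.
if check_tools make saxonb-xslt; then
  echo "all prerequisites found"
else
  echo "on Debian-like systems: apt-get install make libsaxonb-java" >&2
fi
```
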

Install MathJax as a git submodule

git submodule init
git submodule update

Extract articles from volumes and build a static website

make articles
make -C web all

# or, if you are impatient and have multiple cores available:
make articles -j4
make -C web all -j4
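
Instead of hard-coding `-j4`, the job count can be derived from the machine. The sketch below only prints the two build commands with the detected core count; it does not run them. `build_commands` is a hypothetical helper name, and the use of `nproc` (GNU coreutils) with a `getconf` fallback is an assumption about the build host.

```shell
#!/bin/sh
# Print the two build commands from this README for a given job count.
# Nothing is executed here; the output is meant to be reviewed or piped to sh.
# (build_commands is a hypothetical helper, not shipped with this repository.)
build_commands() {
  njobs=${1:-1}
  echo "make articles -j$njobs"
  echo "make -C web all -j$njobs"
}

# Detect the core count; fall back to 1 if neither helper is available.
cores=$(nproc 2>/dev/null || getconf _NPROCESSORS_ONLN 2>/dev/null || echo 1)

build_commands "$cores"
```

Saved under a name of your choice and run from the repository root, the output can be piped to `sh -e` so that the second step only runs if the first one succeeds.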

How to run a static website

Apache configuration snippet, assuming your checkout resides under /var/www/dingler and you want to serve the site as /dingler:

<Directory /var/www/dingler/web/site>
  DirectoryIndex index.html
  Options +Indexes +FollowSymLinks
  Require all granted
</Directory>
Alias /dingler /var/www/dingler/web/site
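
If the checkout lives elsewhere or the site should be served under a different URL prefix, the same snippet can be generated from two parameters. This is a sketch under stated assumptions: `apache_snippet` is a hypothetical helper, and `/srv/dingler` and `/journal` are made-up example values.

```shell
#!/bin/sh
# Print the Apache snippet for a given checkout directory and URL prefix.
# Both arguments default to the values used in this README.
# (apache_snippet is a hypothetical helper, not shipped with this repository.)
apache_snippet() {
  checkout=${1:-/var/www/dingler}
  prefix=${2:-/dingler}
  cat <<EOF
<Directory $checkout/web/site>
  DirectoryIndex index.html
  Options +Indexes +FollowSymLinks
  Require all granted
</Directory>
Alias $prefix $checkout/web/site
EOF
}

# Example values only: a checkout under /srv/dingler, served as /journal.
apache_snippet /srv/dingler /journal
```

`Require all granted` is Apache 2.4 syntax. After adding the output to the server configuration, `apachectl configtest` checks it for syntax errors before a reload.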

How to obtain facsimile images

The facsimile images are not part of this repository; download them from the SLUB pages.

Progress of this project

Build

  • extract articles and miscellanea from volumes
    • add volume id (e.g. pjXXX) to article/miscellanea file
    • adjust URLs in <idno type="URLXML"> for volumes and articles

Website

  • index page and documentation
  • create volume index
  • HTML version of volumes
  • HTML version of articles:
    • link facsimile
  • link to XML sources
  • styling TEI elements within <text>:
    • <add>
    • <bibl>
    • <cb>
    • <cell>
    • <choice>
      • <sic>/<corr>
    • <date>
    • <div>
    • <div type="continuation">
    • <figure>
    • <formula>
    • <front>
    • <head>
    • <hi>
    • <item>
    • <l>
    • <lb>
    • <lg>
    • <list>
    • <milestone>
    • <note>
    • <p>
    • <pb>
    • <persName>
    • <placeName>
    • <q>
    • <ref> (to articles)
    • <row>
    • <table>
    • <titlePart>
    • <unclear>

Images

  • XSLT processing for IMT
  • thumbnail generation

Search

  • basic dstar search
  • documentation
  • time series plots

Registers

  • patents
  • persons
  • sources
  • tables (aka figures)

Article rendering

  • figure tables, resp. IMT XML embedding
  • footnotes
  • keyword cloud (most common nouns)
  • links to SLUB

Downloadable packages

  • journals
  • articles
  • CMDI files

Miscellaneous stuff

  • GND/BEACON file
  • list of curious articles
  • 404 page
  • use dwds.de search API

Directory structure

  • data: generated files
    • articles: articles extracted from volumes
    • cmdi: CMDI metadata for articles
  • images: facsimile images
  • scripts: processing tools
  • sources: project sources
    • volumes: volumes as TEI P5 XML
      • …/$volume: table figures
  • web: files for web presentation
    • documentation: project documentation
    • site: (generated) site content
      • assets: static site assets
        • bootstrap: Bootstrap files
        • css: CSS files
        • fonts: font files
        • images: image files
        • js: JavaScript files
    • xslt: XSLT files for various pages

Known issues

  • facsimiles for vol. 8 are out of order
  • vol. 12: many pages are available only as thumbnails instead of full-size images