Skip to content
@impresso

Media Monitoring of the Past

Media Monitoring of the Past - Beyond Borders: Connecting Historical Newspapers and Radio.

Impresso Project Logo

About

Hi there đź‘‹ !

Impresso - Media Monitoring of the Past is an interdisciplinary research project that uses machine learning to pursue a paradigm shift in the processing, semantic enrichment, representation, exploration and study of historical media across modalities, temporal, linguistic, and national borders. The project has received two rounds of funding, from 2017-2020 and 2023-2027 (hence, there is code from both periods).

We design and develop the Impresso Web App and the upcoming Impresso Datalab (coming soon), while conducting research at the intersection of Natural Language Processing, Design, and History. Find more details on the project website.

Contents

This GitHub organization hosts numerous repositories dedicated to:

  • the code behind the Web App and Datalab. While a few repositories are public, many are still private. We aim to document and release code properly as it matures and becomes ready;
  • code supporting research efforts;
  • code from student projects.

More information and highlights will be shared as we continue to make progress! In addition to the public repositories listed below, you can also check out our models on the Impresso Hugging Face organisation.

Impresso 2 release history

(to come)

Popular repositories Loading

  1. named-entity-tutorial-dh2019 named-entity-tutorial-dh2019 Public

    Tutorial on NE processing for Digital Humanities - DH Utrech 2019

    Jupyter Notebook 25 4

  2. CLEF-HIPE-2020 CLEF-HIPE-2020 Public

    Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at CLEF 2020.

    SCSS 22 5

  3. NZZ-black-letter-ground-truth NZZ-black-letter-ground-truth Public

    8 1

  4. impresso-text-acquisition impresso-text-acquisition Public

    🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.

    Jupyter Notebook 7 2

  5. impresso-frontend impresso-frontend Public

    🚀 The frontend application of the Impresso WebApp http://impresso-project.ch/app

    Vue 5

  6. impresso.github.io impresso.github.io Public

    HTML 3 4

Repositories

Showing 10 of 47 repositories
  • impresso-user-admin Public

    Basic Django admin to manage user-related data in Impresso's Master DB.

    impresso/impresso-user-admin’s past year of commit activity
    Python 1 AGPL-3.0 0 3 0 Updated Dec 23, 2024
  • impresso-essentials Public

    ⚙️ Python package highly reusable modules and functions within impresso.

    impresso/impresso-essentials’s past year of commit activity
    Python 0 GPL-3.0 1 6 2 Updated Dec 20, 2024
  • impresso-middle-layer Public

    Middle layer API

    impresso/impresso-middle-layer’s past year of commit activity
    JavaScript 0 AGPL-3.0 1 16 2 Updated Dec 18, 2024
  • impresso-frontend Public

    🚀 The frontend application of the Impresso WebApp http://impresso-project.ch/app

    impresso/impresso-frontend’s past year of commit activity
    Vue 5 AGPL-3.0 0 169 (2 issues need help) 8 Updated Dec 18, 2024
  • impresso-datalab Public

    Impresso Datalab static Astro website

    impresso/impresso-datalab’s past year of commit activity
    MDX 0 AGPL-3.0 0 14 0 Updated Dec 16, 2024
  • paraphrasus Public
    impresso/paraphrasus’s past year of commit activity
    Jupyter Notebook 2 AGPL-3.0 1 0 0 Updated Dec 16, 2024
  • transmedia Public

    Website for the Transmedia History Conference

    impresso/transmedia’s past year of commit activity
    HTML 1 AGPL-3.0 0 0 0 Updated Dec 13, 2024
  • impresso-passim Public

    This repository contains code and sample data related to running the impresso corpus through the text reuse detection software passim.

    impresso/impresso-passim’s past year of commit activity
    Jupyter Notebook 0 AGPL-3.0 0 5 0 Updated Dec 13, 2024
  • impresso-schemas Public

    Repository of JSON schemas used in the Impresso project.

    impresso/impresso-schemas’s past year of commit activity
    Python 3 AGPL-3.0 3 5 0 Updated Dec 12, 2024
  • impresso-linguistic-processing Public

    Code for running spaCy on rebuilt impresso data.

    impresso/impresso-linguistic-processing’s past year of commit activity
    Python 0 AGPL-3.0 0 0 0 Updated Dec 10, 2024

Top languages

Loading…

Most used topics

Loading…