Skip to content
@nla

National Library of Australia

Popular repositories Loading

  1. outbackcdx outbackcdx Public

    Web archive index server based on RocksDB

    Java 32 20

  2. httrack2warc httrack2warc Public

    Converts HTTrack crawls to WARC files

    Java 30 6

  3. chropro chropro Public archive

    Chrome debugging protocol client for Java

    Java 10 2

  4. solrbackup solrbackup Public

    Python script for backing up a remote Solr 4 core or SolrCloud cluster

    Python 9 6

  5. chronicrawl chronicrawl Public archive

    Experimental continouous web crawler for web archiving

    Java 9

  6. bamboo bamboo Public

    Web archive collection manager

    Java 8 4

Repositories

Showing 10 of 75 repositories
  • pandas4 Public

    Web archive workflow system

    nla/pandas4’s past year of commit activity
    Java 3 Apache-2.0 2 16 1 Updated Nov 21, 2024
  • nla-blacklight Public

    Discovery application for the National Library of Australia's catalogue

    nla/nla-blacklight’s past year of commit activity
    Ruby 0 1 0 7 Updated Nov 21, 2024
  • nla-arclight Public

    Custom implementation of ArcLight for The National Library of Australia.

    nla/nla-arclight’s past year of commit activity
    Ruby 0 0 0 6 Updated Nov 20, 2024
  • outbackcdx Public

    Web archive index server based on RocksDB

    nla/outbackcdx’s past year of commit activity
    Java 32 Apache-2.0 20 18 0 Updated Nov 20, 2024
  • bamboo Public

    Web archive collection manager

    nla/bamboo’s past year of commit activity
    Java 8 Apache-2.0 4 9 1 Updated Nov 20, 2024
  • nla-blacklight_common Public

    Common functionality for Blacklight and ArcLight applications

    nla/nla-blacklight_common’s past year of commit activity
    Ruby 0 0 0 5 Updated Nov 18, 2024
  • nla-pywb Public

    pywb config overlay for the Australian Web Archive

    nla/nla-pywb’s past year of commit activity
    HTML 2 0 1 0 Updated Nov 12, 2024
  • heritrix3 Public Forked from internetarchive/heritrix3

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

    nla/heritrix3’s past year of commit activity
    Java 0 778 0 0 Updated Nov 7, 2024
  • ai-scout-audio2 Public

    AI audio proof of concept #2 - read TEI transcripts, build SOLR index with nomic embeddings, exploratory search and delivery web interface

    nla/ai-scout-audio2’s past year of commit activity
    JavaScript 0 0 0 6 Updated Oct 22, 2024
  • ai-scout-imageSearchComparison Public

    Simple website to capture evaluation of different ways to search images.

    nla/ai-scout-imageSearchComparison’s past year of commit activity
    EJS 1 0 0 7 Updated Oct 10, 2024

Top languages

Loading…

Most used topics

Loading…