Skip to content
@Living-with-machines

Living with Machines

A radical collaboration between computational linguists, curators, data scientists, software engineers, geographers and historians

Popular repositories Loading

  1. DeezyMatch DeezyMatch Public

    A Flexible Deep Learning Approach to Fuzzy String Matching

    Jupyter Notebook 141 35

  2. histLM histLM Public

    Neural Language Models for Historical Research

    Jupyter Notebook 24 21

  3. DiachronicEmb-BigHistData DiachronicEmb-BigHistData Public

    Tools to train and explore diachronic word embeddings from Big Historical Data

    Jupyter Notebook 21 2

  4. nnanno nnanno Public

    nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset

    Jupyter Notebook 17 1

  5. deduplify deduplify Public archive

    A Python tool to search for and remove duplicated files in messy datasets

    Python 16 2

  6. alto2txt alto2txt Public

    Convert ALTO XML to plain text + minimal metadata

    Python 15 2

Repositories

Showing 10 of 55 repositories
  • alto2txt2csv Public

    Code for converting zipped alto2txt output to csv files

    Living-with-machines/alto2txt2csv’s past year of commit activity
    0 MIT 0 0 0 Updated Feb 18, 2025
  • alto2txt2fixture Public

    Converts metadata from alto2txt into JSON data with corresponding relational IDs for ingestion into a relational database

    Living-with-machines/alto2txt2fixture’s past year of commit activity
    Python 0 MIT 1 11 4 Updated Feb 17, 2025
  • lwmdb Public

    A django-based library for managing the Living with Machines newspapers metadata database schema

    Living-with-machines/lwmdb’s past year of commit activity
    CSS 2 MIT 0 35 10 Updated Feb 17, 2025
  • T-Res Public

    A Toponym Resolution Pipeline for Digitised Historical Newspapers

    Living-with-machines/T-Res’s past year of commit activity
    Python 8 1 33 5 Updated Feb 17, 2025
  • presswords Public

    Code for the counts data derived from historical newspapers

    Living-with-machines/presswords’s past year of commit activity
    0 MIT 0 0 0 Updated Feb 12, 2025
  • jisc-wrangler Public

    Tool for restructuring data in the JISC 19th Century British Library Newspaper collection

    Living-with-machines/jisc-wrangler’s past year of commit activity
    Python 0 MIT 0 0 3 Updated Feb 12, 2025
  • zoonyper Public

    Code to make it easy to import and process Zooniverse annotations and their metadata in Python/Jupyter Notebooks

    Living-with-machines/zoonyper’s past year of commit activity
    Python 0 MIT 2 0 3 Updated Feb 12, 2025
  • TheLivingMachine Public

    Computational Approaches to the Nineteenth-Century Language of Technology

    Living-with-machines/TheLivingMachine’s past year of commit activity
    Jupyter Notebook 1 MIT 0 5 1 Updated Feb 11, 2025
  • DiachronicEmb-BigHistData Public

    Tools to train and explore diachronic word embeddings from Big Historical Data

    Living-with-machines/DiachronicEmb-BigHistData’s past year of commit activity
    Jupyter Notebook 21 MIT 2 2 0 Updated Jan 30, 2025
  • dhoxss-text2tech Public

    Materials for the Text to Tech workshop at the Digital Humanities Oxford Summer School

    Living-with-machines/dhoxss-text2tech’s past year of commit activity
    Jupyter Notebook 13 MIT 2 0 2 Updated Jan 22, 2025

Top languages

Loading…

Most used topics

Loading…