Projects

I develop open-source tools for linguistic research, primarily in Python. My software focuses on data management, analysis, transformation, and visualization, usually related to languages. Check my GitHub profile for more.

Major Projects

  • TITUS 2.0 - Next generation of the TITUS database for historical linguistics and ancient Indo-European languages
  • Swiss German corpus search filter map - Spatially modelling dialect areas used in the Idiotikon and integrating them into an interactive search map
  • CDTD (Comparative Dictionary of Tibetan Dialects) - Salvaging legacy HyperCard data and preparing publication of the second volume

Cariban Language Projects

My work with Cariban languages spans documentation, analysis, and the development of digital resources:

Open-Source Linguistic Tools

Document Preparation

  • lingdocs - Write data-rich linguistics documents with integrated CLDF dataset support and multiple output formats
  • expex-acro - LaTeX package for glossing abbreviations and linguistic markup

Corpus Management & Analysis

  • cldf-ldd - Component collection for linguistic descriptive data in CLDF
  • pyradigms - Compose and decompose linguistic paradigms (essentially pivot tables for linguistics)

Data Conversion & Integration

Visualization & Mapping

  • lingtreemaps - Plot linguistic data simultaneously on phylogenetic trees and geographic maps

Utilities