Projects
I develop open-source tools for linguistic research, primarily in Python. My software focuses on data management, analysis, transformation, and visualization, usually related to languages. Check my GitHub profile for more.
Major Projects
- TITUS 2.0 - Next generation of the TITUS database for historical linguistics and ancient Indo-European languages
- Swiss German corpus search filter map - Spatially modelling dialect areas used in the Idiotikon and integrating them into an interactive search map
- CDTD (Comparative Dictionary of Tibetan Dialects) - Salvaging legacy HyperCard data and preparing publication of the second volume
Cariban Language Projects
My work with Cariban languages spans documentation, analysis, and the development of digital resources:
- Comparative Cariban Database - A collection of linguistic data on Cariban languages (companion app to my dissertation), structured as a CLDF dataset and served through a CLLD web application (Source Code)
- A digital sketch grammar of Yawarana - Digital grammar of Yawarana, a Cariban language spoken in Venezuela
- Yawarana Corpus (CLDF) - Structured corpus dataset
- Morphological parser for Yawarana - Computational morphological analysis tools
Open-Source Linguistic Tools
Document Preparation
- lingdocs - Write data-rich linguistics documents with integrated CLDF dataset support and multiple output formats
- expex-acro - LaTeX package for glossing abbreviations and linguistic markup
Corpus Management & Analysis
- cldf-ldd - Component collection for linguistic descriptive data in CLDF
- pyradigms - Compose and decompose linguistic paradigms (essentially pivot tables for linguistics)
Data Conversion & Integration
- unboxer - Extract data from Shoebox and Toolbox to CLDF-ready formats
- cldflex - Convert FLEx data to CLDF-ready formats
Visualization & Mapping
- lingtreemaps - Plot linguistic data simultaneously on phylogenetic trees and geographic maps
Utilities
- humidifier - Create human-friendly IDs from strings
- biblatex2bibtex - Convert BibLaTeX files to BibTeX format