I am a data science consultant with 10+ years of experience.
- Designing, implementing, and running data-driven projects / products, by myself or with others
- I enjoy helping other people figure things out
- 🔮 Lots of forecasting, prediction, and classification using statistical and machine learning models for tabular and time-series data
- 📈 A little bit of causal modeling
- 📝 An even smaller (but growing) bit of text, video, and speech analysis
- 🏢 I’ve done work for clients like Leidos, Duke University, the University of Gothenburg (V-Dem Institute), the US Intelligence Advanced Research Projects Agency (through USC’s ISI), and a startup pitch coach
- 💻 Technical experience with Python, R, SQL, databases, git, Docker, Flask, cloud solutions (AWS)
- 🎓 Ph.D. in Political Science
- 📍 I’m physically in Tallinn, Estonia
If you are interested in hiring me, get in touch or check out my LinkedIn profile for more information.
Where you can find me
What else is here
For a list of academic publications, see my research page.
For a brief period, I blogged.
POLECAT event data: Some resources for the POLECAT event data, which is available at https://dataverse.harvard.edu/dataverse/POLECAT.
et1000: the 1,000 most common Estonian words: Estonian is a somewhat boutique language. At the time I did this, you couldn’t find a list of the most commonly used Estonian words online, so I made one.
R package doc pages
icews: The ICEWS event data consists of more than 270 million event data records extracted from global news stories. The raw data is delivered via Dataverse. The icews R package automates the process of keeping an up-to-date local copy, using either a file- or SQLite-based storage backend.
states: I used to frequently work with global data for independent states. This package has some utility functions for making it easier to work with the two major lists of state system membership, Gleditsch & Ward and Correlates of War (COW).
spduration: Implements a time-varying covariate split-population duration regression model for survival data where an unknown portion of the cases are immune from failure. These are sometimes also called cure models.
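To give a feel for the split-population idea, here is a minimal Python sketch of the log-likelihood for a basic cure model. It is not the spduration implementation: it assumes a Weibull duration distribution, a single constant at-risk probability, and no time-varying covariates, and all function and parameter names (`split_population_loglik`, `p_at_risk`, and so on) are hypothetical.

```python
import math


def weibull_log_density(t, shape, scale):
    """Log of the Weibull density f(t) with the given shape and scale."""
    return (
        math.log(shape)
        - math.log(scale)
        + (shape - 1.0) * (math.log(t) - math.log(scale))
        - (t / scale) ** shape
    )


def weibull_log_survival(t, shape, scale):
    """Log of the Weibull survival function S(t) = exp(-(t / scale) ** shape)."""
    return -((t / scale) ** shape)


def split_population_loglik(durations, failed, p_at_risk, shape, scale):
    """Log-likelihood of a simple split-population (cure) model.

    Assumption for illustration: each case is at risk with probability
    p_at_risk and immune otherwise.
      - Observed failure at t:  p_at_risk * f(t)
      - Censored at t:          (1 - p_at_risk) + p_at_risk * S(t)
    """
    total = 0.0
    for t, did_fail in zip(durations, failed):
        if did_fail:
            # Only at-risk cases can fail.
            total += math.log(p_at_risk) + weibull_log_density(t, shape, scale)
        else:
            # A censored case is either immune or at risk but still surviving.
            survival = math.exp(weibull_log_survival(t, shape, scale))
            total += math.log((1.0 - p_at_risk) + p_at_risk * survival)
    return total


# Made-up example data: three durations, one of them censored.
example = split_population_loglik(
    durations=[2.0, 5.0, 3.0],
    failed=[True, False, True],
    p_at_risk=0.6,
    shape=1.2,
    scale=4.0,
)
print(example)
```

In practice the at-risk probability and the duration parameters would each depend on covariates and be estimated by maximizing this kind of likelihood.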
Bonus trivia
Thank you for reaching the bottom of my page. Your reward is this:
The 5th most interesting thing about me is that at the 2017 MyFitness Madness City Race at the Tallinn Song Festival Grounds, I was, due to a clerical error, part of the best all-female team.