I create text corpora as part of my linguistic work.
I have expertise in several aspects of corpus construction:
- Manual syntactic parsing with close to 100% accuracy
- Design of rigid, linguistically motivated annotation guidelines
- Basic natural language processing techniques like automatic parsing and POS tagging
- Professional text documentation
You can see some of my corpus construction projects below.
1 / 3
The Parsed Corpus of Middle English Poetry (PCMEP)
The Parsed Corpus of Middle English Poetry
is a fully annotated and syntactically parsed corpus of Middle English verse texts, c. 1150-1400, to supplement the scant prose texts of that time.
2 / 3
The Student-Transcribed Corpus of Spoken American English
3 / 3
The Corpus of Late Medieval English Prose (CoLMEP)
The Corpus of Late Medieval English Prose is a new project with the aim of annotating prose texts of the period 1350-1550 to study linguistic changes at the close of the Middle Ages.
Get in touch if you would like me to help with your corpus project:
Copyright © 2020 - 2023 www.RichardZimmermann.com