Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters
Publication date
2018-05-07
Editors
Calzolari, Nicoletta
Advisors
Supervisors
DOI
Document Type
Part of book
Metadata
Show full item recordCollections
License
Abstract
Developments in the Dutch language during the 17th century, part of the Early Modern period, form an active research topic in historical linguistics and literature. To enable automatic quantitative analysis, a corpus of letters by the 17th century Dutch author and politician P.C. Hooft is manually annotated with parts-of-speech, document segmentation and sociolinguistic metadata. The corpus is developed as part of the Nederlab online research portal, which is available through the CLARIN ERIC European research infrastructure. This paper discusses the design and evaluation of the annotation effort, as well as adding new annotations to an existing annotated corpus.
Keywords
Early Modern Dutch, POS tagging, sociolinguistic annotation, data integration
Citation
Schraagen, M P, Dietz, F M & van Koppen, J M 2018, Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters. in N Calzolari (ed.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), Miyazaki,Japan, pp. 1146-1152, Language Resources and Evaluation Conference (LREC 2018), 7/05/18. < https://aclanthology.info/papers/L18-1184/l18-1184 >, conference