Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters

Publication date

2018-05-07

Authors

Schraagen, MarijnISNI 0000000419454950
Dietz, F.M.ISNI 0000000398613253
van Koppen, MarjoISNI 000000011038355X

Editors

Calzolari, Nicoletta

Advisors

Supervisors

DOI

Document Type

Part of book
Open Access logo

License

Abstract

Developments in the Dutch language during the 17th century, part of the Early Modern period, form an active research topic in historical linguistics and literature. To enable automatic quantitative analysis, a corpus of letters by the 17th century Dutch author and politician P.C. Hooft is manually annotated with parts-of-speech, document segmentation and sociolinguistic metadata. The corpus is developed as part of the Nederlab online research portal, which is available through the CLARIN ERIC European research infrastructure. This paper discusses the design and evaluation of the annotation effort, as well as adding new annotations to an existing annotated corpus.

Keywords

Early Modern Dutch, POS tagging, sociolinguistic annotation, data integration

Citation

Schraagen, M P, Dietz, F M & van Koppen, J M 2018, Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters. in N Calzolari (ed.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), Miyazaki,Japan, pp. 1146-1152, Language Resources and Evaluation Conference (LREC 2018), 7/05/18. < https://aclanthology.info/papers/L18-1184/l18-1184 >, conference