Please use DOI in citation: https://doi.org/10.15155/1-00-0000-0000-0000-0015AL
Estonian Reference Corpus* with NER annotations
NER annotations were produced with Estnltk toolkit and include entities of a type person (PER), organisation (ORG), location (LOC) and timex (TIMEX). The corpus archive contains two subdirectories - tok and lbl. Tok directory contains original text files which are word and sentence tokenised, while the lbl directory contains corresponding NER annotations.
* Estonian Reference Corpus http://www.cl.ut.ee/korpused/segakorpus/