Estonian Reference Corpus* with NER annotations

The NER annotations were produced with the Estnltk toolkit and cover four entity types: person (PER), organisation (ORG), location (LOC) and temporal expression (TIMEX). The corpus archive contains two subdirectories, tok and lbl. The tok directory contains the original text files, tokenised into words and sentences, while the lbl directory contains the corresponding NER annotations.
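As an illustration of how the two subdirectories might be used together, the following Python sketch pairs each file under tok with its counterpart under lbl. It assumes that corresponding files share the same relative path and file name in both directories, which is not stated above and should be checked against the actual archive. The function name pair_corpus_files and the corpus path in the usage note are hypothetical.

```python
from pathlib import Path


def pair_corpus_files(corpus_root):
    """Return (tok_path, lbl_path) pairs for files present in both subdirectories.

    Assumption (not confirmed by the corpus description): a file in tok/ and
    its annotation file in lbl/ have the same relative path and file name.
    """
    root = Path(corpus_root)
    tok_dir = root / "tok"
    lbl_dir = root / "lbl"

    pairs = []
    # Walk tok/ recursively in a stable (sorted) order.
    for tok_path in sorted(p for p in tok_dir.rglob("*") if p.is_file()):
        relative = tok_path.relative_to(tok_dir)
        lbl_path = lbl_dir / relative
        # Keep only files that have an annotation counterpart in lbl/.
        if lbl_path.is_file():
            pairs.append((tok_path, lbl_path))
    return pairs
```

Calling pair_corpus_files("path/to/unpacked/archive") would return a list of path pairs that can then be read side by side. The internal layout of the tok and lbl files is not described here, so parsing them is left out of the sketch.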

* Estonian Reference Corpus: http://www.cl.ut.ee/korpused/segakorpus/

