Estonian Reference Corpus* with NER annotations
NER annotations were produced with Estnltk toolkit and include entities of a type person (PER), organisation (ORG), location (LOC) and timex (TIMEX). The corpus archive contains two subdirectories - tok and lbl. Tok directory contains original text files which are word and sentence tokenised, while the lbl directory contains corresponding NER annotations.
* Estonian Reference Corpus http://www.cl.ut.ee/korpused/segakorpus/
People who looked at this resource also viewed the following: