EMMA korpuse EKI alaosa

The corpus contains assessment test and exam materials Carried out by Harno (previously Innove). The materials have been collected and prepared for the research purposes in the Institute of the Estonian Language. This includes pseudonymisation of the materials and syntactic parsing with the UDpipe parser (estonian-edt-ud-2.15-241121). The materials are in vrt-format and include both grammatical and contextual information.

The corpus includes in 12076 texts in total, most of which are Estonian L2 materials. More specifically 10747 of the texts are Estonian L2 and 1198 of the texts are Estonian L1.

View resource description in all available languages

Korpuses sisalduvad Harno (varasema Innove) läbiviidud tasemetöö- ja eksamisoorituste tekstid. Materjalid on kogutud ja uurimismaterjaliks ette valmistatud Eesti Keele Instituudis. Selle käigus on tekstid pseudonüümitud ning süntaktiliselt märgendatud UDpipe parseriga (estonian-edt-ud-2.15-241121). Materjalid on salvestatud vrt-formaadis, sisaldades nii grammatilist infot kui ka soorituse ja sooritaja kohta käivat üldist infot.

Korpuses on kokku 12076 teksti, koosnedes valdavalt eesti keel teise keelena tekstidest. Täpne jaotus on E2 tekste 10747, E1 tekste 1198.

You don’t have the permission to edit this resource.
People who looked at this resource also viewed the following: