Clause Segmenter for Estonian
View resource name in all available languages
Osalausestaja
Clause Segmenter is a program that splits long and complex natural
language sentences into smaller segments (clauses). For example, the
sentence "Mees, keda seal kohtasime, oli tuttav ja teretas meid."
will be split into following clauses:
"[Mees, [keda seal kohtasime,] oli tuttav ja] [teretas meid.]"
(in the example, clauses are surrounded by brackets)
The algorithm mainly relies on punctuation, conjunction words, and
finite verb forms on identifying the clause boundaries.
For linguistic details/motivations behind the algorithm, see (Kaalep,
Muischnek 2012).
View resource description in all available languages
Osalausepiiride tuvastaja
People who looked at this resource also viewed the following: