n-gram for Swedish (based on the NST Text Corpus)
View resource name in all available languages
n-gram for svensk (basert på NSTs tekstkorpus)
n-gram for svensk (basert på NST sitt tekstkorpus)
Please use DOI in citation: https://doi.org/10.15155/9-00-0000-0000-0000-00168L
From the Swedish texts in the Text Corpus of Nordisk språkteknologi holding AS, Språkbanken has provided a collection of derivated word n-grams (1-gram, 2-gram, 3-gram, 4-gram, 5-gram og 6-gram) from approximately 400 million words. The n-grams have been made available in two versions, one "light" version with only the 1.000 most frequent n-grams, and a full version where all the derived n-grams are sorted by different criteria. There is also a third format, making it possible to select text types. This version contains more texts and has approximately 437 million words. The n-grams are freely available for language technology research and development purposes.
View resource description in all available languages