Samples of Spoken Finnish
View resource name in all available languages
Suomen kielen näytteitä
Please use DOI in citation: https://doi.org/10.15155/9-00-0000-0000-0000-0014FL
The Institute for the Languages of Finland (Kotus) published the printed series Suomen kielen näytteitä (SKN) during the years 1978-2000. A total of 50 booklets appeared, each of which contains the transcripts of approximately two hours of dialect speech. The locations that were selected for the series are well representative of the Finnish dialectal regions.
Using the audio recordings in the Audio Archive of Finnish at Kotus, a database was created for the LAT platform, containing both the audio recordings and the text aligned with audio. For the time being, the access rights can be obtained by contacting FIN-CLARIN at firstname.lastname@example.org. The corpus is available for research and teaching only.
The text has been roughly aligned with the audio on a per sentence basis. Each word in the original dialect transcripts has been associated with a corresponding form in standard language. Please note that this is teh first version of the aligned corpus and the standardization is still very tentative. Not everything has been manually checked and corrected.
The original audio recordings have been processed by Sakari Pietilä. The text and audio have been manually aligned by My Sjöholm, Pauliina Liuska and Olli Miettinen. The file conversions for LAT were performed by Mietta Lennes. The normalized word readings have been created by Maria Vilkuna and Pauliina Liuska.
View resource description in all available languages