<?xml version='1.0' encoding='UTF-8'?>
<OAI-PMH xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.openarchives.org/OAI/2.0/" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2026-04-04T05:07:45Z</responseDate>
  <request verb="GetRecord" metadataPrefix="olac" identifier="fe0c6e23a0d311eebb4773db10791bcfba380eef872341628835bbe75bbf53bb">https://metashare.ut.ee/oai_pmh/</request>
  <GetRecord>
    <record>
      <header>
        <identifier>fe0c6e23a0d311eebb4773db10791bcfba380eef872341628835bbe75bbf53bb</identifier>
        <datestamp>2026-03-20T11:38:52Z</datestamp>
        <setSpec>corpus</setSpec>
        <setSpec>corpus:text</setSpec>
        <setSpec>corpus:audio</setSpec>
        <setSpec>corpus:text:audio</setSpec>
      </header>
      <metadata>
        <olac:olac xmlns:olac="http://www.language-archives.org/OLAC/1.1/" xmlns:dcterms="http://purl.org/dc/terms/" xsi:schemaLocation="http://purl.org/dc/elements/1.1/ http://www.language-archives.org/OLAC/1.1/dc.xsd http://purl.org/dc/terms/ http://www.language-archives.org/OLAC/1.1/dcterms.xsd http://www.language-archives.org/OLAC/1.1/ http://www.language-archives.org/OLAC/1.1/olac.xsd">
          <dc:title xml:lang="et">Suulise keele korpus</dc:title>
          <dc:title xml:lang="en">Corpus of Spoken Estonian</dc:title>
          <dc:description xml:lang="en">The Department of Estonian Language initiated the corpus of spoken Estonian in 1997. The corpus is compiled by the research group of Spoken Estonian (Tiit Hennoste, Airi Jansons, Liina Lindström, Andriela Rääbis, Olga Gerassimenko, Krista Strandson, Piret Toomet, Riina Vellerind). 
The corpus is transcribed using the transcription conventions of conversation analysis (CA). Each tape is provided with a header listing a total of 44 situational factors that have been found to affect language use in analyses of various languages. For each individual tape, as many of these factors as possible are specified. 
The corpus is planned as an open corpus, i.e. no limits have been set. Our intention is to collect various types of spoken language: everyday and institutional conversation, spontaneous and planned speech, monologues and dialogues, face-to-face interaction and media texts. The speakers are inhabitants of the largest towns of Estonia: Tallinn, Tartu and Pärnu. 
As of April 2018, the corpus consists of 3761 audio and 166 video recordings (703 hours, 3927 conversations altogether) and 2337 transcribed texts (2 206 810 words according to Microsoft Word statistics). 
The recordings are divided into: 
1345 face-to-face conversations 
1924 phone conversations 
456 radio and TV broadcasts 
7 Skype conversations 
195 undefined conversations (partially transcribed or fully transcribed extinct recordings). 

On the institutionality scale, the conversations are divided into: 
824 everyday conversations 
2796 institutional conversations 
84 other conversations 
223 undefined conversations 

The institutional situations include a large number of shop dialogues, dialogues at service institutions and government offices. 
The corpus is a data bank in Word format and in plain-text (txt) format (ISO-8859-1). Access to the corpus requires a contract with the research group of Spoken Estonian.</dc:description>
          <dc:identifier xsi:type="dcterms:URI">http://hdl.handle.net/11297/1-00-0000-0000-0000-0002-7</dc:identifier>
          <dc:language xsi:type="olac:language" olac:code="et">Estonian</dc:language>
          <dc:type xsi:type="olac:linguistic-type" olac:code="primary_text"/>
          <dc:subject>language resources, monolingual corpus</dc:subject>
          <dc:type xsi:type="dcterms:DCMIType">Text</dc:type>
          <dc:type xsi:type="dcterms:DCMIType">Sound</dc:type>
          <dcterms:license>
	proprietary
	Restrictions of Use: other
	User Nature: academic
	</dcterms:license>
          <dcterms:extent>1 315 500 words</dcterms:extent>
          <dc:contributor xsi:type="olac:role" olac:code="depositor">Andriela Rääbis, andriela.raabis[at]ut.ee</dc:contributor>
        </olac:olac>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>
