<codeBook xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xsi:schemaLocation="ddi:codebook:2_5 http://www.ddialliance.org/Specification/DDI-Codebook/2.5/XMLSchema/codebook.xsd" xmlns="ddi:codebook:2_5">
  <docDscr>
    <citation>
      <titlStmt>
        <titl xml:lang="sv">xhosa</titl>
        <parTitl xml:lang="en">Corpus of spoken isiXhosa</parTitl>
        <IDNo agency="SND">doi-10-23695-xrsg-mp07-0</IDNo>
        <IDNo agency="DOI">https://doi.org/10.23695/XRSG-MP07</IDNo>
      </titlStmt>
      <prodStmt>
        <producer xml:lang="en" abbr="SND">Swedish National Data Service</producer>
        <producer xml:lang="sv" abbr="SND">Svensk nationell datatjänst</producer>
      </prodStmt>
      <holdings URI="https://doi.org/10.23695/XRSG-MP07">Landing page</holdings>
    </citation>
  </docDscr>
  <stdyDscr>
    <citation>
      <titlStmt>
        <titl xml:lang="sv">xhosa</titl>
        <parTitl xml:lang="en">Corpus of spoken isiXhosa</parTitl>
        <IDNo agency="SND">doi-10-23695-xrsg-mp07-0</IDNo>
        <IDNo agency="DOI">https://doi.org/10.23695/XRSG-MP07</IDNo>
      </titlStmt>
      <rspStmt>
        <AuthEnty xml:lang="en" affiliation="">Språkbanken Text</AuthEnty>
      </rspStmt>
      <prodStmt />
      <distStmt>
        <distrbtr xml:lang="en" abbr="SND" URI="https://snd.se">Swedish National Data Service</distrbtr>
        <distrbtr xml:lang="sv" abbr="SND" URI="https://snd.se">Svensk nationell datatjänst</distrbtr>
        <distDate xml:lang="en" date="2024-05-08" />
      </distStmt>
      <verStmt>
        <version elementVersion="0" elementVersionDate="2024-05-08" />
      </verStmt>
      <holdings URI="https://doi.org/10.23695/XRSG-MP07">Landing page</holdings>
    </citation>
    <stdyInfo>
      <subject />
      <abstract xml:lang="en" contentType="abstract">The Corpus of Spoken isiXhosa

  The Corpus of Spoken isiXhosa consists of transcribed and annotated recordings of spoken Xhosa [xho]. The recordings have been made in the Eastern Cape in South Africa from 2015 onwards. The transcribed texts are annotated with morpheme-by-morpheme glosses, part-of-speech tags, and free English translations.

  The recordings and annotations of the Xhosa data were made as part of three research projects led by senior lecturer Eva-Marie Bloom Ström at the University of Gothenburg. All projects, including the ongoing ‘How do words get in order? The role of speaker-hearer interaction in languages of southern Africa’, were funded by the Swedish Research Council.

  The Corpus has been developed in collaboration with Språkbanken Text.

  A user guide and more extensive information about the corpus data can be found in the Corpus of Spoken isiXhosa Manual [PDF].

For more on annotation, preparation of data, and acknowledgements see:

Bloom Ström, E.-M., Slater, O., Zahran, A., Berdicevskis, A., &amp; Schumacher, A. (2023). Preparing a corpus of spoken Xhosa. Proceedings of the 2023 CLASP Conference on Learning with Small Data (LSD), 62–67. https://aclanthology.org/2023.clasp-1.7

For questions about the corpus:
  Eva-Marie Bloom Ström eva-marie.strom@gu.se

  If you notice any errors or inconsistencies in the annotations, please report them to the email address above.

Main contributors:

Eva-Marie Bloom Ström
  Senior Lecturer, University of Gothenburg
Onelisa Slater
  MA, Rhodes University
Aron Zahran
  PhD, Inalco/Llacan (CNRS) &amp; Ghent University</abstract>
      <abstract xml:lang="sv" contentType="abstract">The Corpus of Spoken isiXhosa

  The Corpus of Spoken isiXhosa consists of transcribed and annotated recordings of spoken Xhosa [xho]. The recordings have been made in the Eastern Cape in South Africa from 2015 onwards. The transcribed texts are annotated with morpheme-by-morpheme glosses, part-of-speech tags, and free English translations.

  The recordings and annotations of the Xhosa data were made as part of three research projects led by senior lecturer Eva-Marie Bloom Ström at the University of Gothenburg. All projects, including the ongoing ‘How do words get in order? The role of speaker-hearer interaction in languages of southern Africa’, were funded by the Swedish Research Council.

  The Corpus has been developed in collaboration with Språkbanken Text.

  A user guide and more extensive information about the corpus data can be found in the Corpus of Spoken isiXhosa Manual [PDF].

For more on annotation, preparation of data, and acknowledgements see:

Bloom Ström, E.-M., Slater, O., Zahran, A., Berdicevskis, A., &amp; Schumacher, A. (2023). Preparing a corpus of spoken Xhosa. Proceedings of the 2023 CLASP Conference on Learning with Small Data (LSD), 62–67. https://aclanthology.org/2023.clasp-1.7

For questions about the corpus:
  Eva-Marie Bloom Ström eva-marie.strom@gu.se

  If you notice any errors or inconsistencies in the annotations, please report them to the email address above.

Main contributors:

Eva-Marie Bloom Ström
  Senior Lecturer, University of Gothenburg
Onelisa Slater
  MA, Rhodes University
Aron Zahran
  PhD, Inalco/Llacan (CNRS) &amp; Ghent University</abstract>
      <sumDscr />
    </stdyInfo>
    <method>
      <dataColl />
    </method>
    <dataAccs>
      <useStmt>
        <restrctn xml:lang="en">Access to data through an external actor.</restrctn>
        <restrctn xml:lang="sv">Åtkomst till data via extern aktör. </restrctn>
      </useStmt>
    </dataAccs>
    <othrStdyMat />
  </stdyDscr>
</codeBook>