Enregistré dans:
Détails bibliographiques
Auteur principal: Petr Pořízka
Format: Artículo científico
Langue:en
Publié: Universität Bern 2009
Sujets:
Accès en ligne:https://www.redalyc.org/articulo.oa?id=664573569005
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
Table des matières:
  • Olomouc Corpus of Spoken Czech: Characterization and Main Features of the Project Petr Pořízka Lengua y Literatura This study presents the results of the author's research project called Olomouc Corpus of Spoken Czech (OCSC). The paper is focused on the state and partial phases of constructing the corpora, its methodology and annotation. Within the OCSC we use so called dual system of transcription, which means (1) an orthographic one with the purpose of linguistic (morphological) analysis and tagging and (2) a phonetic version of transcript which consists of three layers of the text: first the real transcription and further various types of the metatexts as a second and third layer, including communication aspects of the texts. The criteria of selection of speakers are also listed here and the highly important statistical analysis of the sociolinguistic categories (gender, age, type of education, types of recordings) is presented as well. This analysis can serve as a base for a partial correction of possible non-balance among those sociolinguistic parameters. The annotation rules and principles are mentioned at the end of this study. 2009 artículo científico 1615-3014 https://www.redalyc.org/articulo.oa?id=664573569005 en http://www.redalyc.org/revista.oa?id=6645 Linguistik online application/pdf Universität Bern Linguistik online (Suiza) Num.2 Vol.38