|ELRA-S0345 Spoken Portuguese Corpus|
|Giovedì 13 Settembre 2012 19:13|
"The Spoken Portuguese corpus consists of a total of 86 recordings (8h44m), collected among sociolinguistically diverse speakers having Portuguese as mother tongue or as second language. The corpus was recorded in a situation of spontaneous oral communication, on different themes of everyday life, with speakers of different ages and social and professional backgrounds. The corpus consists of audio files in .wav format, aligned transcriptions in XML Exmaralda format and transcriptions in plain text."