The initiators of this website are a group of experts from very different backgrounds (oral history, computational linguistics, anthropology, sociolinguistics, phonetics and phonology) who share an interest in speech data. We all want to explore how technology can be integrated into research that involves spoken narratives.
This technology ranges from the very basic, such as the conversion of recorded speech from analogue to digital, to the more elaborate, such as automatic speech recognition (ASR) applied to generate transcripts of the spoken content.
We first offer basic information about the epistemological focus of research domains that deal with speech data. This is followed by an explanation of relevant technologies and related tools that researchers can use. We then share our knowledge in the sections Workshops, Showcases and Publications.
Last but not least, you can access the first concrete result of our efforts: The Transcription Portal, an open-source service for automatic transcription in English, German, Dutch and Italian.
AI and Oral History: Applications in Holocaust Testimonies
An interesting online meeting about AI and Oral History. Date: 25 November 2024. Time: 14:25–15:45. How: Online. We are pleased to welcome Maria Dermentzi (digital humanities consultant) and...
What automatic speech recognition can and cannot do for conversational speech transcription
Sam O'Connor Russell, Iona Gessinger, Anna Krason, Gabriella Vigliocco, Naomi Harte. In: Research Methods in Applied Linguistics, Volume 3, Issue 3, 2024, 100163, ISSN 2772-7661. DOI:...
A new ASR tool: aTrain
At the end of March, we got the first version of our paper back from the LREC-COLING workshop about Holocaust Testimonies as Language Resources (Workshops – LREC-Coling 2024). We needed to modify some...
Update Whisper Large Model
OpenAI is pleased to announce the latest iteration of Whisper, called large-v3. Whisper large-v3 has the same architecture as the previous large models except for some minor differences. The large-v3 model...