New Digital Recordings - Speech & Technology

For new AV-recordings it makes sense to create a recording situation, optimized for technology such as ASR, Aligment, Emotion Detection, Facial Expression Analyses and more.

A good example is the VXi VoxStar UC Bluetooth HeadsetSome small guidelines:

if possible, record each speaker on a separate audio channel via a separate microphone
record the speech with a high sample frequency and a 4-bit sample value (not 16-16-mono but 96-32-channel-per-speaker)
use microphones that have a more-or-less fixed distance to the mouth
use microphones that mute as much as possible the sound from other sources than the mouth of the speaker

The benefits of the approach mentioned here are great. Separate channels per speaker makes it possible to do automatic turn-taking, it prevents that a louder speaking person "overrules" a softer speaking person and the speech can be transcribed even if people are talking together.

20 May

Joint COLING and LREC conference

20-05-2024 - 25-05-2024

The three-day main conference (22-23-24, May) will be accompanied by a total of three days of workshops and tutorials (20-21-25, May) held in the days immediately before and after.

Two major international key players in the area of computational linguistics, the ELRA Language Resources Association (ELRA) and the International Committee on Computational Linguistics (ICCL), are joining forces to organize the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) to be held in Torino (Italy) on 20-25 May, 2024.

The hybrid conference will bring together researchers and practitioners in computational linguistics, speech, multimodality, and natural language processing, with special attention to evaluation and the development of resources that support work in these areas. Following in the tradition of the well-established parent conferences COLING and LREC, the joint conference will feature grand challenges and provide ample opportunity for attendees to exchange information and ideas through both oral presentations and extensive poster sessions, complemented by a friendly social program.

31 Aug

The Young Female* Researchers in Speech Workshop

31-08-2024

YFRSW 2024 The Young Female* Researchers in Speech Workshop (YFRSW) is a workshop for female* Bachelor’s and Master’s students currently working in speech science and technology. The workshop aims to promote interest in research in our field among women* who have not yet committed to pursuing a PhD in speech science or technology, but who have already gained research experience at their universities through individual or group projects.

The workshop will be held prior to Interspeech 2024 on Saturday, August 31st, 2024. The event will take place in Greece. The workshop will feature panel discussions with PhD students and senior researchers in the field, student poster presentations, and a mentoring session. Student poster presentations should give an overview of a current or planned research project in which the student is involved, with an emphasis on promoting discussion.

For more information, see: https://sites.google.com/view/yfrsw-2024/abstract-submission