WEBVTT

00:00:00.794 --> 00:00:05.880
My name is Silvia Calamai and I work at the University of Siena in Arezzo

00:00:06.002 --> 00:00:08.753
I'm a general linguist, a specialist in dialects

00:00:09.365 --> 00:00:14.556
Over the years I have collected, catalogued, and digitized

00:00:14.581 --> 00:00:19.878
oral archives of Tuscan origin from the 1970s onwards

00:00:20.663 --> 00:00:25.608
This huge amount of material, consisting of spoken and sung Tuscan,

00:00:26.699 --> 00:00:30.532
needs to be studied by linguists

00:00:30.548 --> 00:00:33.151
Not only by linguists, but also by linguists

00:00:34.960 --> 00:00:41.278
Voice technologies can provide us with tools to facilitate our work

00:00:41.907 --> 00:00:46.671
through systems of automatic alignment and speech recognition

00:00:46.682 --> 00:00:52.037
This would greatly facilitate the research of those interested

00:00:52.062 --> 00:00:56.055
in the sociological perspective of phonetics, in sociolinguistics

00:00:56.493 --> 00:01:07.534
It would allow us to see the changes in phonetic aspects such as pronunciation, accents, and intonation

00:01:07.559 --> 00:01:11.492
which have occurred over the last 70 years in one region

00:01:11.831 --> 00:01:17.904
Done manually, this work is slow, time-consuming, and inefficient

00:01:17.929 --> 00:01:22.532
If we succeed, through the support of CLARIN

00:01:22.557 --> 00:01:30.128
and through a dialogue with technologists,

00:01:30.153 --> 00:01:32.791
if we can find common ground and create a shared vocabulary,

00:01:33.336 --> 00:01:37.845
we could eventually get a huge amount of data

00:01:39.385 --> 00:01:42.440
that could be analyzed from a socio-phonetic point of view

00:01:42.964 --> 00:01:50.225
And we could describe it with a reliable database that could be verified by the community

00:01:50.265 --> 00:01:53.815
We could describe, on the basis of a sound and verifiable database,

00:01:53.840 --> 00:01:57.496
the phonetic changes that have occurred within one region, and not only that

00:01:57.575 --> 00:02:05.468
If the automation of the transcription chain workflow is successful,

00:02:06.699 --> 00:02:12.757
we could make a great amount of sound data available to read

00:02:13.678 --> 00:02:16.887
The wonderful thing about oral archives is that they are at a crossroads

00:02:16.912 --> 00:02:21.300
of disciplines that are quite distinct:

00:02:21.341 --> 00:02:29.208
oral historians, sociolinguists, sociologists, economists, geographers

00:02:29.242 --> 00:02:33.861
Oral archives can be considered a resource in an imaginary place

00:02:35.074 --> 00:02:41.827
suited to finding paths to truly embark on interdisciplinary approaches to research.

00:02:42.065 --> 00:02:48.130
As for my discipline, we can only benefit from it

00:02:48.337 --> 00:02:52.219
On the other hand, voice technology faces a big challenge:

00:02:52.410 --> 00:02:56.343
to be able to create models of automatic speech recognition

00:02:56.820 --> 00:03:04.947
robust enough to perform well on speech with strong regional dialectal features

00:03:05.067 --> 00:03:12.885
It is material that has not been created in isolation in an experimental setting.

00:03:12.988 --> 00:03:20.948
Informal everyday talk poses a very strong challenge on the technological side

00:03:20.964 --> 00:03:29.073
Whoever has oral archives should open them up and make them accessible,

00:03:29.098 --> 00:03:36.026
and should be willing to digitize and catalogue the material, to describe it and share it with the community

00:03:37.156 --> 00:03:42.957
And technologists should be ready and available to talk to humanists

00:03:43.950 --> 00:03:55.027
to make a common effort and arrive at richer and fuller research results

00:03:58.517 --> 00:04:03.587
Make it accessible and store the archive

00:04:03.897 --> 00:04:11.127
so that it is accessible and even queryable, regardless of the discipline

00:04:13.643 --> 00:04:22.955
Find a way to establish a common vocabulary with the world of technology

00:04:22.980 --> 00:04:27.641
Sometimes it is hard for us to understand them,

00:04:27.768 --> 00:04:34.553
the technologists, but I feel that technologists are also struggling to understand humanists

00:04:34.561 --> 00:04:41.404
So this can be very fertile ground for us, but also for them