During our work as speech technologists we developed additional software for the tasks that often follow "using ASR".
These software packages are described here and made available for download.
FromTo V1.6 is our solution for converting ASR results into commonly used output formats.
Once the ASR has been done, one can download the (timed) results. However, each ASR-engine seems to have its own set of output-formats. Sometimes you get a special XML-file, sometimes a CSV-file and sometimes something else. Moreover, if it is XML, each ASR-engine seems to use its own XML-schema.
So we wrote software that reads the (standard) output of the ASR-engines we support (a CTM-file), and transforms it into one of the following output formats:
- SRT: the standard subtitle format used by nearly all existing video and audio players (like VLC)
- VTT: the newer web version of the SRT-format, used by all modern browsers
- Karaoke: an HTML-file in which each recognised word is "connected" with the audio-file; clicking on a word plays the audio-file from that word onwards, and words are highlighted as they are played.
- CHA: the transcript format used in CHILDES.
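The CTM-to-SRT step can be illustrated with a minimal sketch. It is not the FromTo implementation. It assumes the common CTM layout of one word per line (`file channel start duration word [confidence]`, times in seconds) and simply groups a fixed number of words into each subtitle. The names `parse_ctm`, `srt_time`, `ctm_to_srt` and the `max_words` parameter are hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Word:
    start: float  # start time in seconds
    end: float    # end time in seconds
    text: str


def parse_ctm(lines):
    """Parse CTM lines, assuming: file channel start duration word [confidence]."""
    words = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith(";;"):  # skip blank lines and CTM comments
            continue
        fields = line.split()
        start = float(fields[2])
        duration = float(fields[3])
        words.append(Word(start, start + duration, fields[4]))
    return words


def srt_time(seconds):
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def ctm_to_srt(words, max_words=7):
    """Group words into subtitles of at most max_words words (an assumed policy)."""
    blocks = []
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        text = " ".join(w.text for w in chunk)
        timing = f"{srt_time(chunk[0].start)} --> {srt_time(chunk[-1].end)}"
        blocks.append(f"{len(blocks) + 1}\n{timing}\n{text}\n")
    return "\n".join(blocks)  # blank line between consecutive subtitles


if __name__ == "__main__":
    ctm = [
        "utt1 1 0.50 0.30 hello 0.98",
        "utt1 1 0.80 0.40 world 0.95",
    ]
    print(ctm_to_srt(parse_ctm(ctm)))
```

A real converter would probably also split subtitles at pauses or sentence boundaries rather than at a fixed word count. The fixed count is used here only to keep the sketch short.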
FAconvertor is a software program that combines the corrected text (from ASR corrector) with the CTM-file (i.e. outcome of the FAconvertor-software). The outcome is a correct HTML-file (Karaoke), an SRT-subtitle file and a text-file that can easily be converted into an MS Word-file.
ToWave convertor is a software program for MacOS that allows you to convert various AV-files into 16-16-1 wav-files (16 kHz, 16-bit, mono).
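To check whether a wav-file already meets the 16-16-1 specification, a small sketch using Python's standard `wave` module could look like this. It is independent of ToWave, the function name `is_16_16_1` is hypothetical, and it only reads uncompressed PCM wav-files.

```python
import wave


def is_16_16_1(path):
    """Return True if the wav-file is 16 kHz, 16-bit (2 bytes per sample), mono."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == 16000   # 16 kHz sample rate
            and wav.getsampwidth() == 2   # 16-bit samples
            and wav.getnchannels() == 1   # mono
        )
```

Calling `is_16_16_1("recording.wav")` (the file name is only an example) returns `True` or `False`, which makes it easy to decide whether a file still needs to be converted.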