During our work as speech technologists we developed additional software solutions for the work that often occur after "using ASR".
These various software packages are here described and made available for download.
LabelMaker: How to add lables to the transcripts?
LabelMaker V1.4 is a software program that allows you to add labels to selected parts of the subtitles.
CTMimprover: How to automatically improve the ASR-results in a CTM or SPK file.
Manual will follow.pdf
ASRcorrector: How can I correct the mistakes within an automatically generated transcript?
ASRcorrector V2.8.25 and above, is a software program with various screens that allows you to correct the outcome of the ASR into a suitable text.
(last update: 8-3-2023)
WhisperCorrector: How can I correct the mistakes within an automatically generated transcript with the OpenAI Whisper engine?
WhisperCorrector V1.0.0 and above, is a software program with various screens that allows you to correct the outcome of the Whisper ASR-engine into a suitable text. WhisperCorrector is based on (and still equal to) ASRcorrector. However, ASRcorrector has the option to include the ASR-results from various ASR-engines, while WhisperCorrector only deals with the results of the Whisper ASR-engine.
(last update: 24-3-2023)
FromTo: How can I convert a ctm/spk file into subtitles and/or html-files?
FromTo V1.7 (and above) is our solution for this.
Once the ASR has been done, one can download the (timed) results. However, each ASR-engine seems to have its own set of output-formats. Sometimes you can get a special XML-file, sometimes a CSV-file and sometime something else. Moreover, if it is XML, each ASR-engine seems to use its own XML-schema.
So we wrote software that reads the (standard) output of the ASR-engines we support (a CTM-file), and transforms it into one of the following output formats:
- SRT: the standard subtitle format used by nearly all existing video and audio players (like VLC)
- VTT: the new internetversion of the SRT-format, used by all the modern browsers
- Karaoke: a html-file where each recognised word is "connected" with the audio-file and where clicking on a word results in playing the audio-file from that word. Words played, are highlighted.
- CHA: the format used in the CHILDES format.
FAcleaner: How to convert the text from the ASRcorrector into a good FA_text that can be run by a FA-conversion?
FAcleaner is a software program that corrects the text (from ASR corrector) and removes al kind of know errors. The outcome is a correct text that could be used by the FA-routine.
FAconvertor: How do I combine the corrected transcript with the CTM-file to generated a subtitle/karaoke style file?
FAconvertor is a software program that combines the corrected text (from ASR corrector) with the CTM-file (i.e. outcome of the FAconvertor-software). The outcome is a correct HTML-file (Karaoke), SRT-subtile file and text-file that can easily be converted into a MSword-file.
FAgluer: How do I combine two (or more) parts of an interview into one set of files?
FAgluer is a software program that combines 3x 2 adjacent parts of an interview. The outcome is a new wav, ctm and txt-file.
To_WAV_Convertor: How do I convert audio or video files to WAVE files, a filetype which is often required by tools?
ToWave convertor is a software program for MacOS that allows you to convert various AV-files into a 16-16-1 wav-files (a 16kHz, 16-bit, mono).
K-Lite: I can't seem to play certain mp4-files on my PC. How do I fix this?
Here's the K-Lite Codec pack; two small audio-improvers you sometimes need to play mp4-files.
Start with "K-Lite_Codec_Pack_1608_Full.exe" and then eventually run "klcp_update_1608_20210313.exe".
For the latest information, see here.