During our work as speech technologists we developed additional software solutions for the work that often occur after "using ASR".
These various software packages are here described and made available for download.
Attention! Some people have problems when they used Chrome or Safari. So, if you have "problems", please use Firefox or Edge. |
LabelMaker: How to add lables to the transcripts?
LabelMaker V1.4 is a software program that allows you to add labels to selected parts of the subtitles.
![]() ![]() |
CTMimprover: How to automatically improve the ASR-results in a CTM or SPK file.
WhisperCorrector: How can I correct the mistakes within an automatically generated transcript with the OpenAI Whisper engine?
WhisperCorrector is a software program with various screens that allows you to correct the outcome of the Whisper ASR-engine into a suitable and correct transcription. WhisperCorrector deals with the results of various Whisper ASR-engines (Whisper, WhisperX, Faster-Whisper, Faster-Whisper-XXL, Whisper by Surf, MacWhisper, and aTrain).
(last update: 23-04-2025)
Version: 2.7.12
SRT to/from VTT: Software to convert SRT into VTT and from VTT to SRT.
A simple program that reads a SRT or VTT file (select file) or a directory (select dir) with all SRT or VTT files and converts them to VTT or SRT.
Whisper Batch: Gives the possibility to select various files that need to be recognised.
Gives the possibility to select various files that need to be recognised, each with its own items like model, languages and other otems. Once selected all the files, you can start the process from this program. It works now for Windows 64-bit and MacOS.
A manual will be written in the near future after the first comments :-).
FromTo: How can I convert a ctm/spk file into subtitles and/or html-files?
FromTo V1.7 (and above) is our solution for this.
Once the ASR has been done, one can download the (timed) results. However, each ASR-engine seems to have its own set of output-formats. Sometimes you can get a special XML-file, sometimes a CSV-file and sometime something else. Moreover, if it is XML, each ASR-engine seems to use its own XML-schema.
Karaoke example
So we wrote software that reads the (standard) output of the ASR-engines we support (a CTM-file), and transforms it into one of the following output formats:
- SRT: the standard subtitle format used by nearly all existing video and audio players (like VLC)
- VTT: the new internetversion of the SRT-format, used by all the modern browsers
- Karaoke: a html-file where each recognised word is "connected" with the audio-file and where clicking on a word results in playing the audio-file from that word. Words played, are highlighted.
- CHA: the format used in the CHILDES format.
FAcleaner: How to convert the text from the ASRcorrector into a good FA_text that can be run by a FA-conversion?
FAconvertor: How do I combine the corrected transcript with the CTM-file to generated a subtitle/karaoke style file?
FAconvertor is a software program that combines the corrected text (from ASR corrector) with the CTM-file (i.e. outcome of the FAconvertor-software). The outcome is a correct HTML-file (Karaoke), SRT-subtile file and text-file that can easily be converted into a MSword-file.
FAgluer: How do I combine two (or more) parts of an interview into one set of files?
FAgluer is a software program that combines 3x 2 adjacent parts of an interview. The outcome is a new wav, ctm and txt-file.
SRT Condensor: software to turn the existing SRT file into longer pieces.
SRTcondersor is a software program that combines the original subtitles into pieces of X-minutes. The outcome is a new srt-file.
Links to useful software
To_WAV_Convertor: How do I convert audio or video files to WAVE files, a filetype which is often required by tools?
ToWave convertor is a software program for MacOS that allows you to convert various AV-files into a 16-16-1 wav-files (a 16kHz, 16-bit, mono).
K-Lite: I can't seem to play certain mp4-files on my PC. How do I fix this?
Here's the K-Lite Codec pack; two small audio-improvers you sometimes need to play mp4-files.
Start with "K-Lite_Codec_Pack_1608_Full.exe" and then eventually run "klcp_update_1608_20210313.exe".
For the latest information, see here.