Audio and video recording for automatic transcription
Automatic transcription : Know which recording formats are compatible
If you are interested in this topic, we invite you to read this article here
For saving time, a lot of Companies, Institutions, Medias, Universities and School use an automatic transcription system. So, which are the files compatible for a transcription? Just audio files?
Audio recording formats for transcription
Audio recording formats for transcription
Transcription is a simple process: Convert Speech to Text. When we say ‘transcription’, that's'audio file'which comes to mind first. As we will see later on, recording quality is essential in order to obtain a reliable transcription.
Audio file formats are very numerous, some of them are a proprietary technology such as the .wma file or Waveform Audio File Format, developed by Microsoft.
Others are free and open as the .wav file or Waveform Audio File Format is a Microsoft and IBM audio file format standard for storing an audio bitstream on PCs. This format is compatible with Windows, Macintosh, and Linux operating systems. WAV files can also be edited and manipulated with relative ease using software.
If you have these extensions in your audio recordings library, or if you have unusual formats as .3GPA, don't panic! They will be accepted by the transcription system, provided that the recording quality is good, as we’ll see later.
For example, on the online platform Authôt, you just have to send your audio format and the system will convert it automatically in .MP3 or .OGG. The conversion in .OGG is needed in order to be compatible with browsers which don’t accept the mp3. For more informations, we invite you to consult the list of formats which are accepted on our platform, these are all present in the library FFMPEG (PDF)
Video recording formats for transcription
Video recording formats for transcription
Audio files are not the only ones to be concerned by the transcription! Video files are too, and in this way it’s the video’s sound which is transcribed.
A numeric video, is a file which contains images, sound and text (metadata) placed in a container. In this container, images, sound and text are compressed. Compression and decompression of these files are realised by codec.
Such as audio files, there is a multitude of different video files formats. For example, the .MOV, file extension compatible with QuickTime or, the .VQF which, in the line of the .MP3, allows a compression more important with a better quality.
Similar to audio files, every format are accepted by the transcription system. They will be automatically converted in .MP4, compatible with the several browsers.
For general information, on our transcription application, on average each month since the beginning of the year, our users send:
- audio files: 36%
-
video files: 64 %
Recording quality, essential for a good transcription!
Recording quality, essential for a good transcription!
The result of the transcription depend mainly on recording quality (our system has 95% accuracy with high quality audio/ video records).
Today, the majority of recorders are numeric. Their advantages:
All microphones do not capture sounds in the same manner, some microphones are designed to capture sounds coming from one single direction, and others are sensitive in all directions of space. As a consequence, the choice of the material is essential in order to have a good recording rendering, in particular if you are recording in a slightly noisy environment.
However, recording in a car with closed windows, regular speed, and without radio is a possible solution. Indeed, the overlap between car noise frequencies and voice frequencies are low and do not prevent from a good transcription.
Elocution is also a major element concerning the recording, it should not be too fast. Tone of voice must be regular. Strong accents also impact on transcription quality.
Each person can also wear a lapel microphone, connected with a recorder. Other solution, the room can be equipped of a conference octopus. During meetings, you must avoid speakers interrupt themselves or speak at once. This strongly alters the transcription.
Save time sending your audio or video recording files at the format of your choice on app.authôt.com and obtain your transcription in one click!
The next article will inform you in details on the different formats available for the export on the app, once the transcription is done. Stay tuned!
Authôt. You speak. We write.