Audio Transcription FAQ's

What is audio transcription?

Foot pedal for audio or video transcription
Transcriber's Best Friend the Foot Pedal

Transcribing audio is the skill and art of listening to and converting recorded speech from either audio, video, podcast or any other recorded dialogue into a digital text document that you can read, edit, print, share, or copy and paste text for social media updates. A transcript delivered in a Word or basic text document can be added to a video, podcast or webinar. By including the additional text or Word document to your video, podcast or any other audio or visual media, you are providing accessibility to the hearing impaired and giving search engine crawlers more information about your content. This will result in your video or podcast appearing higher search results because of the additional data.

To increase the accuracy and speed of the speech-to-text process from recorded speech manually, a transcriber can use a foot pedal. See the picture to the left of the exact foot pedal used by Virtuadmin for audio transcription projects.  Using hotkeys as an alternative can be time-consuming for this type of transcription technique. Using a foot pedal you control the playback of audio with your foot to play, rewind and replay. Therefore, a professional touch typist does not have to take their hands from the keyboard, they keep their fingers on the home keys, increasing the typing speed conversion of recorded audio speech into an accurate transcript.

How long does audio transcription take?

Audio Transcription How Long Does It Take
Audio Transcription

There are many factors to take into consideration when determining how long the audio transcription takes.  First and foremost the following factors need to be taken into consideration:

  • What is the audio quality like? 
  • Is there any background noise? It could have been recorded in a public place (not recommended) is there static in the recording? 
  • How many speakers are in the audio?
  • Do the speaker/s have an accent?
  • Is the transcription to be intelligent verbatim (um's, ah's and repetitive words or sayings like, 'you know' and 'sort of' or strict verbatim, where these are left in the transcript, every little detail is transcribed).

All of these elements need to be taken into consideration when trying to estimate the length of time for a professional transcriber with a touch typing speed of 80 words per minute to turn our recorded speech into text.

How is audio transcription charged?

In the United Kingdom, audio transcription is charged by the audio minute, and not the time it takes to convert recorded speech into text.  It takes approximately 4-6 hours for a professional transcriber to manually listen and type ONE HOUR of recorded audio.Audio Transcription Rate

For our audio transcription service at Virtuadmin our rate starts at 60p per audio minute for a clear single-speaker audio.

If you have any other questions regarding audio transcription, please do not hesitate to email me at and I would be happy to answer any further questions.

Virtuadmin’s Transcription Service on People Per Hour