Convert Audio To Text

The problem of how to convert audio to text is an ancient problem. The spoken word has an immediacy that, until recently, was nearly impossible to capture. Today, though, we have thankfully progressed and there are at least three options available that offer a solution of converting audio to text (or transcription): 

  1. Do it yourself (with software),
  2. use a human-based or hybrid transcription service,
  3. try out machine transcription services.

Accuracy and budget are the main influences when it comes to your choice on how you will convert your audio to text. We take a brief look at each with some popular links for you to consider with your choice. Important: we cannot guarantee the accuracy or veracity of other transcription services listed.

Do It Yourself – Transcription Software

If you have time, are very concerned about excellent accuracy but have a very low or no budget, we would suggest you “do it yourself”. In this case, take a look at our transcription software list for (mostly free) software you can download to use to type your recording (tip: see our hints for best quality audio recording – if you have recorded yet). Transcription software has been around for many years and continues to slowly improve. Often used by audio typists, some software requires additional equipment such as foot peddles. Most software is suitable for dictation.

Software Transcription Picks

Express Scribe Transcription Software – professional audio player software for PC or Mac designed to assist the transcription of audio recordings. A typist can install it on their computer and control audio playback using a transcription foot pedal or keyboard.

InqScribe – supports multiple languages and has the capacity insert time codes and utilize analysis codes. It is simple to use. You can view a video and transcribe it using one window.

Nuance Dragon – if you rely on a digital voice recorder or smartphone to capture notes and memos, you can use Dragon’s transcription to turn recordings into text.

FTW Transcriber – transcription software that offers time-saving features like automatic time stamps.

Listen n Write – used to play and transcribe ordinary audio and video recordings (WAV, MP3, OGG, WMA, AVI, MPG, WMV, OGV, FLV, VOB, TS, etc).

VoxSigma – a suite of language-specific speech recognition software offered by Vocapia Research for Unix-like x86 and x86-64 platforms.

EureScribe – transcription client that breaks files into small, easy-to-memorize chunks.

MAXQDA – text analysis software.

Human or Hybrid Transcription

If you do not have the time or simply don’t want to transcribe your recordings but have a budget and need an accurate transcription, then look at using appropriate audio to text transcription service – human or hybrid transcription services. The ability to convert audio to text (or convert audio text) using human transcribers requires a reasonable typing speed, an excellent level of general knowledge and very good hearing indeed.

Services have appeared everywhere from individuals or freelance transcribers who offer their skills directly in the market to established transcription companies that have a global presence and good reputations. Whichever you prefer, always choose a preferred transcription service according to the quality of transcribers they employ (and how they source, select and assess them!). Also, you should review the company’s reputation and ability to transcribe in your required language and accents. Furthermore, you should check the company’s experience and knowledge of relevant client subject matter. Also, see if the company offers localized solutions and is secure.

Human Transcription Picks

There are hundreds of services out there and if you try Google for transcription services companies you will get an endless list. We have chosen ten prominent and reputable transcription companies globally.

Way With Words  – The original English-language transcription service company. Personifies itself as “Transcription made personal”. A professional transcription boutique service offering various turnaround options and standard or customized transcripts.

Get An Instant Quote

Way With Words’ standard transcription service is calculated on a per audio or video minute rate. Pricing depends on the turnaround time chosen and the add-on options selected. The longer the turnaround and the fewer the add-ons selected, the lower the price. Use our calculator below to get started ↴

Number of minutes to transcribe
Turnaround time
Time Coding  
Full Verbatim  
Rate Per Minute   $0.00
Deposit   $0.00
TOTAL: $0.00

Turnaround time
Time Coding  
Full Verbatim  
Rate Per Minute   $0.00
Deposit   $0.00
TOTAL: $0.00 Sales Tax N/A

Do you have something special in mind?

Send us your specific requirements by using our CONTACT FORM and we will get back to you with your custom quote.

Just want to chat? Please feel free to use our live chat app by clicking the tab at the bottom of the page.

Nibity – A first stage hybrid transcription solution using people. Simple drop-and-go transcription solution to convert audio or video to text. On completion, an email notification is sent. The service focus is on low pricing, superb accuracy, 24-hour turnarounds or queuing and increased automation of the process.

Athreon – quality professional transcription services at a reasonable price.

GMR Transcription – professional transcription and translation.

Transcribeme – Cost-effective and flexible, includes basic and premium features.

Transcription Outsourcing – LLC’s secure, CJIS-approved online transcription platform. Focuses on police transcription services.

Today Translations – international translation services company that handle more than 200 languages.

Take1 – a UK leading transcription company with a focus on TV and video.

VoxTab – a human team of transcriptionists and quality analysts for the US, UK, and Australia.

Machine Transcription

Alternatively, if you still don’t have time but either have a very small budget or don’t care about a high level of accuracy, then you can always try out one of the many start-up machine transcription services. Machine transcription applications are still very much in development and have challenges with the degree of their accuracy depending on language, dialect and circumstances of recording. Machine transcript accuracy can range vastly depending on the number of speakers, background noise, etc. A human audio to text transcriber with a level of less than 95% accuracy would be regarded as barely competent. For machines this can be an enviable target to achieve.

Machine Transcription Picks

Google Speech – one of the giants offering draft transcription.

Amazon Transcribe – Offers a wider range of options for more commercial integration.

Happy Scribe – automatically transcribes interviews for € 0.09 per minute.

Speechmatics – challenges understanding speech in any language.

Trint – a platform for searching, editing and getting audio and video content.”



You can choose between proven approaches to convert audio to text or new emerging options. Either way, we hope your choice works for you! English is a very context-dependent language, stuffed with homophones, homonyms and very idiosyncratic spelling and grammar. Therefore, it’s unlikely that we are going to reach the stage when a machine delivers the same quality of audio-to-text conversion as humans before 2025 (according to our research). However, if its simply keyword search or meta tagging you need then this might be an option for you.

A Brief Review

Various attempts to convert audio to text (or convert audio text) have been made over the years. The first was shorthand, where a skilled practitioner could record up to 225 words a minute. Shorthand though has the weakness inherent in immediacy as the practitioner only has one shot at hearing what is said.

It wasn’t until a reliable means for recording voices came along in the late 19th century, with Edison’s phonograph cylinder that digital recording devices came into existence. Even then, they were hardly portable. But for the first time, transcription of speech based on a recording was possible, and the art of dictation was born. So it was now possible to convert audio to text with a level of accuracy never before achieved.

But let’s not get caught up in history. You are probably here to look at some options to get your audio recording transcribed (if you are looking at having a video recording transcribed, you might like to consider our guide to video transcription services). If so, we suggest that you should consider two things before you choose your preferred transcription method to get your get your recording transcribed. If so, we suggest that you should consider two factors before you choose your preferred approach to get your audio into text:

  1. How important accuracy is for your transcript.
  2. The budget you are prepared to spend.