GetApp offers free software discovery and selection resources for professionals like you. Our service is free because software vendors pay us when they generate web traffic and sales leads from GetApp users. Because we’re committed to helping you find the right solution for your business needs, we list all software vendors on our website and give them the opportunity to feature their solutions and collect user reviews. At GetApp, our comprehensive software listings, verified user reviews, software evaluation scorecards, product comparison pages, and articles will empower you to make confident and well-informed purchase decisions.
Here's our list of Speech Recognition Software.
Rev.ai’s suite of speech-to-text APIs allows businesses to build downstream applications. Its speech recognition software is built on a speech engine trained to transcribe content on various topics, in various accents, for various industries.
Read more about Rev.ai
CallHippo is an easy-to-use virtual phone system that offers robust functionality with advanced features, extensive reporting, and seamless integrations to help sales and service teams have effective conversations with customers. It also provides 24x7 world-class support and instant setup.
Read more about CallHippo
Twilio provides a powerful API for phone services, enabling companies to make and receive phone calls and to send and receive text messages. It allows programmers to easily integrate various communication methods and to use existing web development skills and code to solve communication problems.
Read more about Twilio
Dragon Professional Individual is speech recognition software designed to help professionals leverage deep learning technology to dictate and transcribe documents. Its smart format rules automatically adapt to the required formatting of abbreviations, phone numbers, dates, and other details.
Read more about Dragon Professional Individual
ELSA Speak uses proprietary speech recognition and AI-enabled technology to help students speak English more fluently and effectively. The ELSA API can detect and correct pronunciation errors in scripted and unscripted speech input, providing immediate feedback and recommendations.
Read more about ELSA Speak
Descript is a transcription software that is designed for businesses in multiple industries, such as marketing, sales, user research, online learning, and customer support. It helps team members collaborate on projects, send feedback, create shared folders, add comments, and track document versions.
Read more about Descript
ASR technology allows customers to interact by voice with IVRs, virtual agents, and other computer systems, so they no longer have to key in DTMF tones through menus with long, hard-to-remember options.
Read more about wolkvox
Amberscript is a suite of software products that allow you to transform audio and video files into searchable text and subtitles. Create closed captions and subtitles to improve accessibility and save time and money.
Read more about Amberscript
Talkatoo is a speech recognition and dictation software that helps veterinary organizations utilize speech-to-text technology to capture chart notes on a centralized platform. It provides a built-in medical dictionary, which lets medical professionals dictate terms, such as eosinophilia, hypothermia, intubation, and more.
Read more about Talkatoo
Happy Scribe helps journalists, researchers, podcasters, and video editors convert audio and video files into text documents on a unified portal. The platform lets users store proper nouns, acronyms, and other terminologies in a personalized vocabulary for reference during future projects.
Read more about Happy Scribe
Snowfly is an employee engagement and gamification software designed to help businesses measure the performance of employees and engage them through incentives and rewards. It enables organizations to create, implement, and manage recognition programs to improve employee experience (EX) and satisfaction.
Read more about Snowfly
Trint is a cloud-based audio and video transcription solution that leverages artificial intelligence (AI), machine learning, and natural language processing (NLP) to automatically transcribe audio from a range of file formats and generate an interactive, searchable, editable, and shareable transcript.
Read more about Trint
SpeechTexter is speech recognition and conversion software that helps businesses, teachers, lawyers, writers, and students convert audio files into text. It offers a multi-language speech recognizer as well as a document and email transcriber, enabling users to transcribe documents in real time.
Read more about SpeechTexter
CallFinder® is the leading provider of managed cloud-based SaaS speech analytics, automated call scoring, and speech-to-text transcription with conversational insights, such as sentiment and emotion detection.
Read more about CallFinder
Capté is a web application that uses speech recognition to transcribe audio into subtitles, adding them instantly and automatically. It makes subtitling easier and faster.
Read more about Capté
Reason8 is a cloud-based speech-to-text app which uses artificial intelligence (AI) to automate note taking & summary preparation from in-person meetings. The platform supports multi-device reporting, highlighting, action item extraction, decision recording, transcript & summary exports, and more.
Read more about Reason8
Amazon Transcribe is an automatic speech recognition platform that helps businesses convert speech to text and generate transcripts for reading or review. It includes a call analytics API, which allows developers to process live as well as recorded audio/video inputs and perform transcriptions.
Read more about Amazon Transcribe
DeepScribe is the world’s most widely adopted ambient, AI-powered medical scribe. Through an easy to use mobile app, DeepScribe captures the natural conversation between a clinician and patient and automatically produces accurate medical documentation directly within a clinician's EHR.
Read more about DeepScribe
Enthu is an artificial intelligence (AI)-enabled speech analytics and conversation intelligence software designed for contact centers, call centers, and BPOs. It enables professionals to monitor customer conversations to derive actionable intelligence, manage call QA processes, and ensure compliance with industry regulations.
Read more about Enthu
Reportex from Sony is a cloud-based audio transcription and editing solution that allows users to automatically transcribe audio from multiple file formats, edit and correct transcriptions, create and share video clips of transcribed audio, download edited files, and more.
Read more about Reportex
Maestra is a speech to text software designed to help educators, researchers, marketers, journalists, and media houses automatically add transcriptions, captions, subtitles, and voiceovers to audio and video files in real-time. The platform enables professionals to translate text into various languages including English, French, Spanish, and...
Read more about Maestra
3Play Media is designed to help businesses across media, entertainment, eCommerce, fitness, education, and government sectors handle closed captioning, transcription, audio description, live captioning, and subtitling operations. It enables users to manage podcasts, enhance search engine optimization (SEO) activities, and track audience engagement...
Read more about 3Play Media
OneVoice is part of a unified messaging platform for Office 365 and Gmail. It is an audio transcription, voicemail, and translation tool developed by Donoma. It aims to help sales and customer service agents perform their daily tasks by providing a range of accessible and inclusive features.
Read more about OneVoice
FirstLanguage API is a SaaS-based API service, which enables individual developers or a company to build applications that require NLP tasks.
FirstLanguage API is specifically designed to be generic and can be used by any industry. Specific industry fine-tuning is also possible.
Read more about FirstLanguage
AWS provides machine learning (ML) and artificial intelligence (AI) solutions designed to help businesses analyze data insights, personalize the customer experience, optimize business processes, and more.
Read more about Machine Learning on AWS
Castel Detect LIVE is a voice recognition solution that helps firms of all sizes manage contact center speech analytics with alerts, reminders, scripting, and call scoring. The platform allows users to manage quality assurance via live call analysis, post-call audits, and data-driven feedback.
Read more about Castel Detect Live
OTO is a speech analytics API and AI-powered voice intelligence technology, which analyzes customer sentiment through tone of voice. The platform can be used by call centers, healthcare providers, home technology providers, and robotics developers to understand and leverage voice data.
Read more about OTO
Translation Worldwide Software by JBI SOFTWARE is designed to help businesses across healthcare, legal, medical, insurance, banking, and other industries manage language translation projects. The artificial intelligence (AI)-enabled solution allows employees to handle text interpretation and translation processes and reduce lawsuits.
Read more about Translation Worldwide Software
SoapBox Labs builds speech recognition technology for kids. Its proprietary technology has been built to deliver private and accurate results for kids ages 2 to 12 of all accents and dialects. SoapBox Labs also takes into account kids’ unpredictable speech patterns and behaviors.
Read more about SoapBox
PollySpeech turns any text into lifelike speech, so you can create media content such as audiobooks, podcasts, and other voice content, as well as applications that talk, and build entirely new categories of speech-enabled products.
Read more about PollySpeech
DeepTranscript is an automatic speech recognition provider for professionals, designed for large volumes and high accuracy. Its plug-and-play API collects the data available in conversations, talks, and interviews.
Read more about DeepTranscript
Sesame by Utopia.AI is a cloud-based voice biometric identification solution which uses natural speech to identify callers in real time, by creating voice prints from previous calls without requiring caller enrollment. The software can also analyze caller vocabulary, sentiment, and emotional state.
Read more about Sesame
AISB Engine is a cloud-based & on-premise voice biometrics solution designed to help businesses in industries such as finance, healthcare, retail & air travel identify and authenticate visitors. Key features include password resetting, authentication, secure data access, and fraud detection.
Read more about AISB Engine
Advanced Digital Dictation is a dictation and transcription software, which helps legal firms capture, process, and store case data using cloud speech technology. It automatically stores information in Microsoft Azure or AWS Cloud storage and uses HTTPS transmissions to encrypt data during transfer.
Read more about Digital Dictation
Ebby helps lawyers, podcasters, journalists, researchers, and academic professionals convert audio recordings into text documents using AI technology. The built-in editor automatically synchronizes and plays audio or video files with text data, letting users review and edit transcripts in real-time.
Read more about Ebby
SpeechReport Cursor is an advanced speech recognition software application for medical, legal, and other businesses. The program easily integrates with all information systems, including EHR and MS Office products.
Read more about SpeechReport Cursor
Serenade is a speech engine specifically designed for developers. With Serenade you can write code, update documentation, and send messages without using a keyboard. Programming languages are fully supported so you can run powerful commands using natural speech.
Read more about Serenade
Voci is a cloud-based and on-premise speech analytics software designed to help businesses gain insights into voice data using AI technology and deep learning algorithms. It lets teams transcribe large volumes of audio into analyzable text via high-speed DDR4 SDRAMs.
Read more about Voci
Subcap is a mobile app that adds automatic subtitles to videos. Users can upload a video from the gallery or record one in the app, and Subcap automatically transcribes the audio to text. Its auto-caption maker uses artificial intelligence to generate the subtitles.
Read more about SubCap
Verbit provides accurate captions and transcription of live and recorded video to make it accessible and engaging to all audiences. These tools help educators, government entities, legal agencies, business leaders, and media producers meet the needs of those with disabilities and engage all viewers.
Read more about Verbit
Txtplay.ai transforms your media into text and subtitles within minutes. With the latest AI technology, we offer accurate, high-quality speech-to-text transcripts in 48+ languages, such as English, Swedish, Danish, Norwegian, and Finnish, for your business.
Read more about Txtplay