Audio, Voice & Transcription
33 tools analyzed and scored across 12 dimensions.
Wispr Flow is an AI-powered voice dictation app for Mac, Windows, iOS, and Android that transcribes speech 4x faster than typing, automatically cleani
Letterly is a speech-to-text app that records voice (up to 90 minutes per recording, including offline) and transcribes it into structured text within
Piezo is a charmingly simple audio recording app for Mac by Rogue Amoeba that lets you capture audio from any application or input device — including
Dialora is an AI voice agent platform that automates inbound and outbound business phone calls with natural, human-like conversations in multiple lang
SoundSource is Rogue Amoeba's macOS audio control utility that lives in the menu bar and provides per-application volume control, per-app audio routin
Suno is a generative AI music platform that converts text prompts into full songs — complete with sung vocals, melody, instrumentation, and production
MegaTranscript is an AI transcription platform that converts audio and video files into accurate text transcripts and subtitles, with additional capab
SpeechNow is an AI text-to-speech platform available on AppSumo that converts written content (eBooks, PDFs, scripts) into natural-sounding voice reco
Intelligent Invoicer could not be confirmed as an established product through official website sources or major deal platforms at time of research. No
Pismo (acquired by Visa in 2024 for $1 billion) is a cloud-native, API-first core banking and payments processing platform serving major financial ins
Whisper Transcription is a macOS app that transcribes audio files and live recordings entirely on-device using OpenAI's Whisper model, with no data le
Unmixr AI is an online audio stem separation tool that splits songs into isolated tracks—vocals, drums, bass, and other instruments—via a simple file
Audio Hijack is a Mac application by Rogue Amoeba that captures and records any audio playing on a Mac — from apps like Zoom, Safari, or FaceTime — as
Farrago is a professional Mac soundboard app by Rogue Amoeba that transforms your keyboard into an instant audio playback controller using a tile-base
Fission is a fast, lossless audio editor for macOS made by Rogue Amoeba that trims, joins, and splits audio files without re-encoding, preserving the
Supermusic is an AI music generation mobile app (iOS and Android) that converts text prompts and user-written lyrics into full songs with AI-generated
Loopback is a cable-free audio routing application for macOS by Rogue Amoeba that creates virtual audio devices, allowing you to combine audio streams
Tuney is a conversational AI music creation platform where you describe your musical ideas in plain language — upload a melody, describe a mood, or re
Prizmo is a professional scanning and OCR app by Creaceed for iPhone, iPad, and Mac that delivers high-accuracy text recognition in 139 languages, inc
Transcript LOL is an AI transcription platform that converts audio and video from 1,500+ platforms into accurate text using OpenAI Whisper, then layer
An AI voice generation platform on AppSumo that aggregates multiple voice engines — including ElevenLabs, Cartesia, OpenAI, FishAudio, and Orpheus — i
AudioHero is a royalty-free music and sound effects library with 300,000+ tracks covering diverse genres, moods, and tempos, all licensed for unlimite
TranscribeX is a macOS app that performs fast, fully local AI transcription of audio and video files using an on-device Whisper-based engine, keeping
WhisperTranscribe is a macOS app (available on Setapp) that transcribes audio and video files — MP3, MP4, WAV, WEBM, and more — using the Whisper mode
UniScribe is an AI transcription service that converts audio files, video files, and YouTube links into accurate transcripts, summaries, and visual mi
Persona Music is a royalty-free music licensing platform offering a library of over 10,000 tracks sourced from Hollywood film trailers, Emmy award-win
Voice commands dashboard. Tier 2 ($49). Alternatives: Otter.ai (AI meeting transcription with speaker identification and summaries), Notta (AI transcr