AUDIO & MUSIC

Krisp is an AI-powered noise cancellation application that removes background noise, echoes, and voices from audio in real-time during calls and recordings. It works with any communication app and is used by remote workers and call center professionals.

Audio & MusicAudio EditingVerified

#noise cancellation#audio enhancement#remote work

ElevenLabs

ElevenLabs is a leading AI voice synthesis platform offering ultra-realistic text-to-speech, voice cloning, and voice dubbing capabilities in 29+ languages. It is widely used by content creators, publishers, and enterprises for audio production and voice AI applications.

Audio & MusicVoice SynthesisVerified

#text-to-speech#voice cloning#voice synthesis

Murf AI

Murf AI is an AI voice generator and voiceover studio that offers 120+ realistic AI voices in 20 languages for creating professional-quality voiceovers for videos, presentations, and podcasts. It includes pitch and speed control, and direct video-to-voice synchronization.

Audio & MusicText-to-SpeechVerified

#text-to-speech#voiceover#ai voices

Speechify

Speechify is an AI text-to-speech reading app that converts any text—PDFs, articles, books, emails—into natural-sounding audio, enabling users to consume written content at up to 9x speed. It is popular among students, people with dyslexia, and professionals for accessible content consumption.

Audio & MusicText-to-SpeechVerified

#text-to-speech#accessibility#reading app

WellSaid Labs

WellSaid Labs is an enterprise-grade AI voice generation platform that creates realistic, studio-quality voiceovers from text for training, marketing, and product experiences. It offers branded voice studio avatars and high compliance standards for enterprise customers.

Audio & MusicText-to-SpeechVerified

#enterprise tts#ai voiceover#voice studio

Resemble AI

Resemble AI is a voice cloning and AI voice generation platform that creates custom AI voice skins from as little as 3 seconds of audio, with real-time voice cloning capabilities and API integration. It is used for applications in gaming, media, advertising, and virtual assistants.

Audio & MusicVoice CloningVerified

#voice cloning#custom ai voice#real-time voice

Coqui

Coqui was an open-source text-to-speech and voice cloning platform offering the TTS library and Studio tool for creating and managing AI voices with emotional range and multilingual support. While Coqui AI shut down in early 2024, the open-source TTS models remain active on GitHub.

Audio & MusicVoice Synthesis

#open source tts#voice cloning#emotional voice

Bark

Bark is an open-source text-to-audio model by Suno AI that can generate highly realistic speech, music, background noise, and sound effects from text prompts. It supports multiple languages and voice styles and is available on GitHub for self-hosting.

Audio & MusicText-to-Speech

#open source#text-to-speech#sound effects

Tortoise TTS

Tortoise TTS is an open-source multi-voice text-to-speech model known for producing highly expressive and natural-sounding speech with voice cloning capabilities from audio references. It prioritizes audio quality over speed and is used in research and creative audio applications.

Audio & MusicText-to-Speech

#open source tts#voice cloning#expressive speech

LOVO AI

LOVO AI is an AI voice generator and video creation platform offering 500+ realistic voices across 100 languages for voiceovers, video production, and podcast creation. It includes an AI writer and video editor alongside its voice generation tools.

Audio & MusicText-to-SpeechVerified

#ai voiceover#text-to-speech#multilingual

Listnr

Listnr is an AI voice generator that converts text content into realistic speech and podcasts, enabling content creators and publishers to create audio versions of their written content. It supports 900+ AI voices across 142 languages with an embeddable audio player.

Audio & MusicText-to-SpeechVerified

#text-to-speech#ai voice#podcast creation

Narakeet

Narakeet is an AI video and audio narration tool that converts scripts, PowerPoint presentations, and documents into narrated videos and audio files with 700+ AI voices in 90+ languages. It simplifies e-learning content creation by automating voiceover production.

Audio & MusicText-to-SpeechVerified

#narration tool#tts#e-learning

Async

Podcastle is an AI-powered podcast creation platform that provides studio-quality recording, AI voice cloning, automated transcription, and editing tools in a browser-based interface. It enables podcasters to create professional episodes without hardware studios.

Audio & MusicPodcast ToolsVerified

#podcast creation#ai recording#voice cloning

Adobe Podcast

Adobe Podcast (Enhance) is an AI-powered audio tool by Adobe that enhances spoken audio quality to sound like it was recorded in a professional studio, removing background noise and microphone imperfections. It is available as a free web tool and integrates with Adobe Creative Cloud.

Audio & MusicPodcast ToolsVerified

#podcast tools#audio enhancement#noise removal

Cleanvoice

Cleanvoice is an AI audio cleaning tool that automatically removes filler words, stutters, mouth noise, and background noise from podcast and audio recordings. It processes audio files in minutes without requiring manual editing.

Audio & MusicAudio EditingVerified

#podcast editing#filler word removal#ai audio cleaning

Suno AI

Suno AI is an AI music generation platform that creates complete, original songs with vocals, instruments, and lyrics from simple text descriptions, enabling anyone to create professional-sounding music without musical training. It has quickly become one of the most popular AI music creation tools.

Audio & MusicMusic GenerationVerified

#ai music generation#text-to-music#song creation

Udio

Udio is an AI music generation platform that creates high-quality, full songs from text prompts across any musical genre, with advanced controls for style, instrumentation, and song structure. It is positioned as a creative tool for musicians and non-musicians alike.

Audio & MusicMusic Generation

#ai music#music generation#song creation

AIVA

AIVA is an AI music composition tool that creates original emotional soundtrack music for films, games, and content using deep learning models trained on classical and contemporary compositions. It supports a wide variety of musical styles and allows full rights to generated music on paid plans.

Audio & MusicMusic Generation

#ai music composer#soundtrack#film music

Soundraw

Soundraw is an AI music generator that creates royalty-free, customizable music tracks for video content, with controls for mood, genre, length, and tempo. Content creators can customize generated tracks to perfectly fit their video timing needs.

Audio & MusicMusic GenerationVerified

#royalty-free music#ai music#customizable music

Boomy

Boomy is an AI music creation platform that enables users to create and publish original music in seconds, even without musical experience, and earn royalties when tracks are streamed on platforms like Spotify. It simplifies music creation for aspiring artists.

Audio & MusicMusic Generation

#ai music creation#royalties#music publishing

Amper Music

Amper Music (now integrated into Shutterstock) was an AI music composition tool that generated royalty-free music for video content. Its technology has been acquired by Shutterstock to power their AI music generation capabilities for the Shutterstock library.

Audio & MusicMusic Generation

#royalty-free music#ai music#shutterstock

Beatoven.ai

Beatoven.ai is an AI music generator that creates royalty-free, mood-based background music for videos and podcasts with customizable track sections. Users can compose unique music by specifying mood, genre, and tempo for each section of their content.

Audio & MusicMusic GenerationVerified

#ai music#mood music#royalty-free

Loudly

Loudly is an AI music platform that provides an extensive library of AI-generated royalty-free music loops and stems for creators, along with AI music generation tools. It is used by music producers, video editors, and social media creators for unique background tracks.

Audio & MusicMusic GenerationVerified

#ai music#music loops#royalty-free

Soundful

Soundful is an AI background music generator platform that provides content creators with unique, royalty-free tracks generated at the click of a button across multiple genres. Each generated track is unique to prevent copyright claims and ensures commercial use rights.

Audio & MusicMusic Generation

#ai background music#royalty-free#unique tracks

Epidemic Sound

Epidemic Sound is a leading music licensing platform with AI-powered tools for discovering the perfect soundtrack and sound effects for creative content, offering a large catalog of royalty-free music. Its subscription covers use across YouTube, social media, podcasts, and streaming platforms.

Audio & MusicMusic GenerationVerified

#music licensing#royalty-free#soundtrack

Artlist

Artlist is a music and sound effects licensing platform with AI-powered search and curation tools that help creators find the perfect royalty-free music for video projects. It offers a flat-fee subscription covering unlimited downloads and worldwide commercial licensing.

Audio & MusicMusic GenerationVerified

#music licensing#royalty-free#sound effects

Splice

Splice is an AI-powered music creation platform offering a vast library of royalty-free samples, loops, and presets alongside AI tools for generating unique sounds and beats. It is widely used by music producers for sample discovery, collaboration, and creative inspiration.

Audio & MusicMusic GenerationVerified

#music production#samples#loops

BandLab

Free

BandLab is a free cloud-based music creation platform with AI tools, a built-in DAW, social sharing features, and collaboration capabilities for musicians of all skill levels. Its AI tools include melody and beat generation assistance for music production.

Audio & MusicMusic GenerationVerified

#music creation#daw#collaboration

Lalal.ai

Lalal.ai is an AI stem separation tool that extracts vocals, instruments, and individual audio elements from mixed tracks with high quality using neural networks. It is used by musicians, producers, and video editors to isolate specific audio components from songs.

Audio & MusicAudio EditingVerified

#stem separation#vocal isolation#audio extraction

Moises AI

Moises AI is an AI music app for musicians that separates audio stems, enables pitch and tempo adjustment, generates chord detection, and provides a smart metronome for practice and performance. It is designed to help musicians learn songs, create remixes, and practice instruments.

Audio & MusicAudio EditingVerified

#stem separation#chord detection#music practice

AudioStrip

Free

AudioStrip is a free online tool that uses AI to separate vocals from instrumentals in audio tracks, providing karaoke versions and vocal isolation in minutes. It is accessible directly from the browser without account creation.

Audio & MusicAudio Editing

#vocal isolation#karaoke#stem separation

Soundtrap

Soundtrap is an AI-powered online music studio by Spotify that enables collaborative music and podcast recording, mixing, and production directly in a browser. It is widely used in education and by beginner to intermediate musicians for collaborative creative projects.

Audio & MusicMusic GenerationVerified

#online music studio#collaboration#podcast recording

Voicemod

Voicemod is an AI-powered real-time voice changer and soundboard for gamers, streamers, and content creators, offering hundreds of voice effects and custom voice skins. It integrates with platforms like Discord, Zoom, and gaming applications.

Audio & MusicVoice SynthesisVerified

#voice changer#real-time voice#gaming

Altered

Altered is a professional AI voice editing and transformation platform that allows users to change their voice to professional AI voices in post-production or in real time, designed for film, podcast, and game audio production. It offers high-quality voice morphing with fine control over pitch, timbre, and style.

Audio & MusicVoice Synthesis

#voice editing#voice morphing#professional audio

Voice.ai

Voice.ai is a free real-time AI voice changer platform that enables users to transform their voice during live calls, gaming, and streaming with a library of 1000+ community voice filters. It provides a desktop application for Windows and Mac users.

Audio & MusicVoice Synthesis

#voice changer#real-time voice#free voice ai

Whisper

Whisper is an open-source automatic speech recognition system by OpenAI trained on 680,000 hours of multilingual web audio data, offering near human-level robustness and accuracy in English and 99 other languages. It is available on GitHub and via the OpenAI API.

Audio & MusicSpeech-to-TextVerified

#speech recognition#open source#transcription

AssemblyAI

AssemblyAI is a speech-to-text API platform offering highly accurate transcription, speaker diarization, sentiment analysis, content moderation, and AI audio intelligence features for developers. It is used to build applications requiring audio understanding and voice analytics.

Audio & MusicSpeech-to-TextVerified

#speech-to-text api#transcription api#audio intelligence

Deepgram

Deepgram is an AI speech recognition API platform that provides real-time and batch transcription with industry-leading accuracy, low latency, and speaker diarization for enterprise and developer applications. It powers voice features in thousands of applications globally.

Audio & MusicSpeech-to-TextVerified

#speech recognition api#real-time transcription#voice ai

Rev AI

Rev AI is a speech recognition API from Rev.com that provides automated transcription, captions, and subtitles with high accuracy for media, enterprises, and developers. It also offers human transcription services for cases requiring the highest accuracy.

Audio & MusicSpeech-to-TextVerified

#transcription api#speech-to-text#captions

Sonix

Sonix is an AI transcription platform that converts audio and video files to text with high accuracy in 40+ languages, offering an in-browser editor for reviewing and correcting transcriptions. It is used by journalists, researchers, and legal professionals for automated transcription workflows.

Audio & MusicSpeech-to-TextVerified

#transcription#ai transcription#multilingual

Happy Scribe

Happy Scribe is an AI transcription and subtitle platform that automatically converts audio and video to text in 120+ languages, with human proofreading options for maximum accuracy. It is designed for journalists, academics, and media professionals.

Audio & MusicSpeech-to-TextVerified

#transcription#subtitles#multilingual

Trint

Trint is an AI transcription platform that converts audio and video files to searchable, editable text, built for journalists, media teams, and research professionals. It offers collaborative editing, story building tools, and integration with newsroom workflows.

Audio & MusicSpeech-to-TextVerified

#transcription#journalism#collaborative editing

Auphonic

Auphonic is an AI audio post-production service that automatically levels, normalizes, and enhances audio quality for podcasts, interviews, and video content using intelligent processing algorithms. It handles loudness normalization to broadcast standards and automated chapter generation.

Audio & MusicAudio EditingVerified

#audio post-production#loudness normalization#podcast processing

iZotope RX

iZotope RX is the industry-standard AI audio repair and restoration software used by professional audio engineers and post-production teams to remove noise, hum, clicks, and other audio artifacts from recordings. It includes advanced AI-powered tools like Music Rebalance and Dialogue Isolation.

Audio & MusicAudio EditingVerified

#audio repair#noise removal#post-production

Landr

Landr is an AI-powered music mastering platform that automatically masters audio tracks to professional standards with intelligent analysis of dynamics, frequency balance, and stereo width. It is used by independent musicians and producers to prepare tracks for release on streaming platforms.

Audio & MusicAudio Editing

#music mastering#ai audio#music distribution

Endel

Endel is an AI-powered soundscape and music app that generates personalized, real-time audio environments for focus, relaxation, sleep, and movement based on inputs like time of day, weather, and heart rate. Its technology is based on psychoacoustic science and has been licensed by major labels.

Audio & MusicMusic GenerationVerified

#personalized soundscapes#focus music#sleep sounds

Brain.fm

Brain.fm is an AI music platform that generates functional music specifically designed to enhance focus, relaxation, and sleep by leveraging neuroscience principles and AI-generated audio patterns. It is used by professionals and students for deep work and cognitive performance.

Audio & MusicMusic Generation

#focus music#ai music#neuroscience

Splash

Splash is an AI music creation platform focused on making music creation accessible and fun through AI-assisted beat and melody generation, particularly targeting younger creators and music enthusiasts. It offers an interactive interface for creating and sharing music.

Audio & MusicMusic Generation

#ai music creation#accessible music#beat making

Stability Audio