Fish Speech
Fish Speech is a text-to-speech (TTS) tool developed by the creators of So-VITS-SVC and Bert-VITS2.
Starting price
From $0
Best for
AI Voice Generator
Free trial
✓ Free tier
Product information
What this product does
Fish Speech is a text-to-speech (TTS) tool developed by the creators of So-VITS-SVC and Bert-VITS2. It can synthesize natural and fluent speech from just 15 seconds of any voice, maintaining the given timbre, style, and accent. Fish Audio is a platform for audio generation, offering various voice models for users to discover and use.
Sourced from fish.audio
Works With
Platforms
- Web
- API
AI Models
- Claude
- Gemini
- Midjourney
Problem Solved
What problem does Fish Speech solve?
Text-to-speech tool that synthesizes natural speech from short voice samples.
Company & Maker
Who built Fish Speech?
- Product: Fish Speech
- Live since: Jul 2024
The maker hasn't published a public statement we can verify yet.
Researched with AI · Data refreshed 2 weeks ago · How we score
Fish Speech is an open-source text-to-speech engine that clones any voice from a 15-second sample, and Fish Audio is its companion platform for hosting and discovering voice models.
Who should use
Buyer-fit is derived from approved personas and use-case evidence. Search-volume noise stays labeled separately.
Best-fit workflows
- Generating speech in a specific voice for audiobooks
- Creating voiceovers for videos
- Developing virtual assistants with personalized voices
- Generating speech for accessibility purposes
- Generating speech from text in a voice cloned from a 15-second audio sample
- Drafting multilingual voiceovers for videos using community voice models
Pricing reality
Vendor stated
Free and open-source
Actual cost
Free for self-hosted use; Fish Audio platform may have usage limits or paid tiers for commercial use
Verified May 2026 / Vendor pricing evidence only
Starting price
From $0
Free tier available.
Mindber only shows named plan cards when they come from linked pricing evidence. If the tier data is incomplete, this page falls back to the verified starting price instead of fabricating plan names or $0 tiers.
Free Tier
$0/mo
- Up to 7 minutes of highest quality S1 and S2 generation
- Up to 500 characters per generation
- Standard generation speed
- 3 public voice slots
Plus
$5.50/mo
- Up to 200 minutes of S1 and S2 generation monthly
- Priority generation on our latest models
- Up to 15,000 characters per generation
- Enhanced voice cloning
- Unlimited public + 10 private voice slots
- Commercial use allowed
- API access (pay-as-you-go)
- 7 day money back guarantee
Pro
$37.50/mo
- Up to 1,620 minutes of S1 and S2 generation monthly
- Priority generation on our latest models
- Up to 30,000 characters per generation
- Enhanced voice cloning
- Unlimited voice slots
- Commercial use allowed
- 3 team seats included
- API access (pay-as-you-go)
Max
$749/mo
- Up to 6,250 minutes of S1 and S2 generation monthly
- Priority generation on our latest models
- Up to 30,000 characters per generation
- Enhanced voice cloning
- Unlimited voice slots
- Commercial use allowed
- 10 team seats included
- API access (pay-as-you-go)
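The per-generation character caps in the plan cards above (500, 15,000, and 30,000 characters) mean that longer scripts, such as audiobook chapters, have to be submitted in pieces. The following is a minimal sketch of one way to split a script at sentence boundaries so each piece stays under a cap. The function name `split_for_generation` is hypothetical, and the sketch assumes the cap is counted the same way as Python's `len()`, which the listing does not confirm.

```python
import re


def split_for_generation(text: str, max_chars: int) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence ends.

    Assumption: the platform counts characters the way len() does.
    """
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")

    # Break after '.', '!' or '?' when followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())

    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        # A single sentence longer than the cap is hard-split.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    script = "One. Two. Three."
    for piece in split_for_generation(script, max_chars=10):
        print(repr(piece))
```

Passing `max_chars=500` would match the Free Tier cap listed above. Splitting on sentence ends rather than at fixed offsets is a design choice that keeps each generated clip from starting or stopping mid-sentence.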
What pricing pages often hide
True cost for your case
Estimated spend: $0
Recurring billing is usually the real cost driver, not the first-month sticker price.
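As a rough illustration of how recurring cost could be estimated from the plan cards above, the sketch below picks the lowest-priced plan whose minute allowance covers a given monthly need and multiplies its price over a year. The `Plan` class and function names are hypothetical. The sketch assumes the Free Tier's 7 minutes renew monthly, that the listed prices are billed every month at the stated rate, and it ignores pay-as-you-go API charges and commercial-use restrictions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Plan:
    name: str
    monthly_usd: float
    minutes: int  # generation minutes included per month


# Figures copied from the plan cards on this page.
# Assumption: the Free Tier's 7 minutes are a monthly allowance.
PLANS = [
    Plan("Free Tier", 0.0, 7),
    Plan("Plus", 5.5, 200),
    Plan("Pro", 37.5, 1620),
    Plan("Max", 749.0, 6250),
]


def cheapest_plan(minutes_needed: int) -> Optional[Plan]:
    """Return the lowest-priced plan covering minutes_needed, or None."""
    candidates = [p for p in PLANS if p.minutes >= minutes_needed]
    return min(candidates, key=lambda p: p.monthly_usd, default=None)


def annual_cost(plan: Plan) -> float:
    """Twelve months at the listed monthly price."""
    return plan.monthly_usd * 12


if __name__ == "__main__":
    plan = cheapest_plan(100)
    if plan is not None:
        print(plan.name, annual_cost(plan))
```

A need above the largest listed allowance returns `None` here, since the listing gives no rate for usage beyond the Max plan.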
RELATED PRODUCTS
Suggested from linked alternatives
Hugging Face is an AI community building the future through open source and open science.
Suno is an AI music generator that allows users to create stunning original music in seconds from simple text prompts or advanced editing tools.
Engagement signals (30d)
Based on 6 linked public signals.
HOW THIS TOOL SCORES
Scores compare this tool with 61 similar tools in AI Voice Generator. Use them to judge Innovation and Capability before you shortlist.
Innovation Index™
Is it new, different, and easy to understand?
Functionality Score™
Does it cover the jobs buyers expect, and can it keep up?
Medium confidence (50%): some sub-scores have thin evidence.
Data sources
Traffic & search proof
Public view keeps the ratio story. Paid view unlocks exact monthly visits, full region coverage, and CPC-level keyword intelligence.
Latest traffic snapshot
Monthly visits
3.9M
Avg visit duration
00:05:24
Pages per visit
7.54
Bounce rate
36.81%
Top keywords
Brand terms stay labeled separately from intent and comparison demand.
| Keyword | Segment | Traffic | CPC |
|---|---|---|---|
| fish audio | Brand term | 229.82K | |
| fishaudio | Intent / comparison | 35.24K | |
| fish audio ai | Brand term | 9.07K | |
| fish ai | Brand term | 10.79K | |
| loquendo | Intent / comparison | 45.21K | |
Sources and methodology
Only evidence with a linked source record or explicit feature provenance is shown here. Data without provenance stays off the page.