People have large personal ebook libraries but audiobook versions are expensive ($15-30 each) or unavailable. Cloud TTS services raise privacy concerns and require connectivity.
A mobile app that batch-converts personal ebook collections into high-quality audiobooks using on-device TTS, with chapter detection, bookmarking, and export to standard audio formats.
One-time purchase ($9.99) or subscription ($3-5/mo) for premium voices and unlimited conversions.
Real pain but not hair-on-fire. Audiobooks cost $15-30 each; a 50-book library = $750-1500 to convert. Academic readers and accessibility users feel this acutely. However, many casual readers tolerate the status quo (just reading). The Reddit signal (132 upvotes on Kokoro TTS on-device) confirms a motivated technical audience, but mass-market pain is moderate.
TAM: ~500M global ebook readers, ~200M audiobook listeners. Serviceable market: ebook owners who want audio versions = ~50-100M. Realistic early adopters: tech-savvy readers with large EPUB/PDF libraries = 2-5M. At $5/mo or $10 one-time, even 100K users = $500K-6M ARR. Not a unicorn market but strong lifestyle business potential.
Mixed signals. Speechify proves people pay $139/yr for TTS, but that's a polished product with heavy marketing. The target audience (personal library owners, privacy-conscious, technical users) tends to prefer one-time purchases and resists subscriptions. $9.99 one-time is very payable but limits revenue. $3-5/mo subscription for premium voices is viable but needs strong justification. Accessibility users and academics may have institutional budgets.
On-device TTS is the hard part. Kokoro TTS runs 20x realtime on CPU, which is promising, but mobile deployment adds complexity: model size (~500MB-1GB for quality voices), battery drain, heat management, and iOS/Android ML runtime differences. EPUB parsing is straightforward; PDF text extraction is notoriously messy. Chapter detection across formats is non-trivial. A solo dev could build a functional MVP in 6-8 weeks but voice quality parity with cloud services will be the ongoing battle. Core risk: Apple/Google could ship better system TTS at any time.
This is the strongest dimension. NO existing product combines: (1) on-device TTS, (2) batch ebook conversion, (3) export to standard audiobook formats (M4B/MP3), and (4) chapter-aware processing. Speechify/ElevenLabs are cloud-only and expensive. Voice Dream doesn't export audio. Built-in OS TTS can't create audiobook files. The 'personal audiobook factory' that runs offline with no subscription is genuinely unoccupied territory.
Challenging for pure subscription. The core value (convert my books) feels like a one-time job, not ongoing. Premium voices, new language packs, and speed/quality improvements can justify ongoing payment, but many users will convert their library and churn. Hybrid model (one-time base + subscription for premium voices) is more honest but caps revenue. Compare: users buy a scanner app once, they don't subscribe to it.
- +Clear competition gap: no one owns 'offline ebook-to-audiobook' yet
- +On-device TTS tech (Kokoro, Apple Neural Engine) just crossed the quality threshold — perfect timing
- +Privacy angle is a genuine differentiator vs Speechify/ElevenLabs
- +Strong accessibility use case provides defensible niche and potential institutional sales
- +Low marginal cost (no cloud inference bills) enables aggressive pricing
- !Apple or Google could add 'create audiobook from EPUB' as an OS feature, instantly commoditizing the core value
- !On-device voice quality may disappoint users expecting Audible-grade narration — expectation management is critical
- !PDF text extraction is unreliable (especially scanned/academic PDFs), leading to poor output and bad reviews
- !One-time purchase model limits LTV; subscription model faces churn after initial library conversion
- !Mobile TTS model size (500MB-1GB+) and battery drain may frustrate users on older devices
Cloud-based TTS app that reads aloud EPUBs, PDFs, web pages, and documents with premium AI voices. Multi-platform with OCR and syncing.
iOS TTS reader app supporting EPUB, PDF, DAISY, and other formats with premium voice packs and accessibility features.
Cross-platform TTS tool for documents, web pages, and ebooks. Offers desktop, web, and mobile apps with AI voices.
Mobile app leveraging ElevenLabs' top-tier AI voices to read PDFs, EPUBs, articles, and documents aloud.
Platform-native reading apps with system TTS integration. Apple also offers auto-narrated audiobooks via Apple Books for publishers.
iOS app (iPhone only) that imports EPUB files, auto-detects chapters, converts to M4B audiobook format using on-device Kokoro TTS with 1-2 English voices. Include basic playback with speed control and bookmarking. Skip PDF initially — EPUB parsing is 10x more reliable and faster to build. Ship with a 'convert 3 books free' trial, then $9.99 unlock. Add premium voices and PDF support in v2.
Free (3 book conversions) → $9.99 one-time unlock (unlimited EPUB, 2 voices) → $3.99/mo Premium (10+ voices, PDF support, multiple languages, priority processing) → B2B/Education licensing for accessibility compliance ($99-299/seat/year) → API/white-label for indie publishers wanting to offer audio versions of their catalog
8-12 weeks to MVP with EPUB + 1-2 voices on iOS. First revenue in week 10-14 after TestFlight beta and App Store approval. Reaching $1K MRR likely takes 3-6 months given niche audience and need for word-of-mouth in reader communities (r/ebooks, r/audiobooks, r/accessibility). Faster if launched with a viral demo video showing side-by-side of $0 on-device vs $15 Audible audiobook.
- “I wanted a reading app where you could read, read and listen or just listen to books”
- “I need a nap that uses Kokoro TTS on Apple because I've been using one on Android called Sherpa”