6.6mediumCONDITIONAL GO

Audiobook Converter for Personal Libraries

Turn any EPUB or PDF into a natural-sounding audiobook on your phone, no upload required.

DevToolsAvid readers with large digital libraries, self-published book readers, acade...
The Gap

People have large personal ebook libraries but audiobook versions are expensive ($15-30 each) or unavailable. Cloud TTS services raise privacy concerns and require connectivity.

Solution

A mobile app that batch-converts personal ebook collections into high-quality audiobooks using on-device TTS, with chapter detection, bookmarking, and export to standard audio formats.

Revenue Model

One-time purchase ($9.99) or subscription ($3-5/mo) for premium voices and unlimited conversions.

Feasibility Scores
Pain Intensity7/10

Real pain but not hair-on-fire. Audiobooks cost $15-30 each; a 50-book library = $750-1500 to convert. Academic readers and accessibility users feel this acutely. However, many casual readers tolerate the status quo (just reading). The Reddit signal (132 upvotes on Kokoro TTS on-device) confirms a motivated technical audience, but mass-market pain is moderate.

Market Size7/10

TAM: ~500M global ebook readers, ~200M audiobook listeners. Serviceable market: ebook owners who want audio versions = ~50-100M. Realistic early adopters: tech-savvy readers with large EPUB/PDF libraries = 2-5M. At $5/mo or $10 one-time, even 100K users = $500K-6M ARR. Not a unicorn market but strong lifestyle business potential.

Willingness to Pay6/10

Mixed signals. Speechify proves people pay $139/yr for TTS, but that's a polished product with heavy marketing. The target audience (personal library owners, privacy-conscious, technical users) tends to prefer one-time purchases and resists subscriptions. $9.99 one-time is very payable but limits revenue. $3-5/mo subscription for premium voices is viable but needs strong justification. Accessibility users and academics may have institutional budgets.

Technical Feasibility6/10

On-device TTS is the hard part. Kokoro TTS runs 20x realtime on CPU, which is promising, but mobile deployment adds complexity: model size (~500MB-1GB for quality voices), battery drain, heat management, and iOS/Android ML runtime differences. EPUB parsing is straightforward; PDF text extraction is notoriously messy. Chapter detection across formats is non-trivial. A solo dev could build a functional MVP in 6-8 weeks but voice quality parity with cloud services will be the ongoing battle. Core risk: Apple/Google could ship better system TTS at any time.

Competition Gap8/10

This is the strongest dimension. NO existing product combines: (1) on-device TTS, (2) batch ebook conversion, (3) export to standard audiobook formats (M4B/MP3), and (4) chapter-aware processing. Speechify/ElevenLabs are cloud-only and expensive. Voice Dream doesn't export audio. Built-in OS TTS can't create audiobook files. The 'personal audiobook factory' that runs offline with no subscription is genuinely unoccupied territory.

Recurring Potential5/10

Challenging for pure subscription. The core value (convert my books) feels like a one-time job, not ongoing. Premium voices, new language packs, and speed/quality improvements can justify ongoing payment, but many users will convert their library and churn. Hybrid model (one-time base + subscription for premium voices) is more honest but caps revenue. Compare: users buy a scanner app once, they don't subscribe to it.

Strengths
  • +Clear competition gap: no one owns 'offline ebook-to-audiobook' yet
  • +On-device TTS tech (Kokoro, Apple Neural Engine) just crossed the quality threshold — perfect timing
  • +Privacy angle is a genuine differentiator vs Speechify/ElevenLabs
  • +Strong accessibility use case provides defensible niche and potential institutional sales
  • +Low marginal cost (no cloud inference bills) enables aggressive pricing
Risks
  • !Apple or Google could add 'create audiobook from EPUB' as an OS feature, instantly commoditizing the core value
  • !On-device voice quality may disappoint users expecting Audible-grade narration — expectation management is critical
  • !PDF text extraction is unreliable (especially scanned/academic PDFs), leading to poor output and bad reviews
  • !One-time purchase model limits LTV; subscription model faces churn after initial library conversion
  • !Mobile TTS model size (500MB-1GB+) and battery drain may frustrate users on older devices
Competition
Speechify

Cloud-based TTS app that reads aloud EPUBs, PDFs, web pages, and documents with premium AI voices. Multi-platform with OCR and syncing.

Pricing: Free tier (limited voices
Gap: Extremely expensive for casual users, cloud-dependent (privacy concern), no offline batch conversion, no export to standard audio files (MP3/M4B), locks you into their ecosystem
Voice Dream Reader

iOS TTS reader app supporting EPUB, PDF, DAISY, and other formats with premium voice packs and accessibility features.

Pricing: Switched to subscription ~$50-60/year (~$5/mo
Gap: No batch export to audio files, subscription switch alienated loyal users, limited voice quality vs cloud AI, iOS-only, no M4B/audiobook format export
NaturalReader

Cross-platform TTS tool for documents, web pages, and ebooks. Offers desktop, web, and mobile apps with AI voices.

Pricing: Free tier; Premium ~$10/mo; Professional ~$20/mo; also one-time desktop purchase option
Gap: Cloud-dependent for best voices, no true on-device processing, mobile app is weaker than desktop, no dedicated audiobook features (chapters, bookmarks), export is capped by character limits
ElevenLabs Reader

Mobile app leveraging ElevenLabs' top-tier AI voices to read PDFs, EPUBs, articles, and documents aloud.

Pricing: Free tier with limited characters; paid via ElevenLabs subscriptions starting ~$5/mo scaling to $22+/mo
Gap: 100% cloud-dependent (privacy issue), no on-device mode, usage caps make bulk book conversion prohibitively expensive, no offline support, no audiobook export (MP3/M4B), no batch processing
Apple Books / Google Play Books (Built-in TTS)

Platform-native reading apps with system TTS integration. Apple also offers auto-narrated audiobooks via Apple Books for publishers.

Pricing: Free (built into OS
Gap: No export to audio files whatsoever, limited to their ecosystem's books, no batch conversion, no chapter-aware audiobook creation, voice quality still noticeably robotic, no EPUB sideloading on Google, no bookmarking/progress sync for TTS sessions
MVP Suggestion

iOS app (iPhone only) that imports EPUB files, auto-detects chapters, converts to M4B audiobook format using on-device Kokoro TTS with 1-2 English voices. Include basic playback with speed control and bookmarking. Skip PDF initially — EPUB parsing is 10x more reliable and faster to build. Ship with a 'convert 3 books free' trial, then $9.99 unlock. Add premium voices and PDF support in v2.

Monetization Path

Free (3 book conversions) → $9.99 one-time unlock (unlimited EPUB, 2 voices) → $3.99/mo Premium (10+ voices, PDF support, multiple languages, priority processing) → B2B/Education licensing for accessibility compliance ($99-299/seat/year) → API/white-label for indie publishers wanting to offer audio versions of their catalog

Time to Revenue

8-12 weeks to MVP with EPUB + 1-2 voices on iOS. First revenue in week 10-14 after TestFlight beta and App Store approval. Reaching $1K MRR likely takes 3-6 months given niche audience and need for word-of-mouth in reader communities (r/ebooks, r/audiobooks, r/accessibility). Faster if launched with a viral demo video showing side-by-side of $0 on-device vs $15 Audible audiobook.

What people are saying
  • I wanted a reading app where you could read, read and listen or just listen to books
  • I need a nap that uses Kokoro TTS on Apple because I've been using one on Android called Sherpa