Public-facing kiosks (retail, healthcare, museums) need touchless interaction for hygiene and accessibility, but building gesture-controlled interfaces from scratch is expensive
A hosted platform where businesses configure touchless kiosk experiences with a drag-and-drop builder, using browser-based hand tracking — no special hardware needed, just a webcam
Subscription per kiosk per month ($29-99/mo) with volume discounts
The pain was acute during COVID but has cooled significantly. Most businesses that urgently needed touchless already deployed QR-to-phone solutions, which are simpler and cheaper. The remaining demand is real but not desperate — it's a nice-to-have for hygiene-conscious brands and an accessibility improvement, not a hair-on-fire problem. Museums and trade shows care more about 'wow factor' than hygiene.
TAM for touchless kiosk segment is $2-4B and growing. The serviceable market for a software-only, webcam-based platform is smaller — likely $200-500M — targeting businesses that want touchless but don't want expensive hardware. Enough room for a good business, not a venture-scale outcome without expanding the definition.
$29-99/kiosk/month is reasonable for enterprise buyers but faces two headwinds: (1) QR-to-phone touchless is nearly free and 'good enough' for many, (2) the decision-maker at a hospital or retail chain will ask 'why not just use QR codes?' You need to sell the experience premium, not just touchless. Museums and trade shows — where the interaction IS the product — are the strongest WTP segment.
A solo dev can build an MVP drag-and-drop builder + MediaPipe hand tracking integration in 6-8 weeks. The core tech works — MediaPipe in-browser is proven. BUT the hard part is reliability in real-world kiosk environments: variable lighting, different distances, multiple hands, sunlight glare, slow hardware. Getting from 'cool demo' to 'works reliably 8 hours/day in a hospital lobby' is a significant engineering gap that will extend beyond MVP.
Clear whitespace: no one offers browser-based webcam hand tracking + drag-and-drop builder + SaaS pricing. Ultraleap requires expensive hardware. Intuiface is expensive and gesture is bolted on. Open-source tools are raw. However, this gap exists partly because the market may not be big enough to attract well-funded competitors, and Intuiface could add MediaPipe integration in a quarter if demand materialized.
Strong subscription fit. Per-kiosk/month pricing is natural and industry-standard for digital signage/kiosk software. Businesses deploying kiosks expect ongoing software costs. Analytics, content updates, and remote management create genuine ongoing value. Multi-location chains scale linearly.
- +Clear competitive whitespace — no one combines webcam-based tracking + drag-and-drop builder + affordable SaaS
- +Zero special hardware requirement dramatically lowers adoption barrier vs. Ultraleap/Neonode
- +Privacy-first architecture (camera data never leaves device) is a genuine selling point for healthcare and government
- +Strong demo/viral potential — browser-based means instant shareable demos
- +Per-kiosk subscription model is proven and scalable in the digital signage industry
- !Reliability gap: webcam tracking in real-world kiosk environments (lighting, glare, distance) is significantly harder than in controlled demos — this will be the make-or-break technical challenge
- !QR-to-phone 'good enough' problem: many businesses have already solved touchless with free QR codes, making gesture control a harder sell
- !Incumbents can pivot: Intuiface adding MediaPipe support or Ultraleap releasing a webcam SDK would compress the competitive window
- !Long enterprise sales cycles: hospitals, retail chains, and museums have 3-12 month procurement processes that will strain a bootstrapped founder
- !Post-COVID demand normalization: touchless urgency has faded, shifting this from 'must-have' to 'nice-to-have' for most buyers
Hardware+software hand tracking for kiosks using proprietary IR sensors with mid-air haptic feedback. Their TouchFree overlay translates hand gestures into touch events on existing kiosk content.
Interactive digital signage and kiosk platform with a WYSIWYG drag-and-drop composer. Supports multiple input types including touch, gestures
Swedish company providing optical touch/gesture sensor modules using infrared light for proximity and gesture detection. Used in automotive, kiosks, and consumer electronics.
Norwegian company offering gesture-controlled digital signage and kiosk solutions using depth cameras
Google's open-source ML framework providing browser-based hand tracking via standard webcam using WebAssembly/WebGL. Free toolkit that runs hand landmark detection entirely client-side.
A single-page web app where a user uploads a menu/catalog/floorplan image, defines clickable hotspot regions via drag-and-drop, and gets a unique URL that runs the gesture-controlled kiosk experience in any browser with a webcam. Start with ONE vertical — museum exhibit guides or trade show product showcases — where the 'wow factor' justifies adoption. Include a simple analytics dashboard (sessions, dwell time, popular selections). Skip multi-kiosk management for MVP.
Free tier (1 kiosk, watermarked, basic analytics) → Pro at $49/kiosk/month (white-label, full analytics, priority support) → Enterprise at $99/kiosk/month (SSO, API access, custom branding, SLA, multi-location management). Upsell with premium gesture-interaction templates for specific verticals (restaurant menus, real estate tours, museum exhibits) at $199-499 one-time.
8-14 weeks to first dollar. 4-6 weeks for MVP build, 2-4 weeks to land first paying pilot (likely a museum, trade show vendor, or real estate showroom via direct outreach). Enterprise contracts (hospitals, retail chains) will take 4-8 months. Realistic path to $5K MRR is 6-9 months.
- “All running client-side in the browser”
- “No backend, no server, camera data never leaves the device”