May 2025 — archived

Voice Marketplace (May 2025)

Industry-tuned TTS voices and STT pipelines for IVR, customer support, training, and accessibility — pay per minute, licensed for commercial use.

A voice marketplace lists pre-trained TTS voices and STT pipelines licensed for commercial use. The interesting growth is in industry-tuned voices that pronounce domain vocabulary correctly — voices that handle "metoprolol" cleanly for cardiology IVRs, account numbers cleanly for banking, and technical English cleanly for non-native call-centre traffic.

The Swfte voice marketplace covers healthcare, banking, telecom, hospitality, and call-centre verticals. Every voice publishes its WER on a domain-specific test set, supports SSML, and integrates with the avatar marketplace for full audio-visual pipelines.

Why buy from a voice marketplace

  • Industry-tuned pronunciation — correct on domain vocabulary, not just generic English.
  • Per-minute pricing, no seat licences, no minimum commitment.
  • Pre-cleared commercial licensing including impersonation safeguards.
  • SSML support out of the box — no integration project.
  • Compose with avatar marketplace listings for end-to-end audio-visual pipelines.

Sample listings

Six representative AI voices from the catalogue. Pricing as of 2026-05-09.

ListingCategoryDescriptionUnit price
Cardiology IVR voice (en-US)HealthcareDomain pronunciation, calm tone$0.020 / minute
Banking IVR voice (en-AU)FinanceAccount numbers, currency, names$0.018 / minute
Telecom support voice (es-MX)TelecomNetwork terms, technical Spanish$0.019 / minute
Hospitality concierge voice (en-GB)HospitalityWarm, multilingual handoff$0.017 / minute
Multilingual TTS (47 langs)GenericNative quality, single voice across langs$0.022 / minute
Call-centre STT pipelineCustomer OpsIndustry-aware transcription + redaction$0.012 / minute

FAQ

What is a voice marketplace?

A catalogue of pre-trained TTS voices and STT pipelines for commercial use. Listings include industry-tuned voices (healthcare, finance, telecom) that handle domain vocabulary correctly, and generic multilingual voices for broad coverage.

How is an industry-tuned voice different from a generic one?

Industry-tuned voices are evaluated on a domain-specific test set — drug names for cardiology, account-number formats for banking, technical English for telecom. Generic voices score 80-85% on these; industry-tuned voices clear 96%+.

How is voice pricing structured?

Almost always per minute of synthesised or transcribed audio. Industry-tuned voices command a small premium (10-30%) over generic voices. Volume discounts kick in past ~50K minutes/month.

How does a voice marketplace compose with avatar and workflow marketplaces?

Buyers commonly chain: workflow marketplace (event detection) → avatar marketplace (video) → voice marketplace (audio) → MCP marketplace (delivery). The composition is the leverage; the marketplaces are just the catalogue.

Further reading

Listings, pricing, and benchmarks updated 2026-05-09. Verticals indexed: AI Marketplace, AI Workflow Marketplace, MCP Marketplace, Avatar Marketplace, Voice Marketplace, Model Marketplace.