Now accepting early access requests

Audio transcription
that doesn't cost
a fortune.

Speaker diarization, 97+ languages, and AI-powered analysis — all through a single API. Starting at $0.50 per audio hour.

97+
Languages supported
$0.50
Per audio hour
40×
Faster than realtime
90%
Cheaper than AWS
Three products, one platform

Everything you need
to process voice data.

Upload audio. Get structured transcripts. Run your own AI prompts against them. Or use our LLM API directly.

Transcription API

Upload audio files via REST API. Get back speaker-diarized transcripts with word-level timestamps in JSON, plain text, CSV, or SRT.

$0.50/hr

AI Analysis Engine

Every transcription automatically receives AI-powered insights: summaries, key topics, sentiment analysis, and action items — all from a frontier-class LLM.

included in +AI plan

Apex LLM API

An OpenAI-compatible endpoint for general-purpose inference. Drop in your existing code — just change the base URL and model name.

$2.15/M input tokens
How it works

Three steps.
No infrastructure required.

Upload your audio

Send any audio file — WAV, MP3, OGG, FLAC. We auto-detect stereo channels and language. Files up to 4 hours supported.

Get diarized transcripts

Receive speaker-labeled segments with precise timestamps. Our pipeline automatically identifies who said what, even in noisy recordings.

Run AI analysis

Send your own prompt templates. We run them against the transcript and return structured JSON. Build any workflow — compliance, coaching, summarization.

Developer experience

One API call. That's it.

Standard REST API. Upload a file, get a transcript. No SDKs required — works with cURL, Python, Node, anything.

python
import requests

response = requests.post(
    "https://api.voxparse.com/v1/transcribe",
    headers={"Authorization": "Bearer vox_sk_..."},
    files={"file": open("call_recording.wav", "rb")},
    data={
        "plan": "transcription-ai",
        "language": "en",
    },
)

transcript = response.json()
for segment in transcript["segments"]:
    print(f"[{segment['start']:.1f}s] {segment['speaker']}: {segment['text']}")
Simple pricing

Add funds. Use as you go.
No subscriptions.

Prepay any amount. Usage is deducted from your balance at the rates below. Balances are valid for 6 months.

How much do you want to add?
$
Transcription
100
audio hours
$0.50 / hr
Transcription + AI
71
audio hours
$0.70 / hr

Two simple plans

Deducted from your balance per job. Speaker diarization, word-level timestamps, and 97+ languages always included.

Transcription
$0.50 / audio hour
  • 97+ languages, 40× real-time speed
  • Speaker diarization included
  • Word-level timestamps
  • JSON output with segments

How we compare

Provider Price / hr Languages Diarization
VoxParse $0.50 97+ Included
VoxParse + AI $0.70 97+ Included
Deepgram Growth $0.39 36 Extra cost
Deepgram Pay-as-you-go $0.46 36 Extra cost
Google Cloud STT $0.96 125+ Extra cost
Azure Speech $1.00 100+ Extra cost
AWS Transcribe $1.44 100+ Included
Apex LLM API

OpenAI-compatible.
Fraction of the cost.

Drop-in replacement for your existing LLM integration. Same SDK, same format — just change two lines of code.

apex-latest

Frontier-class reasoning. OpenAI-compatible endpoint.

Input $2.15/1M
Output $3.20/1M
python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.voxparse.com/v1",
    api_key="vox_sk_...",
)

response = client.chat.completions.create(
    model="apex-latest",
    messages=[{"role": "user", "content": "Summarize this call transcript..."}],
)
Provider Input / 1M Output / 1M
VoxParse Apex $2.15 $3.20
OpenAI GPT-4o $2.50 $10.00
Claude Sonnet 4.6 $3.00 $15.00

Start processing audio today.

Start with just $10. No commitments, no subscriptions. Get your API key in under a minute.

Get your API key