Speaker diarization, 100+ languages, and AI-powered analysis — all through a single API. Starting at $0.30 per audio hour.
Upload audio. Get structured transcripts. Run your own AI prompts against them. Or use our LLM API directly.
Upload audio files via REST API. Get back speaker-diarized transcripts with word-level timestamps in JSON, plain text, CSV, or SRT. From $0.30/hr.
Bring your own prompts. We execute them against your transcripts using frontier-class LLMs. Compliance, sentiment, QA scoring: you define it. Unlimited prompts.
An OpenAI-compatible endpoint for general-purpose inference. Drop in your existing code and change only the base URL and model name. $2.15/M input tokens.
Send any audio file: WAV, MP3, OGG, or FLAC. We auto-detect stereo channels and language. Files up to 4 hours are supported.
Receive speaker-labeled segments with precise timestamps. Our pipeline automatically identifies who said what, even in noisy recordings.
Send your own prompt templates. We run them against the transcript and return structured JSON. Build any workflow — compliance, coaching, summarization.
Standard REST API. Upload a file, get a transcript. No SDKs required — works with cURL, Python, Node, anything.
import requests

# Upload the recording; the context manager closes the file after the request.
with open("call_recording.wav", "rb") as audio:
    response = requests.post(
        "https://api.voxparse.com/v1/transcribe",
        headers={"Authorization": "Bearer vox_sk_..."},
        files={"file": audio},
        data={
            "language": "en",
            "diarize": "true",
            "output_format": "json",
        },
    )
response.raise_for_status()  # stop here on an HTTP error instead of parsing an error body

transcript = response.json()
for segment in transcript["segments"]:
    print(f"[{segment['start']:.1f}s] {segment['speaker']}: {segment['text']}")
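Once the JSON response is in hand, the segments can be aggregated locally. The sketch below totals speaking time per speaker. It assumes each segment also carries an `end` field in seconds, which the example above does not show; the helper name `talk_time_by_speaker`, the `SPEAKER_0`-style labels, and the sample data are illustrative only.

```python
from collections import defaultdict


def talk_time_by_speaker(segments):
    """Sum segment durations per speaker label.

    Assumes each segment has 'start' and 'end' in seconds. The 'end' field
    is an assumption; only 'start', 'speaker', and 'text' appear in the
    example response handling above.
    """
    totals = defaultdict(float)
    for seg in segments:
        totals[seg["speaker"]] += seg["end"] - seg["start"]
    return dict(totals)


# Hand-written sample in the assumed response shape, not real API output.
sample_segments = [
    {"start": 0.0, "end": 4.5, "speaker": "SPEAKER_0", "text": "Thanks for calling."},
    {"start": 4.5, "end": 9.0, "speaker": "SPEAKER_1", "text": "Hi, I have a billing question."},
    {"start": 9.0, "end": 12.5, "speaker": "SPEAKER_0", "text": "Happy to help."},
]

for speaker, seconds in talk_time_by_speaker(sample_segments).items():
    print(f"{speaker}: {seconds:.1f}s")
```

If the real response uses different field names, only the two dictionary lookups inside the loop need to change.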
Prepay any amount. Usage is deducted from your balance at the rates below. Balances are valid for 6 months.
Deducted from your balance per job. Diarization and timestamps always included.
| Provider | Price / hr | Languages | Diarization |
|---|---|---|---|
| VoxParse Standard | $0.30 | 100+ | Included |
| Deepgram Growth | $0.39 | 36 | Extra cost |
| Deepgram Pay-as-you-go | $0.46 | 36 | Extra cost |
| Google Cloud STT | $0.96 | 125+ | Extra cost |
| Azure Speech | $1.00 | 100+ | Extra cost |
| AWS Transcribe | $1.44 | 100+ | Included |
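To make the standard rate concrete, here is a minimal cost sketch based on the $0.30 per audio hour figure above. It assumes usage is prorated by the second and rounded to the cent, which the page does not state; the function names are hypothetical.

```python
from decimal import Decimal

# VoxParse Standard rate from the table above, in dollars per audio hour.
RATE_PER_AUDIO_HOUR = Decimal("0.30")


def transcription_cost(duration_seconds):
    """Estimated charge in dollars for one job.

    Assumes per-second proration and rounding to the cent; the actual
    billing granularity is not documented here.
    """
    hours = Decimal(duration_seconds) / Decimal(3600)
    return (hours * RATE_PER_AUDIO_HOUR).quantize(Decimal("0.01"))


def hours_covered(balance_dollars):
    """Audio hours a prepaid balance covers at the standard rate."""
    return Decimal(str(balance_dollars)) / RATE_PER_AUDIO_HOUR


print(transcription_cost(90 * 60))       # estimate for a 90-minute call
print(f"{hours_covered(10):.1f} hours")  # audio hours covered by a $10 balance
```

`Decimal` is used instead of floats so that small per-job amounts do not accumulate rounding error when summed across many jobs.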
Drop-in replacement for your existing LLM integration. Same SDK, same format — just change two lines of code.
Frontier-class reasoning. OpenAI-compatible endpoint.
from openai import OpenAI

# Point the official OpenAI client at the VoxParse endpoint.
client = OpenAI(
    base_url="https://api.voxparse.com/v1",
    api_key="vox_sk_...",
)

response = client.chat.completions.create(
    model="apex-latest",
    messages=[{"role": "user", "content": "Summarize this call transcript..."}],
)
print(response.choices[0].message.content)
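One way to run your own prompt against a diarized transcript is to flatten the segments into speaker-prefixed lines and send them as the user message. The sketch below only builds the `messages` list, so it runs offline. The helper name `segments_to_messages` and the sample segments are hypothetical, and support for the `system` role is assumed from the endpoint being OpenAI-compatible.

```python
def segments_to_messages(segments, instruction):
    """Build a chat `messages` list from diarized segments.

    Each segment is rendered as "[start] speaker: text" so the model can
    see who spoke and when. The instruction goes in a system message
    (assumed to be supported by the OpenAI-compatible endpoint).
    """
    lines = [
        f"[{seg['start']:.1f}s] {seg['speaker']}: {seg['text']}"
        for seg in segments
    ]
    return [
        {"role": "system", "content": instruction},
        {"role": "user", "content": "\n".join(lines)},
    ]


# Hand-written sample segments, not real API output.
sample_segments = [
    {"start": 0.0, "speaker": "SPEAKER_0", "text": "Thanks for calling."},
    {"start": 4.5, "speaker": "SPEAKER_1", "text": "Hi, I have a billing question."},
]

messages = segments_to_messages(sample_segments, "Summarize this call in two sentences.")
print(messages[1]["content"])
```

The resulting list can be passed as the `messages` argument of `client.chat.completions.create(...)` in place of the inline example.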
| Provider | Input / 1M | Output / 1M |
|---|---|---|
| VoxParse Apex | $2.15 | $3.20 |
| OpenAI GPT-4o | $2.50 | $10.00 |
| Claude Sonnet 4.6 | $3.00 | $15.00 |
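As a rough illustration of how the per-token rates compare on a real workload, the sketch below prices one job across the three providers in the table above. The token counts are made-up inputs and the function name `llm_cost` is hypothetical; the rates are copied from the table as listed.

```python
from decimal import Decimal

# Dollars per 1M tokens as (input, output), copied from the table above.
PRICES = {
    "VoxParse Apex": (Decimal("2.15"), Decimal("3.20")),
    "OpenAI GPT-4o": (Decimal("2.50"), Decimal("10.00")),
    "Claude Sonnet 4.6": (Decimal("3.00"), Decimal("15.00")),
}
MILLION = Decimal(1_000_000)


def llm_cost(provider, input_tokens, output_tokens):
    """Dollar cost of one workload at the listed per-million-token rates."""
    in_rate, out_rate = PRICES[provider]
    return (input_tokens * in_rate + output_tokens * out_rate) / MILLION


# Example workload: 200k input tokens and 50k output tokens (illustrative numbers).
for name in PRICES:
    print(f"{name}: ${llm_cost(name, 200_000, 50_000):.2f}")
```

Because output tokens carry the larger price gap, the difference between providers grows with the share of output in the workload.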
Start with just $10. No commitments, no subscriptions. Get your API key in under a minute.