SPX to VOX Converter

Re-encode Speex speech into Dialogic VOX telephony audio

Drop files here. 1 GB maximum file size or Sign Up
to
Facebook Amazon Microsoft Tesla Nestle Walmart L'Oreal

IVR-Ready Audio

Convert your Speex VoIP recordings to VOX — the telephony industry standard for IVR prompts and call center audio.

Fast Processing

Both are lightweight speech codecs. The SPX to VOX conversion completes in seconds, even for long recordings.

Secure Conversion

SPX uploads are erased after conversion. VOX outputs are removed from our servers within 24 hours.

How to convert SPX to VOX

1

Select files from Computer, Google Drive, Dropbox, URL or by dragging it on the page.

2

Choose vox or any other format you need as a result (more than 200 formats supported)

3

Let the file convert and you can download your vox file right afterwards

About formats

Speex is an open-source audio codec purpose-built for speech compression, developed by Jean-Marc Valin under the Xiph.Org Foundation. First released in October 2002, it targets voice-over-IP, conferencing, and any scenario where spoken word needs to travel efficiently over a network. SPX files wrap Speex-encoded audio inside an Ogg container, pairing the codec's speech optimization with Ogg's streaming capabilities. Three sampling rates are supported — narrowband at 8 kHz, wideband at 16 kHz, and ultra-wideband at 32 kHz — along with variable bitrate encoding that adapts in real time to speech complexity. A standout advantage is its patent-free, BSD-licensed nature, which allowed developers to embed it freely in both commercial and open-source products. Speex also bundles acoustic echo cancellation, noise suppression, and automatic gain control, features that rival codecs typically delegate to external libraries. Although its creators officially recommend Opus as a successor since 2012, Speex remains deployed in legacy VoIP systems, archived recordings, and embedded devices where its lightweight decoder footprint is still valued.
Initial release: October 15, 2002
VOX is a headerless audio format built around Dialogic ADPCM encoding, widely adopted in telephony, interactive voice response (IVR) systems, and voice mail platforms since the 1980s. Each audio sample is compressed into 4 bits using an algorithm developed by Oki Electric and implemented in hardware on Dialogic Corporation's telephony interface cards. VOX files typically use a sampling rate of 6000 or 8000 Hz, producing extremely compact recordings optimized for speech intelligibility rather than musical fidelity. Because the format carries no header, playback software must know the sample rate and encoding parameters in advance — a trade-off that reduces overhead but demands careful file management. The primary advantage of VOX is storage efficiency: a one-minute voice recording at 8 kHz occupies roughly 240 KB, making it practical for systems storing thousands of prompts. Dialogic ADPCM conforms to the ITU-T G.726 standard, ensuring interoperability across telephony equipment from different vendors. Even as modern call centers migrate to IP-based systems with codecs like Opus, vast libraries of VOX recordings persist in legacy IVR deployments and compliance archives worldwide.
Initial release: 1983

Frequently Asked Questions

Why convert SPX to VOX?

VOX is the standard format for IVR systems and call center telephony. Converting from SPX creates files for interactive voice response platforms.

Are both SPX and VOX voice formats?

Yes — both handle speech. SPX was designed for VoIP, while VOX targets traditional telephony IVR and call center equipment.

What uses VOX files?

Dialogic telephony hardware, IVR systems, call center platforms, and auto-dialer software use VOX audio files.

Is VOX compressed?

Yes — VOX uses Dialogic OKI ADPCM compression, achieving approximately 4:1 compression for voice content.

Is the conversion free?

Standard SPX to VOX conversions are free on convertio.tools.