MPEG to SPX Converter

Extract speech audio from MPEG as Speex SPX format

Drop files here. 1 GB maximum file size or Sign Up
to

Settings

Set the overall output Speex audio bitrate. Designed for human speech encoding, Speex reaches transparency at ultra-low bitrate with a maximum bitrate of 44 kbps.
Set the number of audio channels. This setting is most useful when downmixing channels (e.g., from 5.1 to stereo).
Set the sample rate of the audio. Music with a full spectrum (20 Hz — 20 kHz) requires values not lower than 44.1 kHz to achieve transparency. More info can be found on the wiki.

mpeg

MPEG (MPEG-1) is a foundational video and audio compression standard published in August 1993 by the Moving Picture Experts Group as ISO/IEC 11172. It was the first international standard for lossy compression of moving pictures and associated audio, establishing principles and techniques that would influence virtually all subsequent video codecs. MPEG-1 video achieves compression through a combination of motion-compensated prediction, discrete cosine transform coding, and variable-length entropy encoding, organized around three frame types: I-frames (intra-coded), P-frames (predicted), and B-frames (bidirectionally predicted). The standard targets bit rates around 1.5 Mbps for combined audio and video, producing quality comparable to VHS tape at SIF resolution (352x240 for NTSC). This compression level was specifically chosen to match the data throughput of 1x-speed CD-ROM drives, enabling the Video CD format that brought digital video to consumers in the early 1990s. The audio component, particularly Layer III (MP3), went on to become the most influential audio format in history. The I/P/B frame structure, motion estimation approach, and block-based transform coding established the architectural template followed by every major video codec since, from MPEG-2 through H.264 and beyond. Though long surpassed in compression efficiency, MPEG-1 remains supported by virtually all media software.
read more

spx

Speex is an open-source audio codec purpose-built for speech compression, developed by Jean-Marc Valin under the Xiph.Org Foundation. First released in October 2002, it targets voice-over-IP, conferencing, and any scenario where spoken word needs to travel efficiently over a network. SPX files wrap Speex-encoded audio inside an Ogg container, pairing the codec's speech optimization with Ogg's streaming capabilities. Three sampling rates are supported — narrowband at 8 kHz, wideband at 16 kHz, and ultra-wideband at 32 kHz — along with variable bitrate encoding that adapts in real time to speech complexity. A standout advantage is its patent-free, BSD-licensed nature, which allowed developers to embed it freely in both commercial and open-source products. Speex also bundles acoustic echo cancellation, noise suppression, and automatic gain control, features that rival codecs typically delegate to external libraries. Although its creators officially recommend Opus) as a successor since 2012, Speex remains deployed in legacy VoIP systems, archived recordings, and embedded devices where its lightweight decoder footprint is still valued.
read more
Facebook Amazon Microsoft Tesla Nestle Walmart L'Oreal

Speech Compression

SPX produces remarkably small files for voice content. Extract dialogue from MPEG with minimal storage footprint.

Voice Extraction

Pull speech audio from MPEG video and encode it with Speex — a codec engineered specifically for human voice quality.

Online Processing

No need to install Speex encoders locally. Our servers handle the MPEG to SPX conversion entirely in the cloud.

How to convert MPEG to SPX

1

Select files from Computer, Google Drive, Dropbox, URL or by dragging it on the page.

2

Choose spx or any other format you need as a result (more than 200 formats supported)

3

Let the file convert and you can download your spx file right afterwards

About formats

MPEG (MPEG-1) is a foundational video and audio compression standard published in August 1993 by the Moving Picture Experts Group as ISO/IEC 11172. It was the first international standard for lossy compression of moving pictures and associated audio, establishing principles and techniques that would influence virtually all subsequent video codecs. MPEG-1 video achieves compression through a combination of motion-compensated prediction, discrete cosine transform coding, and variable-length entropy encoding, organized around three frame types: I-frames (intra-coded), P-frames (predicted), and B-frames (bidirectionally predicted). The standard targets bit rates around 1.5 Mbps for combined audio and video, producing quality comparable to VHS tape at SIF resolution (352x240 for NTSC). This compression level was specifically chosen to match the data throughput of 1x-speed CD-ROM drives, enabling the Video CD format that brought digital video to consumers in the early 1990s. The audio component, particularly Layer III (MP3), went on to become the most influential audio format in history. The I/P/B frame structure, motion estimation approach, and block-based transform coding established the architectural template followed by every major video codec since, from MPEG-2 through H.264 and beyond. Though long surpassed in compression efficiency, MPEG-1 remains supported by virtually all media software.
Initial release: August 1993
Speex is an open-source audio codec purpose-built for speech compression, developed by Jean-Marc Valin under the Xiph.Org Foundation. First released in October 2002, it targets voice-over-IP, conferencing, and any scenario where spoken word needs to travel efficiently over a network. SPX files wrap Speex-encoded audio inside an Ogg container, pairing the codec's speech optimization with Ogg's streaming capabilities. Three sampling rates are supported — narrowband at 8 kHz, wideband at 16 kHz, and ultra-wideband at 32 kHz — along with variable bitrate encoding that adapts in real time to speech complexity. A standout advantage is its patent-free, BSD-licensed nature, which allowed developers to embed it freely in both commercial and open-source products. Speex also bundles acoustic echo cancellation, noise suppression, and automatic gain control, features that rival codecs typically delegate to external libraries. Although its creators officially recommend Opus) as a successor since 2012, Speex remains deployed in legacy VoIP systems, archived recordings, and embedded devices where its lightweight decoder footprint is still valued.
Initial release: October 15, 2002

Frequently Asked Questions

Why convert MPEG to SPX?

Speex is purpose-built for speech compression. Extract dialogue from MPEG and get tiny files perfect for voice-over-IP and archival.

What opens SPX files?

VLC, Audacity, and Speex-compatible VoIP applications handle SPX. It integrates with Ogg-based audio toolchains.

Is SPX good for music?

No — Speex focuses on the speech frequency range. Music content loses detail due to the narrow bandwidth compression.

How compact are SPX files?

Extremely compact for speech. Speex achieves intelligible voice at very low bitrates, producing tiny files.

Is Speex still maintained?

Speex has been largely succeeded by Opus, but it remains in use in legacy VoIP and embedded systems.

MPEG to SPX Quality Rating

3.8 (5 votes)
You need to convert and download at least 1 file to provide feedback!