CAVS to SPX Converter

Pull the soundtrack from CAVS videos into SPX format

Drop files here. 1 GB maximum file size or Sign Up
to

Settings

Set the overall output Speex audio bitrate. Designed for human speech encoding, Speex reaches transparency at ultra-low bitrate with a maximum bitrate of 44 kbps.
Set the number of audio channels. This setting is most useful when downmixing channels (e.g., from 5.1 to stereo).
Set the sample rate of the audio. Music with a full spectrum (20 Hz — 20 kHz) requires values not lower than 44.1 kHz to achieve transparency. More info can be found on the wiki.

cavs

CAVS (Chinese Audio Video Standard) is a video compression standard developed by the Audio Video Coding Standard Workgroup of China and adopted as a national standard (GB/T 20090.2) in February 2006. The project began in 2002 with the aim of creating an independent compression technology that could serve the massive broadcasting and multimedia infrastructure in China without relying on foreign-licensed codecs. CAVS, also referred to as AVS1, achieves compression efficiency comparable to H.264/AVC while utilizing a simpler patent framework with significantly lower licensing costs. The standard supports video resolutions from standard definition up to high definition, making it suitable for both terrestrial digital television broadcasting and broadband streaming. Key technical features include 8x8 block transforms, multiple prediction modes, and a loop filter designed to reduce blocking artifacts at low bit rates. The Chinese government endorsed CAVS as the mandatory compression standard for the national digital TV broadcasting system, ensuring broad deployment across set-top boxes and television receivers in the country. While CAVS has limited international adoption compared to H.264 or HEVC, its significance lies in serving one of the largest media markets in the world and demonstrating a viable national alternative to globally dominant video coding standards.
read more

spx

Speex is an open-source audio codec purpose-built for speech compression, developed by Jean-Marc Valin under the Xiph.Org Foundation. First released in October 2002, it targets voice-over-IP, conferencing, and any scenario where spoken word needs to travel efficiently over a network. SPX files wrap Speex-encoded audio inside an Ogg container, pairing the codec's speech optimization with Ogg's streaming capabilities. Three sampling rates are supported — narrowband at 8 kHz, wideband at 16 kHz, and ultra-wideband at 32 kHz — along with variable bitrate encoding that adapts in real time to speech complexity. A standout advantage is its patent-free, BSD-licensed nature, which allowed developers to embed it freely in both commercial and open-source products. Speex also bundles acoustic echo cancellation, noise suppression, and automatic gain control, features that rival codecs typically delegate to external libraries. Although its creators officially recommend Opus as a successor since 2012, Speex remains deployed in legacy VoIP systems, archived recordings, and embedded devices where its lightweight decoder footprint is still valued.
read more
Facebook Amazon Microsoft Tesla Nestle Walmart L'Oreal

Clean Audio Output

The SPX audio extracted from your CAVS video preserves the original sound quality. Adjust bitrate for the best possible result.

Adjustable Settings

Fine-tune audio parameters — codec, bitrate, and quality — before converting to tailor the output precisely.

Cloud Conversion

Processing runs entirely in the cloud, so your computer or phone does none of the heavy lifting. Just upload and download.

How to convert CAVS to SPX

1

Select files from Computer, Google Drive, Dropbox, URL or by dragging it on the page.

2

Choose spx or any other format you need as a result (more than 200 formats supported)

3

Let the file convert and you can download your spx file right afterwards

About formats

CAVS (Chinese Audio Video Standard) is a video compression standard developed by the Audio Video Coding Standard Workgroup of China and adopted as a national standard (GB/T 20090.2) in February 2006. The project began in 2002 with the aim of creating an independent compression technology that could serve the massive broadcasting and multimedia infrastructure in China without relying on foreign-licensed codecs. CAVS, also referred to as AVS1, achieves compression efficiency comparable to H.264/AVC while utilizing a simpler patent framework with significantly lower licensing costs. The standard supports video resolutions from standard definition up to high definition, making it suitable for both terrestrial digital television broadcasting and broadband streaming. Key technical features include 8x8 block transforms, multiple prediction modes, and a loop filter designed to reduce blocking artifacts at low bit rates. The Chinese government endorsed CAVS as the mandatory compression standard for the national digital TV broadcasting system, ensuring broad deployment across set-top boxes and television receivers in the country. While CAVS has limited international adoption compared to H.264 or HEVC, its significance lies in serving one of the largest media markets in the world and demonstrating a viable national alternative to globally dominant video coding standards.
Initial release: February 2006
Speex is an open-source audio codec purpose-built for speech compression, developed by Jean-Marc Valin under the Xiph.Org Foundation. First released in October 2002, it targets voice-over-IP, conferencing, and any scenario where spoken word needs to travel efficiently over a network. SPX files wrap Speex-encoded audio inside an Ogg container, pairing the codec's speech optimization with Ogg's streaming capabilities. Three sampling rates are supported — narrowband at 8 kHz, wideband at 16 kHz, and ultra-wideband at 32 kHz — along with variable bitrate encoding that adapts in real time to speech complexity. A standout advantage is its patent-free, BSD-licensed nature, which allowed developers to embed it freely in both commercial and open-source products. Speex also bundles acoustic echo cancellation, noise suppression, and automatic gain control, features that rival codecs typically delegate to external libraries. Although its creators officially recommend Opus as a successor since 2012, Speex remains deployed in legacy VoIP systems, archived recordings, and embedded devices where its lightweight decoder footprint is still valued.
Initial release: October 15, 2002

Frequently Asked Questions

Why convert CAVS to SPX?

Extracting audio from a CAVS video into SPX lets you keep just the soundtrack — ideal for listening without the video overhead.

What program opens SPX files?

VLC, Audacity, and VoIP applications handle Speex audio optimized for speech compression.

Is registration necessary?

No. Basic conversions work without an account. Signing up is optional and provides access to extended features and larger uploads.

Will the audio quality match the original?

You can set the output bitrate to match or exceed the original audio quality. Higher settings preserve more detail from the CAVS source.

How fast is the audio extraction?

Audio extraction is quicker than full video conversion since only the sound track is processed. Most files are done within seconds.

Can I choose the audio bitrate?

Yes. Adjust the bitrate, sample rate, and channel count before converting to get the SPX quality that suits your listening needs.