HTK to VOC Converter

Transcode HTK audio to Sound Blaster VOC format online

Drop files here. 1 GB maximum file size or Sign Up
to

Settings

The codec to encode the audio track. Codec "Without reencoding" copies the audio stream from the input file into output without re-encoding if possible.
Set the number of audio channels. This setting is most useful when downmixing channels (e.g., from 5.1 to stereo).
Set the sample rate of the audio. Music with a full spectrum (20 Hz — 20 kHz) requires values not lower than 44.1 kHz to achieve transparency. More info can be found on the wiki.

htk

HTK is the native waveform container for the Hidden Markov Model Toolkit, a software suite developed at Cambridge University's Engineering Department for speech recognition research. First distributed in 1993, HTK rapidly became a reference platform in computational linguistics labs worldwide, and its file format followed suit. Each file stores a sequence of parameter vectors or raw samples prefixed by a 12-byte header specifying the number of frames, the frame period in 100 ns units, the byte count per frame, and a type code indicating the data kind — options range from waveform PCM to Mel-frequency cepstral coefficients and filter-bank energies. This versatility lets a single container carry both source audio and extracted features without changing parsers. The deliberately minimal header avoids alignment padding or optional chunks, making the format trivial to read from C, Python, or MATLAB with a few lines of binary I/O. Three advantages underpin HTK's lasting relevance: tight integration with the HTK training and recognition pipeline, deterministic byte layout that eliminates parser ambiguity, and widespread adoption in academic corpora.
read more

voc

VOC (Creative Voice) is a digital audio container developed by Creative Technology and introduced alongside the original Sound Blaster card in 1989. It served as the native audio format for the Sound Blaster family during the DOS era, when Creative's hardware dominated PC audio. VOC files are block-based: each file consists of typed data blocks that can carry 8-bit unsigned PCM, 4-bit and 2.6-bit Creative ADPCM, 16-bit signed PCM, as well as A-law and mu-law encoded audio. This block structure also supports silence intervals, repeat loops, and marker points, giving game developers fine-grained control over sound playback. A notable advantage was hardware-level decoding — Sound Blaster cards could play VOC data directly via DMA transfer, freeing the CPU for other tasks in an era when processor cycles were precious. The format saw extensive use in DOS games from id Software, Sierra, and LucasArts. With the rise of Windows and the WAV format, VOC gradually fell out of mainstream use, yet it remains important for retro gaming preservation and for anyone working with vintage PC audio archives.
read more
Facebook Amazon Microsoft Tesla Nestle Walmart L'Oreal

HTK to VOC Bridge

Transform HTK recordings into VOC — bringing research audio into a format with real-world usability.

Cloud Processing

No audio tools required locally. Upload HTK, get VOC back — all processing runs on our cloud infrastructure.

Quality Output

VOC delivers excellent audio quality at efficient file sizes — a modern upgrade for your HTK recordings.

How to convert HTK to VOC

1

Select files from Computer, Google Drive, Dropbox, URL or by dragging it on the page.

2

Choose voc or any other format you need as a result (more than 200 formats supported)

3

Let the file convert and you can download your voc file right afterwards

About formats

HTK is the native waveform container for the Hidden Markov Model Toolkit, a software suite developed at Cambridge University's Engineering Department for speech recognition research. First distributed in 1993, HTK rapidly became a reference platform in computational linguistics labs worldwide, and its file format followed suit. Each file stores a sequence of parameter vectors or raw samples prefixed by a 12-byte header specifying the number of frames, the frame period in 100 ns units, the byte count per frame, and a type code indicating the data kind — options range from waveform PCM to Mel-frequency cepstral coefficients and filter-bank energies. This versatility lets a single container carry both source audio and extracted features without changing parsers. The deliberately minimal header avoids alignment padding or optional chunks, making the format trivial to read from C, Python, or MATLAB with a few lines of binary I/O. Three advantages underpin HTK's lasting relevance: tight integration with the HTK training and recognition pipeline, deterministic byte layout that eliminates parser ambiguity, and widespread adoption in academic corpora.
Initial release: 1993
VOC (Creative Voice) is a digital audio container developed by Creative Technology and introduced alongside the original Sound Blaster card in 1989. It served as the native audio format for the Sound Blaster family during the DOS era, when Creative's hardware dominated PC audio. VOC files are block-based: each file consists of typed data blocks that can carry 8-bit unsigned PCM, 4-bit and 2.6-bit Creative ADPCM, 16-bit signed PCM, as well as A-law and mu-law encoded audio. This block structure also supports silence intervals, repeat loops, and marker points, giving game developers fine-grained control over sound playback. A notable advantage was hardware-level decoding — Sound Blaster cards could play VOC data directly via DMA transfer, freeing the CPU for other tasks in an era when processor cycles were precious. The format saw extensive use in DOS games from id Software, Sierra, and LucasArts. With the rise of Windows and the WAV format, VOC gradually fell out of mainstream use, yet it remains important for retro gaming preservation and for anyone working with vintage PC audio archives.
Initial release: 1989

Frequently Asked Questions

Why convert HTK to VOC?

HTK is limited to speech research tools. VOC provides DOS-era PC audio that works with standard media players and applications.

What applications open VOC files?

DOSBox, SOX, and retro computing emulators can handle VOC files. Most are available as free downloads for major operating systems.

How is the VOC audio quality?

VOC provides good quality at standard settings. The output clarity depends on the original HTK recording quality.

How fast is the conversion?

Processing is fast — HTK files are lightweight and VOC encoding completes in seconds on our server hardware.

Are my files kept private?

Uploaded HTK files are deleted immediately after conversion. VOC results are automatically erased from our servers within 24 hours.