VOC to NIST Converter

Save Sound Blaster VOC recordings in NIST format

Drop files here. 1 GB maximum file size or Sign Up
to
Facebook Amazon Microsoft Tesla Nestle Walmart L'Oreal

Research Integration

NIST is the standard container for speech corpora. Converting VOC to NIST integrates Sound Blaster recordings into research pipelines.

Bulk Processing

Build speech corpora efficiently — upload multiple VOC files and produce NIST outputs for all of them in a single session.

Private and Secure

Research audio often contains sensitive speech data. Uploaded VOC files are deleted immediately, NIST outputs within 24 hours.

How to convert VOC to NIST

1

Select files from Computer, Google Drive, Dropbox, URL or by dragging it on the page.

2

Choose nist or any other format you need as a result (more than 200 formats supported)

3

Let the file convert and you can download your nist file right afterwards

About formats

VOC (Creative Voice) is a digital audio container developed by Creative Technology and introduced alongside the original Sound Blaster card in 1989. It served as the native audio format for the Sound Blaster family during the DOS era, when Creative's hardware dominated PC audio. VOC files are block-based: each file consists of typed data blocks that can carry 8-bit unsigned PCM, 4-bit and 2.6-bit Creative ADPCM, 16-bit signed PCM, as well as A-law and mu-law encoded audio. This block structure also supports silence intervals, repeat loops, and marker points, giving game developers fine-grained control over sound playback. A notable advantage was hardware-level decoding — Sound Blaster cards could play VOC data directly via DMA transfer, freeing the CPU for other tasks in an era when processor cycles were precious. The format saw extensive use in DOS games from id Software, Sierra, and LucasArts. With the rise of Windows and the WAV format, VOC gradually fell out of mainstream use, yet it remains important for retro gaming preservation and for anyone working with vintage PC audio archives.
Initial release: 1989
NIST SPHERE (SPeech HEader REsources) is a specialized audio file format created by the National Institute of Standards and Technology for speech research, particularly projects funded by DARPA. The format wraps raw audio samples with a structured ASCII header encoding metadata such as sample rate, channel count, encoding type, speaker demographics, and transcription annotations — making it ideal for distributing speech corpora. NIST files typically store uncompressed PCM or mu-law audio at telephone-quality sample rates (8 kHz or 16 kHz), though the container is flexible enough to hold various encodings. A key advantage is the rich self-documenting header that lets researchers embed detailed corpus metadata directly in the file, eliminating sidecar files. SPHERE has also become the de facto standard for major speech databases like TIMIT, Switchboard, and the Fisher corpus, ensuring broad recognition across academic and government labs. The open specification and availability of command-line tools (sphere, h_strip, w_decode) make it straightforward to convert, inspect, and process these files programmatically in speech processing pipelines.
Initial release: 1990

Frequently Asked Questions

Why convert VOC to NIST?

NIST stores audio with metadata headers for speech research. It integrates legacy audio into academic linguistic research workflows.

What can open NIST files?

The NIST SPHERE toolkit and SoX read NIST. Speech recognition frameworks like Kaldi and HTK also support this format natively.

How is NIST different from WAV?

NIST uses a text-based header with rich metadata fields — far more descriptive than the binary header used by WAV files.

Is NIST format used in Kaldi?

Yes. Kaldi, one of the most popular speech recognition toolkits, reads NIST SPHERE files directly for training and decoding.

Can I batch-convert multiple VOC files?

Upload all your VOC recordings and convert them to NIST simultaneously — efficient when building a speech corpus from multiple sources.