HCOM to SPH Converter

Encode HCOM audio as NIST SPH speech format online

Drop files here. 1 GB maximum file size or Sign Up
to
Facebook Amazon Microsoft Tesla Nestle Walmart L'Oreal

Speech Research Standard

Convert HCOM into SPH — the standard format for NIST speech corpora, linguistic research, and speech recognition training data.

Cloud-Based

No NIST tools needed locally. Convert HCOM to SPH directly from your browser on our cloud servers.

Automatic Cleanup

HCOM uploads are deleted after processing. SPH files are cleaned from servers within 24 hours.

How to convert HCOM to SPH

1

Select files from Computer, Google Drive, Dropbox, URL or by dragging it on the page.

2

Choose sph or any other format you need as a result (more than 200 formats supported)

3

Let the file convert and you can download your sph file right afterwards

About formats

HCOM is a Huffman-coded audio format from the early Macintosh era, designed to shrink digitized sound for distribution on floppy disks and bulletin board systems when storage was precious and modems were slow. The encoder takes 8-bit unsigned PCM input, computes a frequency table of sample-delta values, and builds an optimal Huffman tree that replaces common deltas with short bit sequences. Compression ratios of 2:1 or better were typical for speech recordings, a meaningful saving when a 3.5-inch floppy held only 800 KB. Files were distributed as Macintosh resource forks and played through utilities like SoundApp and the BinHex ecosystem that defined Mac software exchange in the late 1980s. The format supported sample rates up to 22.255 kHz, matching the output capabilities of original Macintosh sound hardware. Tools such as SoX retain HCOM decoding support, ensuring that archived recordings remain accessible decades later. HCOM holds three practical advantages for preservation work: lossless compression that recovers the original samples exactly, a self-contained Huffman table embedded in each file for dependency-free decoding, and historical prevalence across thousands of vintage Mac sound archives.
Developer: Apple Computer
Initial release: 1985
SPH is the file extension for audio stored in the NIST SPHERE (SPeech HEader REsources) format, a standard created by the U.S. National Institute of Standards and Technology around 1990. Built for speech research, SPH files carry a 1024-byte ASCII header packed with metadata — database identifiers, channel counts, sample rates, byte ordering, and compression type — making every recording self-describing. The underlying audio is typically 16-bit linear PCM sampled at 16 kHz, though other configurations are permitted. Researchers at NIST, DARPA, and universities worldwide rely on SPH for distributing speech corpora such as TIMIT, Switchboard, and the LDC collections that underpin modern automatic speech recognition systems. A key advantage is that the human-readable header lets scripts parse recording metadata without binary decoding. The format's strict standardization also eliminates ambiguity when sharing datasets across institutions and platforms. Because SPH files store uncompressed PCM, they preserve full audio fidelity — critical when training acoustic models where even small artifacts can skew results.
Initial release: 1990

Frequently Asked Questions

What is SPH?

SPH (SPHERE) is the SPeech HEader Resources format created by NIST for speech research. It is the standard for linguistic corpora worldwide.

Why convert HCOM to SPH?

For speech research projects that need audio in NIST SPHERE format. Converting HCOM enables analysis with standard linguistic tools.

What tools use SPH?

HTK, Kaldi, NIST scoring tools, and many speech recognition research frameworks support SPH format natively.

Is SPH just PCM with headers?

Essentially yes. SPH wraps PCM audio with a text-based header containing metadata about the recording conditions and content.

Is the conversion private?

HCOM uploads are erased after processing. SPH results are deleted from servers within 24 hours.