WVE to HTK Converter

Transform Psion WVE audio into HTK research format

Drop files here. 1 GB maximum file size or Sign Up
to
Facebook Amazon Microsoft Tesla Nestle Walmart L'Oreal

PDA Audio Rescued

Extract audio from legacy Psion WVE files and convert to HTK — make vintage PDA recordings accessible in a supported format.

No PsiWin Needed

Convert WVE files without PsiWin or SoX. The entire process runs in your web browser on any operating system.

Secure Processing

Uploaded WVE files are deleted immediately after conversion. Output files are purged from our servers within 24 hours.

How to convert WVE to HTK

1

Select files from Computer, Google Drive, Dropbox, URL or by dragging it on the page.

2

Choose htk or any other format you need as a result (more than 200 formats supported)

3

Let the file convert and you can download your htk file right afterwards

About formats

WVE is the audio format native to the Psion Series 3 family of personal digital assistants, released by British company Psion PLC beginning in September 1991. These clamshell PDAs included a built-in voice recorder, and all dictation functionality relied on WVE files to store captured sound. Each file begins with the ASCII signature "ALawSoundFile**" followed by a minimal header, then raw A-law encoded audio sampled at 8 kHz — a rate inherited from digital telephony standards. At 8000 bytes per second, a one-minute recording occupies just 480 KB, which was essential given that Psion devices stored data on SRAM cards typically ranging from 128 KB to 2 MB. The A-law encoding provides reasonable speech clarity within these tight storage constraints, prioritizing intelligibility over high-fidelity reproduction. WVE files can be converted to WAV or other modern formats using SoX, Awave Studio, or specialized Psion file utilities. While the format is firmly a product of early-1990s handheld computing, it holds historical significance as one of the first audio recording formats designed for pocket-sized consumer devices. Collectors and researchers studying mobile computing history occasionally encounter WVE files when recovering data from legacy SRAM media.
Developer: Psion PLC
Initial release: 1991
HTK is the native waveform container for the Hidden Markov Model Toolkit, a software suite developed at Cambridge University's Engineering Department for speech recognition research. First distributed in 1993, HTK rapidly became a reference platform in computational linguistics labs worldwide, and its file format followed suit. Each file stores a sequence of parameter vectors or raw samples prefixed by a 12-byte header specifying the number of frames, the frame period in 100 ns units, the byte count per frame, and a type code indicating the data kind — options range from waveform PCM to Mel-frequency cepstral coefficients and filter-bank energies. This versatility lets a single container carry both source audio and extracted features without changing parsers. The deliberately minimal header avoids alignment padding or optional chunks, making the format trivial to read from C, Python, or MATLAB with a few lines of binary I/O. Three advantages underpin HTK's lasting relevance: tight integration with the HTK training and recognition pipeline, deterministic byte layout that eliminates parser ambiguity, and widespread adoption in academic corpora.
Initial release: 1993

Frequently Asked Questions

Why convert WVE to HTK?

HTK is for speech recognition research. Converting WVE speech data feeds into ML training pipelines.

What can open HTK files?

The HTK toolkit and SoX read HTK research files.

What is the WVE format?

WVE is the native audio format of Psion PDA devices (Series 3, 5, Revo). It stores 8-bit A-law encoded audio — a legacy from the EPOC operating system.

Can modern systems play WVE?

SoX and PsiWin on Windows can process WVE files. Standard media players do not support it — conversion is the easiest path to playback.

Can I convert multiple WVE files?

Yes. Upload several Psion recordings and batch-convert them all at once — efficient for archiving an entire PDA audio library.