MPG to NIST Converter

Extract NIST audio from MPG for speech science online

Drop files here. 1 GB maximum file size or Sign Up
to
Facebook Amazon Microsoft Tesla Nestle Walmart L'Oreal

Official Standard

NIST format is used in official speech recognition benchmarks. Extract MPG audio in the format research competitions require.

Remote Processing

Processing runs on our servers — no NIST tools needed on your local machine.

Secure Extraction

Uploaded MPG files are deleted immediately. NIST outputs are removed from servers within 24 hours.

How to convert MPG to NIST

1

Select files from Computer, Google Drive, Dropbox, URL or by dragging it on the page.

2

Choose nist or any other format you need as a result (more than 200 formats supported)

3

Let the file convert and you can download your nist file right afterwards

About formats

MPG is a common file extension for video files encoded using the MPEG-1 or MPEG-2 compression standards, developed by the Moving Picture Experts Group. The three-character extension originated from early Windows and DOS file systems that restricted extensions to three characters, providing a shorthand for the longer MPEG designation. MPG files contain MPEG program streams that multiplex one video and one or more audio elementary streams into a unified byte stream with synchronization timestamps. The format was widely used throughout the 1990s and 2000s for storing digital video on personal computers, appearing in everything from Video CD rips and DVD extractions to digital TV recordings captured with hardware encoder cards. MPG files using MPEG-1 compression typically contain 352x240 (NTSC) or 352x288 (PAL) video at bit rates around 1.5 Mbps, while MPEG-2 encoded MPG files support higher resolutions up to full HD. The program stream structure assumes a relatively reliable storage medium, unlike the transport stream variant designed for broadcast, making it efficient for file-based playback without the overhead of error recovery packets. Broad compatibility is one of the enduring strengths of the format, as virtually every media player across all operating systems can decode these files without additional codec installation. MPG continues to be encountered in archived video content, surveillance recordings, and legacy digital video workflows.
Initial release: August 1993
NIST SPHERE (SPeech HEader REsources) is a specialized audio file format created by the National Institute of Standards and Technology for speech research, particularly projects funded by DARPA. The format wraps raw audio samples with a structured ASCII header encoding metadata such as sample rate, channel count, encoding type, speaker demographics, and transcription annotations — making it ideal for distributing speech corpora. NIST files typically store uncompressed PCM or mu-law audio at telephone-quality sample rates (8 kHz or 16 kHz), though the container is flexible enough to hold various encodings. A key advantage is the rich self-documenting header that lets researchers embed detailed corpus metadata directly in the file, eliminating sidecar files. SPHERE has also become the de facto standard for major speech databases like TIMIT, Switchboard, and the Fisher corpus, ensuring broad recognition across academic and government labs. The open specification and availability of command-line tools (sphere, h_strip, w_decode) make it straightforward to convert, inspect, and process these files programmatically in speech processing pipelines.
Initial release: 1990

Frequently Asked Questions

Why convert MPG to NIST?

NIST is the format used by the National Institute of Standards and Technology for speech evaluation. Converting prepares audio for official benchmarks.

How is NIST different from SPH?

NIST and SPH (Sphere) are closely related — both use the NIST Sphere format. The distinction is often just in file extension conventions.

What tools read NIST files?

NIST speech tools, sph2pipe, Kaldi, and HTK all process NIST-format audio for speech recognition and analysis.

Is NIST suitable for music?

No — NIST is designed for speech research. For music, choose FLAC, MP3, or WAV instead.

Can I batch convert?

Upload several MPG files and extract NIST audio from each in one session — build research datasets efficiently.