MIT OpenCourseWare
  • OCW home
  • Course List
  • about OCW
  • Help
  • Feedback
  • Support MIT OCW

Lab Database

Media player software, such as QuickTime® Player, RealOne™ Player, or Windows Media® Player, is required to run the .wav files in this section. RealOne™ Player software is required to run the .ram files in this section. MATLAB® software is required to view and run the .mat files in this section. File decompression software, such as Winzip® or StuffIt®, is required to open the .tar and .zip files in this section.

This page contains digitized media files for use in the labs. Most of the files are audio data, but there are also image files and other types of data. For some of the labs, you are asked to record your own voice. If you do not have the tools for making such recordings, example utterances are provided as spoken by both a female speaker and a male speaker. The audio files are given below under the heading of the relevant lab.

Lab Data

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15

Lab 1: Recording Speech in a Sound-Treated Room, and Spectral Analysis and Waveform Editing by Computer

In Lab 1, you will record a database of speech utterances for use throughout the semester. Below are links to sample databases recorded by two speakers, one female (HH) and one male (KS).

Note that although these files have a .wav suffix, they are in Klatt format, for use with the Klatt speech-analysis tools (xkl); they are NOT Microsoft .wav format. (If there is a desire to use these files with analysis tools that require MS .wav format, we may consider providing both formats in the future.)

You can download tar archives of an entire database here:

female sample database (TAR - 4 MB)
male sample database (TAR - 4 MB)

Individual files can be downloaded by clicking on the links below:

  1. English vowels

    Female Male
    1 beat bit beat bit
    2 bait bet bait bet
    3 pot but pot but
    4 boat bought boat bought
    5 put boot put boot
    6 bat Bert bat Bert
    7 buy boy buy boy
    8 about   about  


  2. English consonants
    Female Male
    9 the putt the bug the putt the bug
    10 the tug the duck the tug the duck
    11 the cut the gut the cut the gut
    12 the mug the nut the mug the nut
    13 the fuss the vanilla the fuss the vanilla
    14 the thug the "the" the thug the "the"
    15 the supper the shutter the supper the shutter
    16 the rug the lug the rug the lug
    17 the wonder the young the wonder the young
    18 the chump the jump the chump the jump
    19 the hut sung a song the hut sung a song
    20 the /zah/ the /zhah/ the /zah/ the /zhah/


  3. Sentences
    Female Male
    21 teacher teacher
    22 He wanted to rebel He wanted to rebel
    23 school school
    24 high school teacher high school teacher
    25 He stopped the rebel He stopped the rebel
    26 school teacher school teacher
    27 tall school teacher tall school teacher
    28 Was he a high school teacher? Was he a high school teacher?
    29 The tall school teacher left early. The tall school teacher left early.
    30 teach teach
    31 We live in Cambridge and we study at MIT We live in Cambridge and we study at MIT
    32 He won those shoes. He won those shoes.


  4. Rainbow passage, female
    Rainbow passage, male


  5. Spontaneous speech, female
    Spontaneous speech, male
[top]

Lab 2: Spectrographic and Spectrum Analysis of the English Vowel System

You will use the following utterances recorded during Lab 1:

  • The word "shutter"
  • The simple vowel pairs (Items 1-7)
[top]

Lab 3: Broadband Spectral Analysis of Sonorant Consonants

You will use at least one of the following subsets of utterances recorded during Lab 1:

  • The words "mug", "nut", and "sung"
  • The words "rug" and "lug"
  • The words "wonder" and "young"
[top]

Lab 4: Spectral Analysis of Stop and Fricative Consonants: Sound Generation from Turbulence in the Vocal Tract

You will use at least one of the following subsets of utterances recorded during Lab 1:

  • The words "bug", "duck", "gut", "putt", "tug", "cut", "hut"
  • The words "fuss", "thug", "supper", "shutter", "vanilla", "the", /zschwa/, /zhschwa/
  • The words "chump", "jump", "tug", "duck", "shutter", /zhschwa/
[top]

Lab 5: Sound Generation at the Larynx: Characteristics of the Glottal Source

  • Part A: You will use the words "bat" and "pat" recorded by five speakers. These utterances have already been recorded and stored in the labc account. They can also be downloaded below.
  • Parts B-C: You will record the words "pat", "pad", and "bad" during the lab. If you are not able to make new recordings, you can download samples recorded by a female speaker (HH) below.

    Download a tar/zip archive of the utterances (xkl format) required for this lab (TAR)

[top]

Lab 6: Formant Frequency Calculations from Area Function Data

There are no utterances required for this lab.

[top]

Lab 7: Sound Segment Reduction

You will use the spontaneous speech recorded during Lab 1.

If you do not have your own recordings, you can download speech samples that contain clear renditions of several words along with renditions of the same words as produced with reduced segments. The tar file will contain samples from the same male and female speakers (KS and HH) that recorded the speech samples for Lab 1.

Download the Lab 7 examples (xkl format) here. (TAR)

[top]

Lab 8: Prosody: Acoustic Evidence for Prosodic Constituent Structure

You will use the following utterances, produced by four speakers and stored in the labc account:

  • Part A: Please say Maine or Duke and Rice will play.
  • Part B: Please say Thomas or Kenneth and Lucas will stay.

Each sentence was produced with two different prosodic forms, and each prosodic form was produced twice by each speaker, for a total of 32 utterances. The utterances are available in two formats, Xwaves (.sd and .f0 files) and Xkl (Klatt .wav files).

Download a tar/zip archive of the utterances required for this lab (TAR - 12 MB).

[top]

Lab 9: Intonation Labelling OR Factors Affecting Duration

You will use either the utterances for Part A or the utterances for Part B:

  • Part A:
    1) Utterances 3-10 in files lcorn.16k (Xwaves format), located in the labc account (Download files lcorn.16k (LCORN) and lcorn.f0 (LCORN) here)
    2) Utterances from Lab 8, Part A
  • Part B:
    The following words, recorded by two speakers: "fat", "fad", "fade", "cup", "cub", "coop", "pick", "pig", "peak", "men", "main', "mate", "tuck", "tug", "take". These files are in two formats, Xwaves (.sd files) and Xkl (Klatt .wav). (Download a tar/zip archive of the utterances here (TAR - 15 MB).)
[top]

Lab 10: Speech Intelligibility and Confusion Matrices

In this lab, you will participate as a subject in two types of intelligibility tests. If you attend the lab, you will hear the stimuli on audiotape. If you do not attend class, you can download audio files of the tests below. In order to take run the tests on your own, you will need to download the response and answers sheets from the Lab 10 page.

  • Consonant-vowel (CV) nonsense syllables: 10 dB SNR (RAM) | 5 dB SNR (RAM) | 0 dB SNR (RAM
  • Sentences in noise (Spin test): Introduction (RAM) | Practice Items (RAM) | 0db SBR (RAM) | 10db SBR (RAM

A third type of test is available, although it is not presented in class. You will need to download the response and answers sheets from the Lab 10 page.

  • Consonant-vowel-consonant (CVC) nonsense syllables: 0 db SNR (RAM
[top]

Lab 11: Introduction to Speech Disorders

You will use utterances produced by adults and children with several types of speech disorders, all in Xkl (Klatt .wav ) format:

  • Part 1: Utterances produced by children who misarticulate certain sounds and utterances produced by normal children

    Download a tar/zip archive for Part 1 here (TAR)

  • Part 2: Utterances produced by deaf children and one normal-hearing adult

    Download a tar/zip archive for Part 2 here (TAR)

  • Part 3: Utterances produced by adults with dysarthria and normal-speaking adults
    Download a tar/zip archive for Part 3 here (TAR)
[top]

Lab 12: Interpretation of Cineradiographic Motion Pictures of Selected Utterances

In this lab you will use .gif files of cineradiographic images. Because the full database of these images is rather large, it will only be available to students attending the lab. Sample .gif files can be downloaded here:

Download file KNS_6400995.gif (GIF), /a/ in "he'tat"
Download file KNS_6401423.gif (GIF), /a/ in "soggy"
Download file KNS_6400919.gif (GIF), /u/ in "he'tu"
Download file KNS_6401621.gif (GIF), /u/ in "shoes"

[top]

Lab 13: Speech Synthesis Using a Formant Synthesizer

For this lab, students will record new utterances. If you are unable to record new data, you can use the sample utterances provided for Lab 5, part B (either "pat" or "bad").

[top]

Lab 14: Measuring Speech Movements Using Data from X-ray Microbeam System

In this lab, you will use data recorded by an x-ray microbeam system. There are data for two speakers and they are in MATLAB format (.mat files):

Microbeam data for speaker JW16 (ZIP - 1.7 MB) (The ZIP file contains four .mat files)
Microbeam data for speaker JW18 (ZIP - 1.8 MB) (The ZIP file contains four .mat files)

[top]

Lab 15: Higher-Level Synthesis with a Formant Synthesizer, Using Quasi-Articulatory Parameters

There are no utterances required for this lab.

[top]