» MIT OpenCourseWare » Electrical Engineering and Computer Science » Laboratory on the Physiology, Acoustics, and Perception of Speech, Fall 2001

Lab Database

Media player software, such as QuickTime® Player, RealOne™ Player, or Windows Media® Player, is required to run the .wav files in this section. RealOne™ Player software is required to run the .ram files in this section. MATLAB® software is required to view and run the .mat files in this section. File decompression software, such as Winzip® or StuffIt®, is required to open the .tar and .zip files in this section.

This page contains digitized media files for use in the labs. Most of the files are audio data, but there are also image files and other types of data. For some of the labs, you are asked to record your own voice. If you do not have the tools for making such recordings, example utterances are provided as spoken by both a female speaker and a male speaker. The audio files are given below under the heading of the relevant lab.

Lab Data

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15

Lab 1: Recording Speech in a Sound-Treated Room, and Spectral Analysis and Waveform Editing by Computer

In Lab 1, you will record a database of speech utterances for use throughout the semester. Below are links to sample databases recorded by two speakers, one female (HH) and one male (KS).

Note that although these files have a .wav suffix, they are in Klatt format, for use with the Klatt speech-analysis tools (xkl); they are NOT Microsoft .wav format. (If there is a desire to use these files with analysis tools that require MS .wav format, we may consider providing both formats in the future.)

You can download tar archives of an entire database here:

female sample database (TAR - 4 MB)
male sample database (TAR - 4 MB)

Individual files can be downloaded by clicking on the links below:

English vowels

	Female		Male
1	beat	bit	beat	bit
2	bait	bet	bait	bet
3	pot	but	pot	but
4	boat	bought	boat	bought
5	put	boot	put	boot
6	bat	Bert	bat	Bert
7	buy	boy	buy	boy
8	about		about

English consonants

	Female		Male
9	the putt	the bug	the putt	the bug
10	the tug	the duck	the tug	the duck
11	the cut	the gut	the cut	the gut
12	the mug	the nut	the mug	the nut
13	the fuss	the vanilla	the fuss	the vanilla
14	the thug	the "the"	the thug	the "the"
15	the supper	the shutter	the supper	the shutter
16	the rug	the lug	the rug	the lug
17	the wonder	the young	the wonder	the young
18	the chump	the jump	the chump	the jump
19	the hut	sung a song	the hut	sung a song
20	the /zah/	the /zhah/	the /zah/	the /zhah/

Sentences

	Female	Male
21	teacher	teacher
22	He wanted to rebel	He wanted to rebel
23	school	school
24	high school teacher	high school teacher
25	He stopped the rebel	He stopped the rebel
26	school teacher	school teacher
27	tall school teacher	tall school teacher
28	Was he a high school teacher?	Was he a high school teacher?
29	The tall school teacher left early.	The tall school teacher left early.
30	teach	teach
31	We live in Cambridge and we study at MIT	We live in Cambridge and we study at MIT
32	He won those shoes.	He won those shoes.

Rainbow passage, female
Rainbow passage, male
Spontaneous speech, female
Spontaneous speech, male

[top]

Lab 2: Spectrographic and Spectrum Analysis of the English Vowel System

You will use the following utterances recorded during Lab 1:

The word "shutter"
The simple vowel pairs (Items 1-7)

[top]

Lab 3: Broadband Spectral Analysis of Sonorant Consonants

You will use at least one of the following subsets of utterances recorded during Lab 1:

The words "mug", "nut", and "sung"
The words "rug" and "lug"
The words "wonder" and "young"

[top]

Lab 4: Spectral Analysis of Stop and Fricative Consonants: Sound Generation from Turbulence in the Vocal Tract

You will use at least one of the following subsets of utterances recorded during Lab 1:

The words "bug", "duck", "gut", "putt", "tug", "cut", "hut"
The words "fuss", "thug", "supper", "shutter", "vanilla", "the", /z/, //
The words "chump", "jump", "tug", "duck", "shutter", //

[top]

Lab 5: Sound Generation at the Larynx: Characteristics of the Glottal Source

Part A: You will use the words "bat" and "pat" recorded by five speakers. These utterances have already been recorded and stored in the labc account. They can also be downloaded below.
Parts B-C: You will record the words "pat", "pad", and "bad" during the lab. If you are not able to make new recordings, you can download samples recorded by a female speaker (HH) below.
Download a tar/zip archive of the utterances (xkl format) required for this lab (TAR)

[top]

Lab 6: Formant Frequency Calculations from Area Function Data

There are no utterances required for this lab.

[top]

Lab 7: Sound Segment Reduction

You will use the spontaneous speech recorded during Lab 1.

If you do not have your own recordings, you can download speech samples that contain clear renditions of several words along with renditions of the same words as produced with reduced segments. The tar file will contain samples from the same male and female speakers (KS and HH) that recorded the speech samples for Lab 1.

Download the Lab 7 examples (xkl format) here. (TAR)

[top]

Lab 8: Prosody: Acoustic Evidence for Prosodic Constituent Structure

You will use the following utterances, produced by four speakers and stored in the labc account:

Part A: Please say Maine or Duke and Rice will play.
Part B: Please say Thomas or Kenneth and Lucas will stay.

Each sentence was produced with two different prosodic forms, and each prosodic form was produced twice by each speaker, for a total of 32 utterances. The utterances are available in two formats, Xwaves (.sd and .f0 files) and Xkl (Klatt .wav files).

Download a tar/zip archive of the utterances required for this lab (TAR - 12 MB).

[top]

Lab 9: Intonation Labelling OR Factors Affecting Duration

You will use either the utterances for Part A or the utterances for Part B:

Part A:
1) Utterances 3-10 in files lcorn.16k (Xwaves format), located in the labc account (Download files lcorn.16k (LCORN) and lcorn.f0 (LCORN) here)
2) Utterances from Lab 8, Part A
Part B:
The following words, recorded by two speakers: "fat", "fad", "fade", "cup", "cub", "coop", "pick", "pig", "peak", "men", "main', "mate", "tuck", "tug", "take". These files are in two formats, Xwaves (.sd files) and Xkl (Klatt .wav). (Download a tar/zip archive of the utterances here (TAR - 15 MB).)

[top]

Lab 10: Speech Intelligibility and Confusion Matrices

In this lab, you will participate as a subject in two types of intelligibility tests. If you attend the lab, you will hear the stimuli on audiotape. If you do not attend class, you can download audio files of the tests below. In order to take run the tests on your own, you will need to download the response and answers sheets from the Lab 10 page.

Consonant-vowel (CV) nonsense syllables: 10 dB SNR (RAM) | 5 dB SNR (RAM) | 0 dB SNR (RAM)
Sentences in noise (Spin test): Introduction (RAM) | Practice Items (RAM) | 0db SBR (RAM) | 10db SBR (RAM)

A third type of test is available, although it is not presented in class. You will need to download the response and answers sheets from the Lab 10 page.

Consonant-vowel-consonant (CVC) nonsense syllables: 0 db SNR (RAM)

[top]

Lab 11: Introduction to Speech Disorders

You will use utterances produced by adults and children with several types of speech disorders, all in Xkl (Klatt .wav ) format:

Part 1: Utterances produced by children who misarticulate certain sounds and utterances produced by normal children
Download a tar/zip archive for Part 1 here (TAR)
Part 2: Utterances produced by deaf children and one normal-hearing adult
Download a tar/zip archive for Part 2 here (TAR)
Part 3: Utterances produced by adults with dysarthria and normal-speaking adults
Download a tar/zip archive for Part 3 here (TAR)

[top]

Lab 12: Interpretation of Cineradiographic Motion Pictures of Selected Utterances

In this lab you will use .gif files of cineradiographic images. Because the full database of these images is rather large, it will only be available to students attending the lab. Sample .gif files can be downloaded here:

Download file KNS_6400995.gif (GIF), /a/ in "he'tat"
Download file KNS_6401423.gif (GIF), /a/ in "soggy"
Download file KNS_6400919.gif (GIF), /u/ in "he'tu"
Download file KNS_6401621.gif (GIF), /u/ in "shoes"

[top]

Lab 13: Speech Synthesis Using a Formant Synthesizer

For this lab, students will record new utterances. If you are unable to record new data, you can use the sample utterances provided for Lab 5, part B (either "pat" or "bad").

[top]

Lab 14: Measuring Speech Movements Using Data from X-ray Microbeam System

In this lab, you will use data recorded by an x-ray microbeam system. There are data for two speakers and they are in MATLAB format (.mat files):

Microbeam data for speaker JW16 (ZIP - 1.7 MB) (The ZIP file contains four .mat files)
Microbeam data for speaker JW18 (ZIP - 1.8 MB) (The ZIP file contains four .mat files)

[top]

Lab 15: Higher-Level Synthesis with a Formant Synthesizer, Using Quasi-Articulatory Parameters

There are no utterances required for this lab.

[top]

Search

Lab Database

Lab Data