Materials Part III - Lack of Correspondence between Vowels and Patterns of Relative Spectral Energy Maxima or Formant Patterns
M8.2: Vowel Perception at Fundamental Frequencies above Statistical
Values of the Respective First Formant Frequency
Content of illustration
Figure 6 shows intelligible high-pitched sounds of the vowels / y, e, ø, ɛ, o / at F0 of c. 750 Hz, and Figure 7 exhibits
intelligible high-pitched sounds of the corner vowels / i, a, u / at F0 of c. 850 Hz. Note again the pronounced spectral differences
for these high-pitched sounds of different vowels supporting the thesis of a parallelism between differences in perceived
vowel quality and related acoustic differences, that is, the thesis of vowel-specific harmonic spectra.
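As a rough illustration of why such spectra are sparse, the harmonics of an idealized periodic sound lie at integer multiples of F0. The sketch below lists them for the two F0 levels mentioned above. The function name and the 5 kHz upper limit are assumptions made for this illustration only and are not taken from the figures.

```python
from typing import List


def harmonic_frequencies(f0_hz: float, max_hz: float = 5000.0) -> List[float]:
    """Return the harmonic frequencies k * F0 (k = 1, 2, ...) up to max_hz.

    Assumes an idealized, strictly periodic sound; max_hz is an
    arbitrary illustration limit, not a value from the source.
    """
    count = int(max_hz // f0_hz)
    return [k * f0_hz for k in range(1, count + 1)]


if __name__ == "__main__":
    # F0 levels of c. 750 Hz (Figure 6) and c. 850 Hz (Figure 7)
    for f0 in (750.0, 850.0):
        print(f0, harmonic_frequencies(f0))
```

At these F0 levels, the lowest harmonic already lies above the statistical F1 values of 350–450 Hz quoted in this section for / i, y, u /.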
Figures 8 to 10 show examples of speech extracts of untrained speakers, journalists, TV hosts, actresses and actors. The
pitch contours of the utterances of single speakers exceed the age- and gender-related statistical F1 of the vowels
/ i, y, u / (450 Hz for children, 400 Hz for women and 350 Hz for men). The ranges of F0 indicated (overall ranges for the
speech sounds of a single speaker or a group of speakers, see below) were determined acoustically, as approximations, and
by listening to the sounds. (Please ignore some errors in the graphics that exceed the verified ranges given below. These errors
are due, for example, to background noise or music, the sound of an audience, or the automatic pitch calculation.) Within
a figure, the examples are ordered first by the number of examples per speaker or group of speakers, and second by
the identification number of the speaker.
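The comparison described above can be sketched as a simple check of the upper limit of an F0 range against the statistical F1 of / i, y, u / for the respective speaker group. This is a minimal sketch only: the dictionary and function names are hypothetical, and the three example ranges are the ones reported in this section for speakers 172, 135 and 238.

```python
# Statistical F1 of / i, y, u / as quoted in the text (Hz).
STATISTICAL_F1_HZ = {"children": 450.0, "women": 400.0, "men": 350.0}


def exceeds_statistical_f1(f0_max_hz: float, group: str) -> bool:
    """True if the upper F0 limit lies above the group's statistical F1."""
    return f0_max_hz > STATISTICAL_F1_HZ[group]


def margin_hz(f0_max_hz: float, group: str) -> float:
    """Distance (Hz) between the upper F0 limit and the statistical F1."""
    return f0_max_hz - STATISTICAL_F1_HZ[group]


if __name__ == "__main__":
    # (speaker ID, group, F0 min, F0 max), approximate values from the text
    examples = [
        ("172", "women", 220.0, 700.0),
        ("135", "children", 220.0, 600.0),
        ("238", "men", 130.0, 420.0),
    ]
    for speaker, group, f0_min, f0_max in examples:
        print(speaker, group, exceeds_statistical_f1(f0_max, group),
              margin_hz(f0_max, group))
```

All values are approximations ("c."), so small margins should be read with corresponding caution.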
Figure 8 shows pitch contours of speech extracts produced by untrained speakers, journalists, TV hosts and actresses talking
on TV (not acting), that is, speech as encountered in everyday life:
- The examples for speaker 172 (see pitch contours 8-1 to 8-3) relate to extracts of a woman selling grilled chicken in a market
in Paris. Overall range of F0 = c. 220–700 Hz (excluding very high-pitched exclamations).
- The examples for the two speakers subsumed under the ID number 379 and for the speaker 380 (see pitch contours 8-4 to 8-6)
relate to extracts of two American women and one American man demonstrating infant- or child-directed speech. Overall range of
F0 = c. 200–800 Hz for the women (except one higher peak at c. 1 kHz) and c. 150–600 Hz for the man.
- The examples for speaker 336 (see pitch contours 8-7 and 8-8, the latter from 0.7 to 2.5 sec.) relate to extracts of a female
Indonesian singer talking in a TV show and to an exclamation of her name during the show. Overall range of F0 = c. 350–950
Hz.
- The two examples for the speakers subsumed under the ID number 348 (see pitch contours 8-9 and 8-10) relate to extracts of
two female TV hosts announcing the results of a singing contest (announcements in English). Overall range of F0 = c. 200–700
Hz.
- The example for speaker 135 (see pitch contour 8-11) relates to two sentences of a boy (age 6). Range of F0 = c. 220–600 Hz.
- The example for speaker 174 (see pitch contour 8-12) relates to an extract of a female North American journalist speaking
on television. Range of F0 = c. 175–600 Hz.
- The example for speaker 217 (see pitch contour 8-13) relates to an extract of a North American woman talking about her child
on television. Range of F0 = c. 160–550 Hz.
- The example for speaker 220 (see pitch contour 8-14) relates to an extract of a female French doctor talking on television.
Range of F0 = c. 250–520 Hz.
- The example for speaker 238 (see pitch contour 8-15) relates to an extract of a male French TV host. Range of F0 = c. 130–420 Hz
(exceeding only gender-related statistical F1 of the vowels / i, y, u /).
- The example for speaker 383 (see pitch contour 8-16) relates to an extract of a French woman talking on television in a TV
spot. Range of F0 = c. 220–830 Hz.
- The example for two speakers subsumed under the ID number 379 (see pitch contour 8-17) relates to an extract of a female French
journalist (first part) questioning a French woman on the street, and the answer of the latter (second part). Overall range
of F0 for the utterances of both women = c. 230–600 Hz.
Figure 9 shows pitch contours of speech extracts of performing actresses (film, comic, voice-over, dubbing):
- The examples for speaker 216 (see pitch contours 9-1 to 9-6) relate to extracts of a female Swiss narrator of fairy tales.
Overall range of F0 = c. 150–900 Hz.
- The examples for speaker 177 (see pitch contours 9-7 to 9-9) relate to extracts of a French comic actress performing on stage.
Overall range of F0 = c. 180–780 Hz.
- The examples for speaker 178 (see pitch contours 9-10 to 9-12) relate to extracts of another French comic actress performing
on stage. Overall range of F0 = c. 200–850 Hz.
- The examples for speaker 212 (see pitch contours 9-13 to 9-15) relate to extracts of the speech of a French actress in a cartoon.
Overall range of F0 = c. 300–700 Hz.
- The examples for the two speakers subsumed under the ID number 251 (see pitch contours 9-16 to 9-18) relate to extracts of two
British actresses performing as the voices of the two main characters in a computer-animated fantasy film. Overall range of
F0 = c. 150–800 Hz.
- The examples for speaker 276 (see pitch contours 9-19 to 9-21) relate to extracts of a French comedy actress performing on
stage. Overall range of F0 = c. 400–780 Hz.
- The example for speaker 175 (see pitch contour 9-22) relates to an extract of a North American actress performing as a female
character in a film. Range of F0 = c. 270–700 Hz (excluding one high-pitched exclamation at F0 of c. 880 Hz).
- The example for speaker 223 (see pitch contour 9-23) relates to an extract of a German actress dubbing a female character
in a film. Range of F0 = c. 220–780 Hz (excluding one high-pitched exclamation at the end).
- The example for speaker 234 (see pitch contour 9-24) relates to an extract of a French comic actress performing on stage.
Range of F0 = c. 200–850 Hz.
- The example for speaker 258 (see pitch contour 9-25) relates to an extract of a French actress performing as the voice of
a female character in an animation film. Range of F0 = c. 220–780 Hz.
- The example for speaker 275 (see pitch contour 9-26) relates to an extract of a German comic actress performing on stage.
Range of F0 = c. 180–850 Hz.
- The example for speaker 291 (see pitch contour 9-27) relates to an extract of a British actress performing in a fantasy film.
Range of F0 = c. 100–700 Hz.
- The example for speaker 296 (see pitch contour 9-28) relates to an extract of a German comic actress. Range of
F0 = c. 150–600 Hz.
- The example for speaker 350 (see pitch contour 9-29) relates to an extract of a North American actress performing as a female
character in a film. Range of F0 = c. 160–900 Hz (excluding some very high-pitched exclamations).
- The example for speaker 398 (see pitch contour 9-30) relates to an extract of a North American actress performing as a female
character in a TV series. Range of F0 = c. 300–980 Hz.
Figure 10 shows pitch contours of speech extracts of performing actors (film, comic, voice-over, dubbing):
- The examples for speaker 225 (see pitch contours 10-1 to 10-4) relate to speech extracts of a Swiss comic actor performing
as a female character. Overall range of F0 = c. 220–780 Hz.
- The examples for speaker 163 (see pitch contours 10-5 to 10-7) relate to extracts of an Indonesian comic actor performing
on stage in a Drama Gong. Overall range of F0 = c. 300–600 Hz.
- The examples for speaker 169 (see pitch contours 10-8 to 10-10) relate to extracts of a German actor dubbing a male character
in a film. Overall range of F0 = c. 100–700 Hz.
- The examples for speaker 214 (see pitch contours 10-11 to 10-13) relate to extracts of a Japanese Kabuki actor. Overall range
of F0 = c. 250–700 Hz.
- The examples for speaker 297 (see pitch contours 10-14 to 10-16) relate to extracts of speech of another Swiss comic actor
performing in a TV show. Overall range of F0 = c. 130–620 Hz.
- The examples for speaker 194 (see pitch contours 10-17 and 10-18) relate to extracts of a French comic actor performing on
stage. Overall range of F0 = c. 130–700 Hz.
- The examples for the two speakers subsumed under the ID number 394 (see pitch contours 10-19 and 10-20) relate to extracts of
two French actors performing as the voices of male characters in an animation film. Overall range of F0 = c. 310–650 Hz.
- The example for speaker 171 (see pitch contour 10-21) relates to extracts of speech of a German actor dubbing the voice of
a male character. Range of F0 = c. 180–550 Hz.
- The example for speaker 274 (see pitch contour 10-22) relates to extracts of speech of a Swiss actor performing as ventriloquist.
Range of F0 = c. 120–600 Hz.
- The example for speaker 294 (see pitch contour 10-23) relates to an extract of speech of a North American actor performing
as the voice of a female character in a comedy-variety film. Range of F0 = c. 200–800 Hz.
- The example for speaker 351 (see pitch contour 10-24) relates to an extract of speech of a German comic actor performing in
a TV show. Range of F0 = c. 150–580 Hz (excluding one high-pitched exclamation at F0 of c. 780 Hz).
For earlier accounts, see Maurer and Landis (1996, 2000), Maurer, Mok, Friedrichs, and Dellwo (2014), Friedrichs, Maurer,
and Dellwo (2015), Friedrichs, Maurer, Suter, and Dellwo (2015).
Link to the spectra of the Figures
Figure 6: Five intelligible sounds of /y, e, ø, ɛ, o/ produced by children and women at F0 in the range of 700–800 Hz.
>> Link to Figure 6
Figure 7: Three intelligible sounds of the corner vowels /i, a, u/ produced by women at F0 of c. 850 Hz.
>> Link to Figure 7
Figure 8: Pitch contours of speech extracts produced by untrained speakers, journalists, TV hosts and actresses talking on
TV (not acting), that is, speech as encountered in everyday life.
>> Link to Figure 8
Figure 9: Pitch contours of extracts of speech produced by actresses while performing (film, comic, voice-over, dubbing).
>> Link to Figure 9
Figure 10: Pitch contours of extracts of speech produced by actors while performing (film, comic, voice-over, dubbing).
>> Link to Figure 10
Addition 2016-02-23 (performing actors/actresses)
The following examples for speaker 410 relate to extracts of speech of a French comic actress performing on stage. Overall
range of F0 = c. 200–880 Hz.
>> Link to Addition 2016-02-23-A
The following example for speaker 404 relates to an extract of speech of a North American voice-over actress performing in
an animated television series. Range of F0 = c. 220–700 Hz.
>> Link to Addition 2016-02-23-B
The following examples for speaker 294 relate to short extracts of speech of a North American actor performing as the voice
of a female character in a comedy-variety film (see also above). These extracts are used as Mojis (short video clips) in the
Skype program. Range of F0 = c. 150–780 Hz.
>> Link to Addition 2016-02-23-C
Addition 2016-04-16 (everyday life)
The following examples for speaker 412 relate to extracts of speech of a North American female politician (former senator);
the speech was given in a political meeting. Overall range of F0 = c. 170–800 Hz, with one peak up to c. 880 Hz.
>> Link to Addition 2016-04-16-A
The following examples for speaker 417 relate to extracts of speech of a female Malaysian call-center speaker making
announcements in English. Overall range of F0 = c. 150–400 Hz. These examples illustrate a variation of fundamental frequency
of more than one octave within short announcements, although the fundamental frequency does not substantially
exceed 400 Hz.
>> Link to Addition 2016-04-16-B
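The octave span stated for speaker 417 can be checked with a short calculation: the span of an F0 range in octaves is the base-2 logarithm of the ratio of its upper to its lower limit. The function name below is an assumption for this sketch, and the input values are the approximate limits given above.

```python
import math


def octave_span(f0_min_hz: float, f0_max_hz: float) -> float:
    """Span of an F0 range in octaves: log2(f0_max / f0_min)."""
    return math.log2(f0_max_hz / f0_min_hz)


if __name__ == "__main__":
    # Speaker 417: overall range of F0 = c. 150-400 Hz
    print(round(octave_span(150.0, 400.0), 2))  # 1.42
```

A range of c. 150–400 Hz thus spans about 1.4 octaves, in line with the statement of "more than one octave".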
Addition 2016-04-16 (performing actors/actresses)
The following examples for speaker 411 relate to extracts of speech of a North American comic actor hosting an Oscar award
ceremony. Overall range of F0 = c. 150–900 Hz.
>> Link to Addition 2016-04-16-C
The following examples for speaker 413 relate to extracts of speech of a North American actress performing in a film. Overall
range of F0 of the first two extracts = c. 125–600 Hz; overall range of F0 of the third extract = c. 600–1000 Hz, representing
acoustic features of crying.
>> Link to Addition 2016-04-16-D
The following example for speaker 414 relates to an extract of speech of a North American actress performing in a film. Overall
range of F0 = c. 330–700 Hz.
>> Link to Addition 2016-04-16-E
The following example for speaker 415 relates to an extract of speech of a North American actress performing in a film. Overall
range of F0 = c. 150–600 Hz.
>> Link to Addition 2016-04-16-F
The following example for speaker 419 relates to an extract of speech of a French voice-over actress performing in an animation
film. Range of F0 = c. 175–800 Hz.
>> Link to Addition 2016-04-16-G
Addition 2016-10-25 (everyday life)
The following examples for speaker 420 relate to extracts of speech of an Indonesian Imam, given at a religious event. Overall
range of F0 = c. 140–650 Hz, with rare peaks up to c. 700–800 Hz.
>> Link to Addition 2016-10-30-A
The following examples for speaker 421 relate to extracts of speech of an Indonesian female TV commentator; the speech was
given in a TV show. Overall range of F0 = c. 200–650 Hz.
>> Link to Addition 2016-10-30-B
The following examples for speaker 427 relate to extracts of speech of a North American male politician, given in a political
meeting. Overall range of F0 = c. 125–390 Hz.
>> Link to Addition 2016-10-30-C
The following examples for speaker 428 relate to extracts of speech of a North American male supporter of a politician, given
in a political meeting. Overall main range of F0 = c. 200–400 Hz.
>> Link to Addition 2016-10-30-D
The following examples for speaker 429 relate to extracts of speech of a North American female supporter of a politician,
given in a political meeting. Overall range of F0 = c. 220–660 Hz.
>> Link to Addition 2016-10-30-E
The following examples for speaker 430 relate to extracts of speech of a French female lawyer during a discussion on television.
Main range of F0 = c. 200–400 Hz, overall range of F0 = c. 150–520 Hz.
>> Link to Addition 2016-10-30-F
Addition 2016-10-25 (performing actors/actresses)
The following examples for speaker 422 relate to extracts of speech of a North American voice-over actor performing as the
voice of a male character in an animation film. Range of F0 = c. 100–900 Hz.
>> Link to Addition 2016-10-30-G
The following examples for speaker 423 relate to extracts of speech of a North American voice-over actress performing as the
voice of a female character in an animation film. Range of F0 = c. 230–880 Hz, with one additional peak up to 1 kHz.
>> Link to Addition 2016-10-30-H