Acoustic Analysis of Breathy and Rough Voice Characterizing Elderly Speech

Takeshi Miyazaki; Mitsunori Mizumachi; Katsuyuki Niyada

doi:10.20965/jaciii.2010.p0135

single-jc.php

« previous

JACIII Vol.14 No.2 pp. 135-141

doi: 10.20965/jaciii.2010.p0135

(2010)

Paper:

Views over last 60 days: 538

Acoustic Analysis of Breathy and Rough Voice Characterizing Elderly Speech

Takeshi Miyazaki, Mitsunori Mizumachi, and Katsuyuki Niyada

Department of Electrical Engineering, Kyushu Institute of Technology, 1-1 Sensui-cho, Tobata-ku, Kitakyushu-shi, Fukuoka 804-8550, Japan

Received:

July 9, 2009

Accepted:

December 9, 2009

Published:

March 20, 2010

Keywords:

elderly speech, breathy voice, rough voice, listening test, spectral analysis

Abstract

This paper aims at investigating acoustic features, which can objectively explain breathiness and roughness of elderly speech, respectively. In this paper, acoustic analysis was carried out using word sequences, which were uttered by 153 male speakers in the age range of between 20 and 89 years old. Concerning the breathiness, we confirmed that elderly breathy voices caused energy lift in higher frequency region over 4 kHz in average power spectra during the stationary parts in the uttered vowels. Concerning roughness, we observed the slight fluctuations, which synchronized with vocal cord vibration, in amplitude spectra during stationary parts of vowels. Based on acoustic analysis results, we propose physical parameters for measuring breathiness and roughness, respectively. In this paper, listening tests were carried out to quantitatively give the subject degrees of breathiness and roughness, respectively. It was confirmed that the proposed physical parameters had correlation with each of subjective degrees. Relationships between age and acoustic characteristics of breathiness and roughness were investigated using the proposed parameters. It is confirmed that the degree of breathiness and roughness increased in proportion to age, especially in age ranges over 60 years old.

Cite this article as:

T. Miyazaki, M. Mizumachi, and K. Niyada, “Acoustic Analysis of Breathy and Rough Voice Characterizing Elderly Speech,” J. Adv. Comput. Intell. Intell. Inform., Vol.14 No.2, pp. 135-141, 2010.

Data files:

References

[1] D. Hartman, “The Perceptual Identity and Characteristics of Aging in Normal Male Adult Speakers,” J. Comm. Disorders, Vol.12, No.1, pp. 53-61, 1979.
[2] D. Hartman and J. Danhauer, “Perceptual Features of Speech for Males in Four Perceived Age Decades,” J. Acoust. Soc. Am., Vol.59, pp. 713-715, 1976.
[3] P. Ptacek and E. Sander, “Age Recognition from Voice,” J. Speech Hear. Res., Vol.9, No.2, pp. 273-277, 1996.
[4] W. Ryan and K. Burk, “Perceptual and Acoustic Correlates of Aging in the Speech of Males,” J. Comm. Disorders, Vol.7, No.2, pp. 181-192, 1974.
[5] T. Miyazaki, M. Mizumachi, and K. Niyada, “Selection of Adjective Phrases for Characterizing Elderly Speech,” IEICE technical report. Speech, Vol.108, No.265, pp. 47-52, 2008. (in Japanese)
[6] E. Yumoto and H. Okamura, “Objective Assessment of Hoarseness: Psychophysical Measurement and Acoustic Analysis,” J. Acoust. Soc. Jpn., Vol.5, No.3, pp. 157-163, 1984.
[7] S. Hiki et al, “Acoustical Analysis of Hoarseness,” J. Acoust. Soc. Jpn., Vol.31, No.8, pp. 504-506, 1975.
[8] P. Mueller, “The Aging Voice,” Semin. Speech Lang., Vol.18, pp. 159-169, 1997.
[9] S. Makino et al, “Tohoku University and Panasonic Isolated Spoken Word Database,” J. Acoust. Soc. Jpn., Vol.48, No.12, pp. 899-905, 1992. (in Japanese)
[10] M. Hoshimi, M. Yamada, and K. Niyada, “A Practical Speech Recognition Method for Unspecified Speakers on a Single DSP Chip,” IEICE Trans., J72-D-2, pp. 2096-2103, 1996. (in Japanese)
[11] The Japan Society of Logopedics and Phoniatrics, “The Understanding Dysphonia Based on DVD Motion Picture, Ver.1.0,” Interuna publishers, 2005. (in Japanese)
[12] Y. Hisa, “Characteristics of Voice Disorder by Advancing Age,” MB. ENTONI, Vol.20, pp. 1-5, 2002. (in Japanese)

This article is published under a Creative Commons Attribution-NoDerivatives 4.0 Internationa License.

[1] [1] D. Hartman, “The Perceptual Identity and Characteristics of Aging in Normal Male Adult Speakers,” J. Comm. Disorders, Vol.12, No.1, pp. 53-61, 1979.

[2] [2] D. Hartman and J. Danhauer, “Perceptual Features of Speech for Males in Four Perceived Age Decades,” J. Acoust. Soc. Am., Vol.59, pp. 713-715, 1976.

[3] [3] P. Ptacek and E. Sander, “Age Recognition from Voice,” J. Speech Hear. Res., Vol.9, No.2, pp. 273-277, 1996.

[4] [4] W. Ryan and K. Burk, “Perceptual and Acoustic Correlates of Aging in the Speech of Males,” J. Comm. Disorders, Vol.7, No.2, pp. 181-192, 1974.

[5] [5] T. Miyazaki, M. Mizumachi, and K. Niyada, “Selection of Adjective Phrases for Characterizing Elderly Speech,” IEICE technical report. Speech, Vol.108, No.265, pp. 47-52, 2008. (in Japanese)

[6] [6] E. Yumoto and H. Okamura, “Objective Assessment of Hoarseness: Psychophysical Measurement and Acoustic Analysis,” J. Acoust. Soc. Jpn., Vol.5, No.3, pp. 157-163, 1984.

[7] [7] S. Hiki et al, “Acoustical Analysis of Hoarseness,” J. Acoust. Soc. Jpn., Vol.31, No.8, pp. 504-506, 1975.

[8] [8] P. Mueller, “The Aging Voice,” Semin. Speech Lang., Vol.18, pp. 159-169, 1997.

[9] [9] S. Makino et al, “Tohoku University and Panasonic Isolated Spoken Word Database,” J. Acoust. Soc. Jpn., Vol.48, No.12, pp. 899-905, 1992. (in Japanese)

[10] [10] M. Hoshimi, M. Yamada, and K. Niyada, “A Practical Speech Recognition Method for Unspecified Speakers on a Single DSP Chip,” IEICE Trans., J72-D-2, pp. 2096-2103, 1996. (in Japanese)

[11] [11] The Japan Society of Logopedics and Phoniatrics, “The Understanding Dysphonia Based on DVD Motion Picture, Ver.1.0,” Interuna publishers, 2005. (in Japanese)

[12] [12] Y. Hisa, “Characteristics of Voice Disorder by Advancing Age,” MB. ENTONI, Vol.20, pp. 1-5, 2002. (in Japanese)