Text-Style Conversion of Speech Transcript into Web Document for Lecture Archive

Masashi Ito; Tomohiro Ohno; Shigeki Matsubara

doi:10.20965/jaciii.2009.p0499

single-jc.php

« previous

JACIII Vol.13 No.4 pp. 499-505

(2009)

doi: 10.20965/jaciii.2009.p0499

Paper:

Views over last 60 days: 788

Text-Style Conversion of Speech Transcript into Web Document for Lecture Archive

Masashi Ito^, Tomohiro Ohno^, and Shigeki Matsubara^

^*Graduate School of Information Science, Nagoya University

^**Graduate School of International Development, Nagoya University

^***Information Technology Center, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan

Received:

November 25, 2008

Accepted:

March 25, 2009

Published:

July 20, 2009

Keywords:

natural languages, spoken language processing, digital archiving, web contents, paraphrasing

Abstract

It is very significant to the knowledge society to accumulate spoken documents on the web. However, because of the high redundancy of spontaneous speech, the faithfully transcribed text is not readable on an Internet browser, and therefore not suitable as a web document. This paper proposes a technique for converting spoken documents into web documents for the purpose of building a speech archiving system. The technique edits automatically transcribed texts and improves their readability on the browser. The readable text can be generated by applying technology such as paraphrasing, segmentation, and structuring transcribed texts. Editing experiments using lecture data demonstrated the feasibility of the technique. A prototype system of spoken document archiving was implemented to confirm its effectiveness.

Cite this article as:

M. Ito, T. Ohno, and S. Matsubara, “Text-Style Conversion of Speech Transcript into Web Document for Lecture Archive,” J. Adv. Comput. Intell. Intell. Inform., Vol.13 No.4, pp. 499-505, 2009.

Data files:

References

[1] K. Bain, S. Basson, A. Faisman, and D. Kanevsky, “Accessibility, Transcription, and Access Everywhere,” IBM System Journal, Vol.44, No.3, pp. 589-603, 2005.
[2] T. Shibata and S. Kurohashi, “Automatic Slide Generation Based on Discourse Structure Analysis,” Proc. 2nd IJCNLP, pp. 754-766, 2005.
[3] P. Chatain, E. W. D. Whittaker, J. A. Mmzinski, and S. Furui, “Topic and Stylistic Adaptation for Speech Summarisation,” Proc. IEEE ICASSP, 2006.
[4] C. James and L. Mirella, “Models for Sentence Compression: a Comparison Across Domains, Training Requirements and Evaluation Measures,” Proc. ACL/COLING2006, pp. 377-384, 2006.
[5] X. Zhu and G. Penn, “Summarization of Spontaneous Conversations,” Proc. 9th ICSLP, pp. 1531-1534, 2006.
[6] G. Murray, S. Renals, J. Carletta, and J. Moore, “Incorporating Speaker and Discourse Features into Speech Summarization,” Proc. HLT/NAACL, pp. 367-374, 2006.
[7] K. Shitaoka, H. Nanjo, and T. Kawahara, “Automatic Transformation of Lecture Transcription into Document Style Using Statistical Framework,” Proc. 8th ICSLP, pp. 2169-2172, 2004.
[8] S. Matsubara, A. Takagi, N. Kawaguchi, and Y. Inagaki, “Bilingual Spoken Monologue Corpus for Simultaneous Machine Interpretation Research,” Proc. 3rd LREC. pp. 153-159, 2002.
[9] S. Kurohashi and M. Nagao, “A Syntactic Analysis Method of Long Japanese Sentences Based on the Detection of Conjunctive Structures,” Computational Linguistics, Vol.20, No.4, pp. 507-534, 1994.
[10] S. Kurohashi and M. Nagao, “Improvements of Japanese Morphological Analyser JUMAN,” Proc. 15th COLING, Vol.2, pp. 1123-1127, 1994.
[11] http://www.hke.jp/products/voice/voice_index.htm
[12] http://www.nhk.or.jp/strl/tvml/english/player2/index.html

This article is published under a Creative Commons Attribution-NoDerivatives 4.0 Internationa License.

[B1] [1] K. Bain, S. Basson, A. Faisman, and D. Kanevsky, “Accessibility, Transcription, and Access Everywhere,” IBM System Journal, Vol.44, No.3, pp. 589-603, 2005.

[B2] [2] T. Shibata and S. Kurohashi, “Automatic Slide Generation Based on Discourse Structure Analysis,” Proc. 2nd IJCNLP, pp. 754-766, 2005.

[B3] [3] P. Chatain, E. W. D. Whittaker, J. A. Mmzinski, and S. Furui, “Topic and Stylistic Adaptation for Speech Summarisation,” Proc. IEEE ICASSP, 2006.

[B4] [4] C. James and L. Mirella, “Models for Sentence Compression: a Comparison Across Domains, Training Requirements and Evaluation Measures,” Proc. ACL/COLING2006, pp. 377-384, 2006.

[B5] [5] X. Zhu and G. Penn, “Summarization of Spontaneous Conversations,” Proc. 9th ICSLP, pp. 1531-1534, 2006.

[B6] [6] G. Murray, S. Renals, J. Carletta, and J. Moore, “Incorporating Speaker and Discourse Features into Speech Summarization,” Proc. HLT/NAACL, pp. 367-374, 2006.

[B7] [7] K. Shitaoka, H. Nanjo, and T. Kawahara, “Automatic Transformation of Lecture Transcription into Document Style Using Statistical Framework,” Proc. 8th ICSLP, pp. 2169-2172, 2004.

[B8] [8] S. Matsubara, A. Takagi, N. Kawaguchi, and Y. Inagaki, “Bilingual Spoken Monologue Corpus for Simultaneous Machine Interpretation Research,” Proc. 3rd LREC. pp. 153-159, 2002.

[B9] [9] S. Kurohashi and M. Nagao, “A Syntactic Analysis Method of Long Japanese Sentences Based on the Detection of Conjunctive Structures,” Computational Linguistics, Vol.20, No.4, pp. 507-534, 1994.

[B10] [10] S. Kurohashi and M. Nagao, “Improvements of Japanese Morphological Analyser JUMAN,” Proc. 15th COLING, Vol.2, pp. 1123-1127, 1994.

[B11] [11] http://www.hke.jp/products/voice/voice_index.htm

[B12] [12] http://www.nhk.or.jp/strl/tvml/english/player2/index.html

Text-Style Conversion of Speech Transcript into Web Document for Lecture Archive

Masashi Ito*, Tomohiro Ohno**, and Shigeki Matsubara***

Masashi Ito^, Tomohiro Ohno^, and Shigeki Matsubara^