Morphosyntactic Resources for Automatic Speech Recognition

Pascale Sebillot

2008, Language Resources and Evaluation

Abstract

Texts generated by automatic speech recognition (ASR) systems have specific characteristics, stemming from the idiosyncrasies of spoken language or from the design of ASR systems, that make them harder to exploit than conventional written texts. This paper studies the value of morphosyntactic information as a resource for ASR. We show the ability of automatic

Related papers

Ricardo Ribeiro

2002

The purpose of this paper is to present the development of a morphosyntactic disambiguation system (or part-of-speech tagging system) intended for use as a component of a Text-to-Speech (TTS) system for European Portuguese. In developing the tagger, we compared two approaches: a probabilistic approach and a hybrid approach. Besides comparing these two approaches, this paper considers the effects of the different classes of errors on the performance of the complete TTS system.

Morphosyntactic Annotation and Lemmatization Based on the Finite-State Dictionary of Wordformation Elements

Brian O'Donovan, Alexander Troussov

Dictionary-based methods in morphological analysis can provide accurate lemmatization and rich annotation, including part-of-speech, number, gender, etc. A morphological guesser can be used to process out-of-vocabulary words. Industrial text processing applications require high performance, which suggests the need to merge these two types of applications. In this paper we discuss the conversion of a pre-existing high coverage morphosyntactic lexicon into a deterministic finite-state device which: preserves accurate lemmatization and annotation for vocabulary words, allows acquisition and exploitation of implicit morphological knowledge from the dictionaries in the form of ending guessing rules to process out-of-vocabulary words, allows seamless integration of additional hand-crafted ending guessing rules.

Are Morphosyntactic Taggers Suitable to Improve Automatic Transcription?

Pascale Sebillot

Lecture Notes in Computer Science, 2006

The aim of our paper is to study the value of part-of-speech (POS) tagging for improving speech recognition. We first evaluate the proportion of misrecognized words that can be corrected using POS information; the analysis of a short extract of French radio broadcast news shows that an absolute decrease of the word error rate by 1.1% can be expected. We also demonstrate quantitatively that traditional POS taggers are reliable when applied to spoken corpora, including automatic transcriptions. This new result enables us to use POS tag knowledge effectively to improve the quality of transcriptions in a postprocessing stage, especially by correcting agreement errors.

Morphosyntactic Analysis of the CHILDES and TalkBank Corpora

Brian MacWhinney

This paper describes the construction and usage of the MOR and GRASP programs for part-of-speech tagging and syntactic dependency analysis of the corpora in the CHILDES and TalkBank databases. We have written MOR grammars for 11 languages and GRASP analyses for three. For English data, the MOR tagger reaches 98% accuracy on adult corpora and 97% accuracy on child language corpora. The paper discusses the construction of MOR lexicons with an emphasis on compounds and special conversational forms. The shape of rules for controlling allomorphy and morpheme concatenation is discussed. The analysis of bilingual corpora is illustrated in the context of the Cantonese-English bilingual corpora. Methods for preparing data for MOR analysis and for developing MOR grammars are discussed. We believe that recent computational work using this system is leading to significant advances in child language acquisition theory and theories of grammar identification more generally.

Phonological Realization of Morphosyntactic Features

Giorgos Markopoulos

2018

ANGIE: a new framework for speech analysis based on morpho-phonological modelling

Helen Meng

Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96

This paper describes a new system for speech analysis, ANGIE, which characterizes word substructure in terms of a trainable grammar. ANGIE captures morpho-phonemic and phonological phenomena through a hierarchical framework. The terminal categories can be alternately letters or phone units, yielding a reversible letter-to-sound/sound-to-letter system. In conjunction with a segment network and acoustic phone models, the system can produce phonemic-to-phonetic alignments for speech waveforms. For speech recognition, ANGIE uses a one-pass bottom-up best-first search strategy. Evaluated in the ATIS domain, ANGIE achieved a phone error rate of 36%, as compared with 40% achieved with a baseline phone-bigram based recognizer under similar conditions. ANGIE potentially offers many attractive features, including dynamic vocabulary adaptation, as well as a framework for handling unknown words. Previous experiments have yielded improved pronunciation accuracy without this layer.

A Morpho-Graphemic Approach for the Recognition of Spontaneous Speech in Agglutinative Languages-Like Hungarian

Tibor Fegyo

Eighth Annual Conference …, 2007

A coupled acoustic- and language-modeling approach is presented for the recognition of spontaneous speech primarily in agglutinative languages. The effectiveness of the approach in large vocabulary spontaneous speech recognition is demonstrated on the Hungarian MALACH corpus. The derivation of morphs from word forms is based on a statistical morphological segmentation tool while the mapping of morphs into graphemes is obtained trivially by splitting each morph into individual letters. Using morphs instead of words in language modeling gives significant WER reductions in case of both phoneme- and grapheme-based acoustic modeling. The improvements are larger after speaker adaptation of the acoustic models. In conclusion, morphophonemic and the proposed morpho-graphemic ASR approaches yield the same best WERs, which are significantly lower than the word-based baselines but essentially without language dependent rules or pronunciation dictionaries in the latter case.

Analysis of morph-based speech recognition and the modeling of out-of-vocabulary words across languages

Andreas Stolcke

2007

We analyze subword-based language models (LMs) in large-vocabulary continuous speech recognition across four "morphologically rich" languages: Finnish, Estonian, Turkish, and Egyptian Colloquial Arabic. By estimating n-gram LMs over sequences of morphs instead of words, better vocabulary coverage and reduced data sparsity are obtained. Standard word LMs suffer from high out-of-vocabulary (OOV) rates, whereas the morph LMs can recognize previously unseen word forms by concatenating morphs. We show that the morph LMs generally outperform the word LMs and that they perform fairly well on OOVs without compromising the accuracy obtained for in-vocabulary words.

MORPHON: Lexicon-based text-to-phoneme conversion and phonological rules

Ramadya Abitza

Analysis and Synthesis of Speech

In this contribution MORPHON is outlined. This module provides the text-to-speech system with phonological rules. It will be argued that such rules are needed because the pronunciation of a sentence does not consist of the concatenation of the pronunciation of the constituting morphemes, but the pronunciation of morphemes is modified in certain contexts. These rules can only apply properly if exceptions can be listed in a lexicon, and if rules can refer to morphological and morpho-syntactic information. Therefore a lexicon-based approach to text-to-phoneme transcription conversion was chosen. Finally, the pronunciation accuracy of MORPHON is compared with that of two rule-based text-to-phoneme transcription systems.

The Corpus of Spoken Icelandic and Its Morphosyntactic Annotation

Eiríkur Rögnvaldsson

We describe the Corpus of Spoken Icelandic (ÍS-TAL) which is made up of 15 hours of spontaneous naturally occurring conversations, 31 conversations in all. The corpus comprises 184,080 tokens, 14,297 types and 9,221 lemmas. It has been transcribed using standard orthography. We present a list of the 30 most common lemmas in the corpus and compare it to a list of the most frequent lemmas in the written language, concluding that the differences between the two lists are smaller than expected. We have tagged the corpus morphologically with a statistical tagger that had been trained on written texts. The results are much better than we expected, and the tagging accuracy is at least as high as for the written texts. The final part of the paper is a report on work in progress. We have been experimenting with converting the morphological tagging into a shallow syntactic markup by applying a few simple hand-written rules. Even though the analysis we get by using this procedure is bound to be incomplete and contain several errors, we conclude that the results are promising and we can use this method to build a simple yet useful treebank with minimal effort.

Marie-Thérèse LE NORMAND

Behav Res Methods Instrum Comput 32(3): 468-481, 2000

Automatic analysis of transcripts is not always as simple as it should be. Some of the tasks involved are quite tedious, although computer tools are already a great help. One of these tasks is the disambiguation of lexically tagged texts. In a language such as French or English, more than 70% of the words in a full adult lexicon (more than 100,000 words) are ambiguous. Smaller lexicons have fewer ambiguous words but will produce omission errors. When creating a lexicon, it is very difficult to decide in advance that a word is not going to be ambiguous in a given corpus. It is better to use a full child or adult lexicon and choose from the whole set of lexical possibilities. This task can be very time consuming when analyzing a large transcript. Fortunately, it is possible to make this process fully automatic by using an advanced part-of-speech program that can tag and disambiguate a corpus in a few seconds, with an accuracy rate that may be better than or about the same as human processing accuracy. Also, the adequacy of such automatic processing shows that the morphosyntax of child language is very consistent, in itself and in relation to adult language.

Using Morphossyntactic Information in TTS Systems: Comparing Strategies for European Portuguese

Ricardo Ribeiro

2003

To improve the quality of the speech produced by a Text-to-Speech (TTS) system, it is important to obtain the maximum amount of information from the input text that may help in this task. This covers a wide range of possibilities, from the simple conversion of non-orthographic items to more complex syntactic and semantic analysis. In this paper, we present the development of a morphosyntactic tagging system and analyze its influence on the performance of a TTS system for European Portuguese.

Automatic disambiguation of morphosyntax in spoken language corpora

Christophe Parisse

Behavior Research Methods, 2000

The use of computer tools does not always speed up the analysis of young children's transcripts. Although it is now easy to lexically tag every word in a corpus, you still have to choose between numerous ambiguous forms, especially with languages such as French or English, where nearly 50% of the words are ambiguous. Computational linguistics now offers well-developed part-of-speech labeling which permits fully automatic disambiguation of lexical tags: the tool presented here (POST) can tag and disambiguate a large text in a few seconds. This could form a complement to many systems dealing with language transcripts, and also suggests further theoretical developments about the assessment of the status of morphosyntax in child language. The program works for French but is open to other languages such as English. The analyses and computation of a corpus produced by normal children aged two to four, as well as of a sample corpus produced by SLI children, are given as examples.

Morpho-Syntactic Analysis Framework for Tone Language Text-to-Speech Systems

Moses Ekpenyong

Computer and Information Science, 2012

This paper presents a morpho-syntactic analysis framework using a data-driven methodology. The proposed framework complements the front-end design of a recent text-to-speech (TTS) project and is generic for other tone language systems. We test the design on Ibibio (ISO 639-2: nic; Ethnologue: IBB), a Lower Cross language of the (New) Benue Congo language family, widely spoken in the southeastern region of Nigeria. Implementation shows that the design is sufficient for morpho-syntactic parsing and useful for prosody improvement in TTS systems. Also, the methodology adopted detaches a greater part of the linguistic features specification from the program code. This allows for easy morphological alterations of utterances and replication of the synthesizer for other languages.

Morphosyntactic Parser for Brazilian Portuguese: Methodology for Development and Assessment

Izabel Christine Seara

inf.pucrs.br

In text-to-speech (TTS) systems, an effective morphosyntactic classification is important to improve the prosody of synthesized speech as well as the pronunciation of words subject to vocalic alternation. This research work presents a methodology used for developing and ...

Analysis of Morph-Based Language Modeling and Speech Recognition in Slovak

Jozef Juhár

Advances in Electrical and Electronic Engineering, 2012

The inflection of the Slovak language causes a large number of unique word forms, which produces not only a large vocabulary, but also a number of out-of-vocabulary words. Morph-based language models solve this problem by decomposing inflected word forms into small sub-word units, and they also address the general problem of sparsity of the training data. In this paper, we present several rule-based and data-driven approaches to the automatic segmentation of words into morphs. These data are later used in the modeling of the Slovak language for large vocabulary continuous speech recognition. Preliminary results show a significant decrease in the number of out-of-vocabulary words and a reduction of the resultant language model perplexity.
