A Rational Design for a Weighted Finite-State Transducer Library

1 Overview We describe the design principles and main algorithms for an object-oriented library for weighted nite-state transducers, which are nite automata in which each transition has an output and a weight as well as the more familiar input. The main goal of the library is to provide algorithms and representations for all the symbolic processing components (language models, dictionaries, acoustic realization rules, word lattices) in large-vocabulary speech recognition systems. This goal leads to several requirements: generality, to support the representation and use of the various information sources in speech recognition; modularity, to allow rapid experimentation with diierent representations of speech recognition tasks; and eeciency, to support competitive large-vocabulary recognition. Rational power series provide the theoretical foundation for the library by giving the semantics for the objects and operations in the library and by creating the opportunity for optimizations (on-demand composition, determinization and minimization) that are not available in more \ad hoc" speech recognition frameworks. The generality of the library has made it also valuable in other language-processing applications, such as word segmentation for Chinese text 25]. 1.1 Design Rationale Current speech-recognition systems rely on a variety of probabilistic nite-state models, for instance n-gram language models 21], multiple-pronunciation dictionaries 11], and context-dependent acoustic models 10]. However, most speech recognizers do not take advantage of the shared properties of the information sources they use. Instead, they rely on special-purpose algorithms for speciic representations. That means that the recognizer has to be rewritten if representations are changed for a new application or for increased accuracy or performance. Experiments with diierent representations are therefore diicult, as they require changing or even completely replacing fairly intricate recognition programs. This situation is not too diierent from that in programming-language parsing before lex and yacc 2]. Furthermore, specialized representations and algorithms preclude certain global optimizations based on the general properties of nite-state models. Again, the situation is similar to the lack of general

[1]  Marcel Paul Schützenberger,et al.  On the Definition of a Family of Automata , 1961, Inf. Control..

[2]  Alfred V. Aho,et al.  The Design and Analysis of Computer Algorithms , 1974 .

[3]  Samuel Eilenberg,et al.  Automata, languages, and machines. A , 1974, Pure and applied mathematics.

[4]  Imre Simon,et al.  Limited subsets of a free monoid , 1978, 19th Annual Symposium on Foundations of Computer Science (sfcs 1978).

[5]  Alfred V. Aho,et al.  Compilers: Principles, Techniques, and Tools , 1986, Addison-Wesley series in computer science / World student series edition.

[6]  Arto Salomaa,et al.  Semirings, Automata, Languages , 1985, EATCS Monographs on Theoretical Computer Science.

[7]  Jean Berstel,et al.  Rational series and their languages , 1988, EATCS monographs on theoretical computer science.

[8]  Dominique Revuz,et al.  Minimisation of Acyclic Deterministic Automata in Linear Time , 1992, Theor. Comput. Sci..

[9]  Andrej Ljolje,et al.  Optimal speech recognition using phone recognition and lexical access , 1992, ICSLP.

[10]  Mehryar Mohri Minimization of Sequential Transducers , 1994, CPM.

[11]  Chilin Shih,et al.  A Stochastic Finite-State Word-Segmentation Algorithm for Chinese , 1994, ACL.

[12]  Martin Kay,et al.  Regular Models of Phonological Rule Systems , 1994, CL.

[13]  Mehryar Mohri,et al.  Syntactic Analysis by Local Grammars Automata: an Efficient Algorithm , 1994, ArXiv.

[14]  Lauri Karttunen,et al.  The Replace Operator , 1995, ACL.

[15]  Roberto Pieraccini,et al.  Non-deterministic stochastic language models for speech recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[16]  Richard Sproat,et al.  An Efficient Compiler for Weighted Rewrite Rules , 1996, ACL.

[17]  M. A. Hill An Eecient Compiler for Weighted Rewrite Rules , 1996 .

[18]  Mehryar Mohri,et al.  On some applications of finite-state automata theory to natural language processing , 1996, Nat. Lang. Eng..

[19]  Jarkko Kari,et al.  Digital Images and Formal Languages , 1997, Handbook of Formal Languages.

[20]  Mehryar Mohri,et al.  Finite-State Transducers in Language and Speech Processing , 1997, CL.