專案描述

Mpaligner is the many-to-many string alignment tool based on the generative model which is modified to find a minimum mapping between two strings, such as notation and pronunciation. Mpaligner has some functions. The partial annotation function enables you to give correct alignment to part data by manpower (to provide Semi-supervised training). The detection function of special data detects data that is difficult to do alignment (for example tri'plei). The data which alignment is done is employed as training data. For example, when two strings which alignment is done are notation and pronunciation, it is employed as training data to construct a model for grapheme-to-phoneme conversion (g2p conversion). The license of mpaligner is GNU GPL.

If you hope to learn a model with aligned data produced by mpaligner to estimate pronunciation and to estimate a pronunciation with the learned model, please use slearp ( http://sourceforge.jp/projects/slearp/ ) which implements the learning methods for the model and a predict function to estimate a pronunciation.

Developer implementing mpaligner is below.

NAIST(Nara Institute of Science and Technology)
Graduate School of Information Science
Augmented Human Communication Laboratory
The Doctoral Program
Keigo Kubo

安裝

mpaligner のインストール方法 以下の通りです. $ tar xvfz mpaligner_<version>.tar.gz $ cd mpaligner_<version> $ make $ cp mpaligner <パスの通ったディレクトリ> 顯示如何安裝

用法

使用例: $ cat source/test.utf8.txt | ./script/separate_for_char.pl utf8 \ source/joint_chars.utf8.txt > source/test.utf8.char_unit $ mpaligner -i source/test.utf8.char_unit このコマンドにより,... 顯示用法

下載

您的評分
撰寫專案評

使用統計

最近的活動