The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. ex. Some numerals are expressed as "XNUMX".
Copyrights notice
The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. Copyrights notice
Makalah ini mencadangkan kaedah hibrid baharu untuk transliterasi mesin. Kaedah kami adalah berdasarkan gabungan kaedah medan rawak bersyarat dua langkah (CRF) yang baru dicadangkan dan model saluran sumber bersama (JSCM) yang terkenal. Sumbangan kertas kerja ini adalah seperti berikut: (1) Model CRF dua langkah untuk transliterasi mesin dicadangkan. CRF pertama membahagikan rentetan aksara perkataan input kepada ketulan dan yang kedua menukarkan setiap ketul kepada aksara dalam bahasa sasaran. (2) Kaedah pengoptimuman bersama model CRF dua langkah dan algoritma penyahkodan pantas juga dicadangkan. Percubaan kami menunjukkan bahawa pengoptimuman bersama model CRF dua langkah berfungsi serta atau lebih baik daripada JSCM, dan algoritma penyahkodan pantas mengurangkan masa penyahkodan dengan ketara. (3) Kaedah pembangunan pesat berdasarkan rangka kerja transduser keadaan terhingga berwajaran (WFST) untuk JSCM dicadangkan. (4) Gabungan model CRF dua langkah yang dicadangkan dan JSCM mengatasi hasil terkini dari segi ketepatan top-1.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Salinan
Dong YANG, Paul DIXON, Sadaoki FURUI, "A New Hybrid Method for Machine Transliteration" in IEICE TRANSACTIONS on Information,
vol. E93-D, no. 12, pp. 3377-3383, December 2010, doi: 10.1587/transinf.E93.D.3377.
Abstract: This paper proposes a new hybrid method for machine transliteration. Our method is based on combining a newly proposed two-step conditional random field (CRF) method and the well-known joint source channel model (JSCM). The contributions of this paper are as follows: (1) A two-step CRF model for machine transliteration is proposed. The first CRF segments a character string of an input word into chunks and the second one converts each chunk into a character in the target language. (2) A joint optimization method of the two-step CRF model and a fast decoding algorithm are also proposed. Our experiments show that the joint optimization of the two-step CRF model works as well as or even better than the JSCM, and the fast decoding algorithm significantly decreases the decoding time. (3) A rapid development method based on a weighted finite state transducer (WFST) framework for the JSCM is proposed. (4) The combination of the proposed two-step CRF model and JSCM outperforms the state-of-the-art result in terms of top-1 accuracy.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.E93.D.3377/_p
Salinan
@ARTICLE{e93-d_12_3377,
author={Dong YANG, Paul DIXON, Sadaoki FURUI, },
journal={IEICE TRANSACTIONS on Information},
title={A New Hybrid Method for Machine Transliteration},
year={2010},
volume={E93-D},
number={12},
pages={3377-3383},
abstract={This paper proposes a new hybrid method for machine transliteration. Our method is based on combining a newly proposed two-step conditional random field (CRF) method and the well-known joint source channel model (JSCM). The contributions of this paper are as follows: (1) A two-step CRF model for machine transliteration is proposed. The first CRF segments a character string of an input word into chunks and the second one converts each chunk into a character in the target language. (2) A joint optimization method of the two-step CRF model and a fast decoding algorithm are also proposed. Our experiments show that the joint optimization of the two-step CRF model works as well as or even better than the JSCM, and the fast decoding algorithm significantly decreases the decoding time. (3) A rapid development method based on a weighted finite state transducer (WFST) framework for the JSCM is proposed. (4) The combination of the proposed two-step CRF model and JSCM outperforms the state-of-the-art result in terms of top-1 accuracy.},
keywords={},
doi={10.1587/transinf.E93.D.3377},
ISSN={1745-1361},
month={December},}
Salinan
TY - JOUR
TI - A New Hybrid Method for Machine Transliteration
T2 - IEICE TRANSACTIONS on Information
SP - 3377
EP - 3383
AU - Dong YANG
AU - Paul DIXON
AU - Sadaoki FURUI
PY - 2010
DO - 10.1587/transinf.E93.D.3377
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E93-D
IS - 12
JA - IEICE TRANSACTIONS on Information
Y1 - December 2010
AB - This paper proposes a new hybrid method for machine transliteration. Our method is based on combining a newly proposed two-step conditional random field (CRF) method and the well-known joint source channel model (JSCM). The contributions of this paper are as follows: (1) A two-step CRF model for machine transliteration is proposed. The first CRF segments a character string of an input word into chunks and the second one converts each chunk into a character in the target language. (2) A joint optimization method of the two-step CRF model and a fast decoding algorithm are also proposed. Our experiments show that the joint optimization of the two-step CRF model works as well as or even better than the JSCM, and the fast decoding algorithm significantly decreases the decoding time. (3) A rapid development method based on a weighted finite state transducer (WFST) framework for the JSCM is proposed. (4) The combination of the proposed two-step CRF model and JSCM outperforms the state-of-the-art result in terms of top-1 accuracy.
ER -