The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. ex. Some numerals are expressed as "XNUMX".
Copyrights notice
The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. Copyrights notice
Kertas kerja ini membentangkan model pengujaan glottal pseudo untuk jenis vocoder ramalan linear dengan pertuturan dikodkan pada 1.6 kbps. Walaupun selang pertuturan dan senyap tidak bersuara diproses dengan buku kod stokastik sebanyak 512 entri, buku kod glotal dengan 32 entri untuk pengujaan bersuara digunakan untuk menerangkan ciri fasa glotal. Langkah-langkah merumuskan pengujaan glottal pseudo untuk satu tempoh pic terdiri daripada 1) menggunakan model polinomial untuk mensimulasikan juzuk frekuensi rendah baki, 2) memasukkan urutan nadi boleh laras magnitud untuk mencirikan pengujaan utama, dan 3) memperkenalkan turbulen bunyi bersiri dengan pengujaan yang terhasil. Prosedur diterangkan untuk pembinaan buku kod sebagai tambahan kepada analisis dan sintesis pengujaan pseudo glottal. Keputusan dalam ujian skor min pendapat (MOS) menunjukkan bahawa kualiti yang dihasilkan oleh pengekod yang dicadangkan adalah hampir sama baiknya dengan pengekod CELP 4.8 kbps untuk ujaran lelaki, tetapi kualiti untuk ujaran wanita masih agak rendah.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Salinan
Hwai-Tsu HU, Fang-Jang KUO, Hsin-Jen WANG, "A Pseudo Glottal Excitation Model for the Linear Prediction Vocoder with Speech Signals Coded at 1.6 kbps" in IEICE TRANSACTIONS on Information,
vol. E83-D, no. 8, pp. 1654-1661, August 2000, doi: .
Abstract: This paper presents a pseudo glottal excitation model for the type of linear prediction vocoders with speech being coded at 1.6 kbps. While unvoiced speech and silence intervals are processed with a stochastic codebook of 512 entries, a glottal codebook with 32 entries for voiced excitation is used to describe the glottal phase characteristics. Steps of formulating the pseudo glottal excitation for one pitch period consist of 1) applying a polynomial model to simulate the low-frequency constituent of the residual, 2) inserting a magnitude-adjustable pulse sequence to characterize the main excitation, and 3) introducing turbulent noise in series with the resulting excitation. Procedures are described for codebook construction in addition to analysis and synthesis of the pseudo glottal excitation. Results in a mean opinion score (MOS) test show that the quality produced by the proposed coder is almost as good as that by 4.8 kbps CELP coder for male utterances, but the quality for female utterances is yet somewhat inferior.
URL: https://global.ieice.org/en_transactions/information/10.1587/e83-d_8_1654/_p
Salinan
@ARTICLE{e83-d_8_1654,
author={Hwai-Tsu HU, Fang-Jang KUO, Hsin-Jen WANG, },
journal={IEICE TRANSACTIONS on Information},
title={A Pseudo Glottal Excitation Model for the Linear Prediction Vocoder with Speech Signals Coded at 1.6 kbps},
year={2000},
volume={E83-D},
number={8},
pages={1654-1661},
abstract={This paper presents a pseudo glottal excitation model for the type of linear prediction vocoders with speech being coded at 1.6 kbps. While unvoiced speech and silence intervals are processed with a stochastic codebook of 512 entries, a glottal codebook with 32 entries for voiced excitation is used to describe the glottal phase characteristics. Steps of formulating the pseudo glottal excitation for one pitch period consist of 1) applying a polynomial model to simulate the low-frequency constituent of the residual, 2) inserting a magnitude-adjustable pulse sequence to characterize the main excitation, and 3) introducing turbulent noise in series with the resulting excitation. Procedures are described for codebook construction in addition to analysis and synthesis of the pseudo glottal excitation. Results in a mean opinion score (MOS) test show that the quality produced by the proposed coder is almost as good as that by 4.8 kbps CELP coder for male utterances, but the quality for female utterances is yet somewhat inferior.},
keywords={},
doi={},
ISSN={},
month={August},}
Salinan
TY - JOUR
TI - A Pseudo Glottal Excitation Model for the Linear Prediction Vocoder with Speech Signals Coded at 1.6 kbps
T2 - IEICE TRANSACTIONS on Information
SP - 1654
EP - 1661
AU - Hwai-Tsu HU
AU - Fang-Jang KUO
AU - Hsin-Jen WANG
PY - 2000
DO -
JO - IEICE TRANSACTIONS on Information
SN -
VL - E83-D
IS - 8
JA - IEICE TRANSACTIONS on Information
Y1 - August 2000
AB - This paper presents a pseudo glottal excitation model for the type of linear prediction vocoders with speech being coded at 1.6 kbps. While unvoiced speech and silence intervals are processed with a stochastic codebook of 512 entries, a glottal codebook with 32 entries for voiced excitation is used to describe the glottal phase characteristics. Steps of formulating the pseudo glottal excitation for one pitch period consist of 1) applying a polynomial model to simulate the low-frequency constituent of the residual, 2) inserting a magnitude-adjustable pulse sequence to characterize the main excitation, and 3) introducing turbulent noise in series with the resulting excitation. Procedures are described for codebook construction in addition to analysis and synthesis of the pseudo glottal excitation. Results in a mean opinion score (MOS) test show that the quality produced by the proposed coder is almost as good as that by 4.8 kbps CELP coder for male utterances, but the quality for female utterances is yet somewhat inferior.
ER -