The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. ex. Some numerals are expressed as "XNUMX".
This paper proposes a low-complexity speech mixing method for multimedia conferences that works with speech codecs based on predictive coding. The proposed method applies a filter state management (FSM) technique to a partial mixing method in order to avoid inconsistency in the filter states of the encoders. The inconsistency arises because the encoders are switched whenever the set of speakers to be mixed changes. Subjective evaluations of speech quality show that the proposed method avoids the inconsistency, achieving significantly higher speech quality than the conventional partial mixing method without FSM and almost the same speech quality as the full mixing method. Complexity evaluations show that the proposed method achieves much lower complexity than the full mixing method.
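The abstract gives no implementation details, so the following is only a toy sketch of the underlying idea: why switching encoders can leave a listener's decoder with a mismatched filter state, and how carrying the state over at the switch removes the mismatch. It assumes a first-order DPCM codec as a stand-in for a real predictive speech codec, and it feeds the same signal to both encoders to isolate the state effect. The names `DpcmCodec` and `worst_error_after_switch` are hypothetical and do not come from the paper.

```python
import math
from dataclasses import dataclass


@dataclass
class DpcmCodec:
    """Toy first-order predictive codec (assumption, not the paper's codec).

    `state` is the predictor memory, i.e. the "filter state" that the
    encoder and the remote decoder must keep identical.
    """
    state: float = 0.0
    coeff: float = 0.9   # prediction coefficient
    step: float = 0.05   # quantizer step size

    def encode(self, x: float) -> int:
        pred = self.coeff * self.state
        q = round((x - pred) / self.step)    # quantized prediction residual
        self.state = pred + q * self.step    # track what the decoder will rebuild
        return q

    def decode(self, q: int) -> float:
        self.state = self.coeff * self.state + q * self.step
        return self.state


def worst_error_after_switch(use_fsm: bool, n: int = 100, switch_at: int = 50) -> float:
    """Return the largest decoding error observed after the encoder switch."""
    shared = DpcmCodec()       # encoder the listener is served by before the switch
    individual = DpcmCodec()   # encoder the listener is served by after the switch
    decoder = DpcmCodec()      # decoder in the listener's terminal
    worst = 0.0
    for i in range(n):
        x = 0.5 + 0.4 * math.sin(2 * math.pi * i / 25)
        if i == switch_at and use_fsm:
            # State management step of this sketch: hand the old encoder's
            # predictor memory to the new encoder so it matches the decoder.
            individual.state = shared.state
        encoder = shared if i < switch_at else individual
        y = decoder.decode(encoder.encode(x))
        if i >= switch_at:
            worst = max(worst, abs(y - x))
    return worst


if __name__ == "__main__":
    print("worst error without state copy:", worst_error_after_switch(False))
    print("worst error with state copy:   ", worst_error_after_switch(True))
```

With the state copied, the decoding error in this toy stays within half a quantizer step; without it, the decoder starts from a memory the new encoder never saw, and the error is larger until the leaky predictor forgets the mismatch.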
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Hironori ITO, Kazunori OZAWA, "Low Complexity Speech Mixing with Speech Codecs Based on Predictive Coding for Multimedia Conferences" in IEICE TRANSACTIONS on Communications,
vol. E92-B, no. 7, pp. 2477-2483, July 2009, doi: 10.1587/transcom.E92.B.2477.
Abstract: This paper proposes a method of low complexity speech mixing with speech codecs based on predictive coding for multimedia conferences. The proposed method applies a filter state management (FSM) technique to a partial mixing method in order to avoid inconsistency of the filter states of encoders. The inconsistency is created by switching of the encoders when the speakers to be mixed are switched. The results of subjective evaluations of speech quality show that the proposed method avoids the inconsistency, and achieves significantly higher speech quality than the conventional partial mixing method without the FSM and almost the same speech quality as the full mixing method. The complexity evaluation results show that the proposed method achieves much lower complexity than the full mixing method.
URL: https://global.ieice.org/en_transactions/communications/10.1587/transcom.E92.B.2477/_p
@ARTICLE{e92-b_7_2477,
author={Hironori ITO and Kazunori OZAWA},
journal={IEICE TRANSACTIONS on Communications},
title={Low Complexity Speech Mixing with Speech Codecs Based on Predictive Coding for Multimedia Conferences},
year={2009},
volume={E92-B},
number={7},
pages={2477--2483},
abstract={This paper proposes a method of low complexity speech mixing with speech codecs based on predictive coding for multimedia conferences. The proposed method applies a filter state management (FSM) technique to a partial mixing method in order to avoid inconsistency of the filter states of encoders. The inconsistency is created by switching of the encoders when the speakers to be mixed are switched. The results of subjective evaluations of speech quality show that the proposed method avoids the inconsistency, and achieves significantly higher speech quality than the conventional partial mixing method without the FSM and almost the same speech quality as the full mixing method. The complexity evaluation results show that the proposed method achieves much lower complexity than the full mixing method.},
keywords={},
doi={10.1587/transcom.E92.B.2477},
ISSN={1745-1345},
month={July},
}
TY  - JOUR
TI  - Low Complexity Speech Mixing with Speech Codecs Based on Predictive Coding for Multimedia Conferences
T2  - IEICE TRANSACTIONS on Communications
SP  - 2477
EP  - 2483
AU  - Hironori ITO
AU  - Kazunori OZAWA
PY  - 2009
DO  - 10.1587/transcom.E92.B.2477
JO  - IEICE TRANSACTIONS on Communications
SN  - 1745-1345
VL  - E92-B
IS  - 7
JA  - IEICE TRANSACTIONS on Communications
Y1  - 2009/07//
AB  - This paper proposes a method of low complexity speech mixing with speech codecs based on predictive coding for multimedia conferences. The proposed method applies a filter state management (FSM) technique to a partial mixing method in order to avoid inconsistency of the filter states of encoders. The inconsistency is created by switching of the encoders when the speakers to be mixed are switched. The results of subjective evaluations of speech quality show that the proposed method avoids the inconsistency, and achieves significantly higher speech quality than the conventional partial mixing method without the FSM and almost the same speech quality as the full mixing method. The complexity evaluation results show that the proposed method achieves much lower complexity than the full mixing method.
ER  - 