The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. ex. Some numerals are expressed as "XNUMX".
Copyrights notice
The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. Copyrights notice
Pemprosesan kapsyen (pawagam) yang menambah teks deskriptif pada jujukan bingkai ialah fungsi manipulasi video penting yang harus disokong oleh editor video. Makalah ini mencadangkan pendekatan domain mampat MC-DCT yang cekap untuk memasukkan kapsyen ke dalam aliran video termampat MPEG. Ia pada asasnya menambah blok DCT imej kapsyen kepada blok DCT yang sepadan bagi bingkai input satu demi satu dalam domain MC-DCT seperti dalam [6]. Walau bagaimanapun, kekuatan imej kapsyen dilaraskan dalam domain DCT untuk mengelakkan pekali DCT yang terhasil daripada melebihi nilai maksimum yang dibenarkan dalam MPEG. Untuk melaraskan kekuatan imej kapsyen secara adaptif, kita perlu mengetahui nilai piksel yang tepat bagi imej input. Ini adalah tugas yang sukar dalam domain DCT. Kami mencadangkan skema anggaran untuk nilai piksel yang mana nilai DC blok digunakan sebagai nilai piksel yang dijangkakan untuk semua piksel dalam blok itu. Walaupun anggaran ini mungkin membawa kepada beberapa ralat dalam kawasan kapsyen, ia masih memberikan kualiti imej yang agak tinggi dalam kawasan bukan kapsyen, manakala masa pemprosesan adalah kira-kira 4.9 kali lebih pantas daripada kaedah nyahkod-kapsyen-pengekodan semula.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Salinan
Jongho NANG, Seungwook HONG, Ohyeong KWON, "An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain" in IEICE TRANSACTIONS on Communications,
vol. E84-B, no. 8, pp. 2292-2300, August 2001, doi: .
Abstract: The (cinema) caption processing that adds descriptive text on a sequence of frames is an important video manipulation function that a video editor should support. This paper proposes an efficient MC-DCT compressed domain approach to insert the caption into the MPEG compressed video stream. It basically adds the DCT blocks of the caption image to the corresponding DCT blocks of the input frames one by one in the MC-DCT domain as in [6]. However, the strength of the caption image is adjusted in the DCT domain to prevent the resulting DCT coefficients from exceeding the maximum value allowed in MPEG. In order to adjust the strength of the caption image adaptively we need to know the exact pixel value of the input image. This is a difficult task in DCT domain. We propose an approximation scheme for the pixel values in which the DC value of a block is used as the expected pixel value for all pixels in that block. Although this approximation may lead to some errors in the caption area, it still provides a relatively high image quality in the non-caption area, whereas the processing time is about 4.9 times faster than the decode-captioning-reencode method.
URL: https://global.ieice.org/en_transactions/communications/10.1587/e84-b_8_2292/_p
Salinan
@ARTICLE{e84-b_8_2292,
author={Jongho NANG, Seungwook HONG, Ohyeong KWON, },
journal={IEICE TRANSACTIONS on Communications},
title={An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain},
year={2001},
volume={E84-B},
number={8},
pages={2292-2300},
abstract={The (cinema) caption processing that adds descriptive text on a sequence of frames is an important video manipulation function that a video editor should support. This paper proposes an efficient MC-DCT compressed domain approach to insert the caption into the MPEG compressed video stream. It basically adds the DCT blocks of the caption image to the corresponding DCT blocks of the input frames one by one in the MC-DCT domain as in [6]. However, the strength of the caption image is adjusted in the DCT domain to prevent the resulting DCT coefficients from exceeding the maximum value allowed in MPEG. In order to adjust the strength of the caption image adaptively we need to know the exact pixel value of the input image. This is a difficult task in DCT domain. We propose an approximation scheme for the pixel values in which the DC value of a block is used as the expected pixel value for all pixels in that block. Although this approximation may lead to some errors in the caption area, it still provides a relatively high image quality in the non-caption area, whereas the processing time is about 4.9 times faster than the decode-captioning-reencode method.},
keywords={},
doi={},
ISSN={},
month={August},}
Salinan
TY - JOUR
TI - An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain
T2 - IEICE TRANSACTIONS on Communications
SP - 2292
EP - 2300
AU - Jongho NANG
AU - Seungwook HONG
AU - Ohyeong KWON
PY - 2001
DO -
JO - IEICE TRANSACTIONS on Communications
SN -
VL - E84-B
IS - 8
JA - IEICE TRANSACTIONS on Communications
Y1 - August 2001
AB - The (cinema) caption processing that adds descriptive text on a sequence of frames is an important video manipulation function that a video editor should support. This paper proposes an efficient MC-DCT compressed domain approach to insert the caption into the MPEG compressed video stream. It basically adds the DCT blocks of the caption image to the corresponding DCT blocks of the input frames one by one in the MC-DCT domain as in [6]. However, the strength of the caption image is adjusted in the DCT domain to prevent the resulting DCT coefficients from exceeding the maximum value allowed in MPEG. In order to adjust the strength of the caption image adaptively we need to know the exact pixel value of the input image. This is a difficult task in DCT domain. We propose an approximation scheme for the pixel values in which the DC value of a block is used as the expected pixel value for all pixels in that block. Although this approximation may lead to some errors in the caption area, it still provides a relatively high image quality in the non-caption area, whereas the processing time is about 4.9 times faster than the decode-captioning-reencode method.
ER -