Signal and Data Processing
پردازش علائم و دادهها
JSDP
Engineering & Technology
http://jsdp.rcisp.ac.ir
1
admin
2538-4201
2538-421X
10.52547/jsdp
1
8888
fa
jalali
1398
12
1
gregorian
2020
3
1
16
4
online
1
fulltext
fa
بازشناسی مرز واژگان در گفتار پیوسته فارسی از روی منحنی زیروبمی
Word segmentation in Persian continuous speech using F0 contour
مقالات پردازش گفتار
Paper
پژوهشي
Research
<div style="text-align: justify;"><strong><span style="font-family:b nazanin;"><span style="font-size:10.0pt;">یکی از فعالیتهای شناختی پیچیده در چارچوب نظام آوایی زبان این است که اهل زبان قادرند گفتار پیوسته را بهصورت زنجیره واژگان گسسته درک کنند. یافتههای پیشین مطالعات آزمایشگاهی بر روی زبان فارسی و دیگر زبانها نشان داده است، </span></span></strong><strong><span style="color:black;"><span style="font-family:b nazanin;"><span style="font-size:10.0pt;">در زبان­هایی که در آنها تکیه بهطور ثابت (یا با فراوانی وقوع زیاد) در مرز آغازی یا پایانی واژه قرار می­گیرد، شنونده­ها از نشانه­های آکوستیکی تکیه برای تقطیع گفتار پیوسته به واژگان سازنده آن استفاده می­کنند. همچنین، اینگونه فرض شده است که حضور تکیه در جایگاهی غیر از مرز آغازی یا پایانی واژه مانع از کارکرد مرزنمایی این عامل نوایی میشود. در زبان فارسی حضور واژه­بست در واژه باعث می­شود که تکیه در جایگاهی غیر از پایان واژه واقع شود. پژوهش حاضر با هدف پاسخگویی به یک سؤال اساسی درباره نحوه پردازش درکی گفتار پیوسته فارسی انجام شد: آیا مرز پایانی واژگان (اعم از واژگان حاوی واژه­بست و واژگان فاقد واژه­بست) با توجه به ساخت نواختی واژگان در دستور واجی آهنگ فارسی برای شنونده قابل شناسایی است؟ برای این منظور </span></span></span></strong><strong><span style="color:black;"><span style="font-family:b nazanin;"><span style="font-size:10.0pt;">دو</span></span></span></strong><strong><span style="color:black;"><span style="font-family:b nazanin;"><span style="font-size:10.0pt;">آزمایش شنیداری انجام شد. نتایج این آزمایش­ها نشان داد که </span></span></span></strong><strong><span style="font-family:b nazanin;"><span style="font-size:10.0pt;">شنونده هر نقطه پایانی </span></span></strong><strong><span dir="LTR"><span style="font-family:times new roman,serif;"><span style="font-size:8.0pt;">H</span></span></span></strong><strong><span style="font-family:b nazanin;"><span style="font-size:10.0pt;"> (در یک گستره نواختی </span></span></strong><strong><span dir="LTR"><span style="font-family:times new roman,serif;"><span style="font-size:8.0pt;">H</span></span></span></strong><strong><span style="font-family:b nazanin;"><span style="font-size:10.0pt;">) در منحنی آهنگ گفتار فارسی را بهصورت مرز پایانی یک واژه شناسایی می­کند. همچنین نتایج بهدستآمده نشان داد که درک شنیداری الگوی برجستگی نوایی وابسته به محل وقوع قله </span></span></strong><strong><span dir="LTR"><span style="font-family:times new roman,serif;"><span style="font-size:8.0pt;">H</span></span></span></strong><strong><span style="font-family:b nazanin;"><span style="font-size:10.0pt;"> تکیه زیروبمی است.</span></span></strong><br>
</div>
<div style="text-align: justify;"><strong>Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retracted to a non-final position in words containing enclitic affixes. </strong><br>
<strong>The present research explores the question as to whether Persian listeners are able to identify word boundaries given the tonal structure of words in Persian phonology or not. The paper was also intended to investigate to what extent Persian native speakers use H peaks to identify word stress pattern. Two perceptual experiments were conducted in this regard. Given the tonal structure of words in utterance non-final position in Persian, it was hypothesized that listeners are likely to identify the end of a high plateau as a cue to word boundary. In addition, given that peaks in utterance non-final position are delayed, it was further hypothesized that perceived prominent is likely to be attributed to a syllable that precedes another syllable carrying a pitch peak. </strong><br>
<strong>The basic stimulus for the first experiment was a nonsense sequence of nine “</strong><strong>dA</strong><strong>”</strong> <strong>syllables with equal duration (</strong><strong>[</strong><strong>dA</strong><strong><sub>1</sub></strong><strong>.</strong><strong>dA</strong><strong><sub>2</sub></strong><strong>.</strong><strong>dA</strong><strong><sub>3</sub></strong><strong>.</strong><strong>dA</strong><strong><sub>4</sub></strong><strong>.</strong><strong>dA</strong><strong><sub>5</sub></strong><strong>.</strong><strong>dA</strong><strong><sub>6</sub></strong><strong>.</strong><strong>dA</strong><strong><sub>7</sub></strong><strong>.</strong><strong>dA</strong><strong><sub>8</sub></strong><strong>.</strong><strong>dA</strong><strong><sub>9</sub></strong><strong>]</strong><strong>) across the syllables. The peak was located at the beginning of the consonant in </strong><strong>[</strong><strong>dA</strong><strong><sub>4</sub></strong><strong>] in the stimulus.</strong><strong> The duration of the H plateau following the H peak was varied continuously to create 6 different stimuli with varying temporal plateau. The stimuli were presented randomly to 10 native speakers of Persian. The participants were asked to chunk the sequence of identical syllables they hear into two parts as if they were two independent words. They were also asked to identify the most prominent syllable in a separate identification test. The results showed that the ending point of a high H plateau acts as a prosodic cue to word boundary detection in Persian. For example, when the end of the H plateau was located on the end of the vowel in</strong> <strong>dA</strong><strong><sub>4</sub></strong><strong>,</strong><strong> listeners identified the end of </strong><strong>dA</strong><strong><sub>4 </sub></strong><strong>as boundary between two hypothetical words. However, when the end of the plateau was located on the end of the vowel in</strong> <strong>dA</strong><strong><sub>5</sub></strong><strong> or the beginning of the consonants in </strong><strong>.</strong><strong>dA</strong><strong><sub>6 </sub></strong><strong>listeners identified the end of </strong><strong>dA</strong><strong><sub>5</sub></strong><strong> as the word final boundary. </strong><strong>The results of this experiment further revealed that listeners are sensitive to the position of H peaks to identify within-word position of prominence in Persian. Listeners consistently identified </strong><strong>dA</strong><strong><sub>3 </sub></strong><strong>as the most prominent syllable as this syllable preceded </strong><strong>dA</strong><strong><sub>4</sub></strong> <strong>on which the peak was located, and the rate of their identification was not affected by the duration of H plateau following the pitch peak.</strong><br>
<strong>In the second experiment, listeners’ ability to use F0 contour as a cue to word boundary was tested on resynthesized speech in which the spectral properties of the signals were intentionally deformed. The results replicated the findings previously obtained for the first experiment, indicating that the end of a high plateau acts as a robust cue to word boundary detection in Persian.</strong></div>
مرز واژه, آهنگ گفتار, برجستگی نوایی, گستره نواختی H, محل وقوع قله
word boundary, intonation, prosodic prominence, H plateau, position of H peaks
135
150
http://jsdp.rcisp.ac.ir/browse.php?a_code=A-10-34-1&slc_lang=fa&sid=1
Vahid
sadeghi
وحید
صادقی
vsadeghi5603@gmail.com
10031947532846008602
10031947532846008602
Yes
Imam Khomeini International university
دانشگاه بینالمللی امام خمینی