<?xml version="1.0" encoding="utf-8"?>
<journal>
<title>Signal and Data Processing</title>
<title_fa>پردازش علائم و داده‌ها</title_fa>
<short_title>JSDP</short_title>
<subject>Engineering &amp; Technology</subject>
<web_url>http://jsdp.rcisp.ac.ir</web_url>
<journal_hbi_system_id>1</journal_hbi_system_id>
<journal_hbi_system_user>admin</journal_hbi_system_user>
<journal_id_issn>2538-4201</journal_id_issn>
<journal_id_issn_online>2538-421X</journal_id_issn_online>
<journal_id_pii></journal_id_pii>
<journal_id_doi>10.61882/jsdp</journal_id_doi>
<journal_id_iranmedex></journal_id_iranmedex>
<journal_id_magiran></journal_id_magiran>
<journal_id_sid>1</journal_id_sid>
<journal_id_nlai>8888</journal_id_nlai>
<journal_id_science></journal_id_science>
<language>fa</language>
<pubdate>
	<type>jalali</type>
	<year>1398</year>
	<month>9</month>
	<day>1</day>
</pubdate>
<pubdate>
	<type>gregorian</type>
	<year>2019</year>
	<month>12</month>
	<day>1</day>
</pubdate>
<volume>16</volume>
<number>3</number>
<publish_type>online</publish_type>
<publish_edition>1</publish_edition>
<article_type>fulltext</article_type>
<articleset>
	<article>


	<language>fa</language>
	<article_id_doi></article_id_doi>
	<title_fa>بررسی روش‌های مؤثر بر عملکرد تجزیه‌گر دستور مستقل از متن آماری زبان فارسی</title_fa>
	<title>Studying impressive parameters on the performance of Persian probabilistic context free grammar parser</title>
	<subject_fa>مقالات پردازش متن </subject_fa>
	<subject>Paper</subject>
	<content_type_fa>پژوهشي</content_type_fa>
	<content_type>Research</content_type>
	<abstract_fa>&lt;p style=&quot;text-align: justify;&quot;&gt;&lt;strong background-color:=&quot;&quot; font-size:=&quot;&quot; sans=&quot;&quot; source=&quot;&quot; style=&quot;color: rgb(51, 51, 51); font-family: Tahoma, Arial, Verdana, &quot; text-align:=&quot;&quot;&gt;&lt;span style=&quot;color: black;&quot;&gt;&lt;span b=&quot;&quot; style=&quot;font-family: &quot;&gt;عدم دقّت در طراحی دستورهای مستقل از متن و استفاده از ساختارهای نامناسب مانند فرم نرمال چامسکی به خودی خود می&amp;shy;&#8204;تواند عملکرد تجزیه&#8204;&amp;zwj;&amp;shy;گرهای آماری مستقل از متن را تضعیف کند. در این پژوهش ساختار ترکیبات عطفی درخت&#8204;بانک فارسی را مورد بررسی قرار دادیم. نتایج حاصل از این پژوهش نشان می&#8204;&amp;shy;دهد که با اضافه&#8204;کردن وابستگی&amp;shy;&#8204;های ساختاری به دستورهای مستقل از متن و اصلاح قواعد اولیه، می&#8204;&#8204;توان از ترکیبات عطفی رفع ابهام کرد و صحت عملکرد تجزیه&amp;shy;&#8204;گر دستور مستقل از متن آماری را افزایش داد. فرض استقلال ضعیف، یکی از مشکلات مربوط به دستورهای مستقل از متن است که سعی شده است تا با تزریق وابستگی&amp;shy;&#8204;های ساختاری از طریق نشانه&amp;shy;&#8204;گذاری گره&#8204;&amp;shy;های والد و فرزند مرتفع شود. تأثیر ریزدانگی و درشت&amp;shy;دانگی برچسب&#8204;&amp;shy;های اجزای واژگانی کلام و همین&#8204;طور ادغام ناپایانه&amp;shy;&#8204;ها بر تجزیه&#8204;&amp;shy;گر دستور مستقل از متن آماری فارسی از جمله مواردِ مورد بررسی قرار گرفته&#8204;شده در این پژوهش&amp;shy; است.&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;</abstract_fa>
	<abstract>&lt;p style=&quot;text-align: justify;&quot;&gt;&lt;strong&gt;In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. For example, annotated tree bank data has been crucial in syntactic research to test linguistic theories of sentence structure against large quantities of naturally occurring examples&lt;span dir=&quot;RTL&quot;&gt;.&lt;/span&gt;&lt;br&gt;
The natural language parser consists of two basic parts, POS tagger and the syntax parser. A Part-Of-Speech Tagger (POS Tagger) is a piece of software that reads text in some languages and assigns parts of speech to each word (and other token), such as noun, verb, adjective, etc., although generally computational applications use more fine-grained POS tags like &amp;#39;noun-plural&amp;#39;. A natural language parser is a program that works out the grammatical structure of sentences, for instance, which groups of words go together (as &amp;quot;phrases&amp;quot;) and which words are the subject or object of a verb&lt;span dir=&quot;RTL&quot;&gt;.&lt;/span&gt;&lt;br&gt;
Probabilistic parsers use knowledge of language gained from hand-parsed sentences to try to produce the most likely analysis of new sentences. These statistical parsers still make some mistakes, but commonly work rather well. Inaccurate design of context-free grammars and using bad structures such as Chomsky normal form can reduce accuracy of probabilistic context-free grammar parser&lt;span dir=&quot;RTL&quot;&gt;. &lt;/span&gt;&lt;br&gt;
Weak independence assumption is one of the problems related to CFG. We have tried to improve this problem with parent and child annotation, which copies the label of a parent node onto the labels of its children, and it can improve the performance of a PCFG&lt;span dir=&quot;RTL&quot;&gt;.&lt;/span&gt;&lt;br&gt;
In grammar, a conjunction (conj) is a part of speech that connects words, phrases, or clauses that are called the conjuncts of the conjunctions. In this study, we examined the conjunction phrases in the Persian tree bank. The results of this study show that adding structural dependencies to grammars and modifying the basic rules can remove conjunction ambiguity and increase accuracy of probabilistic context-free grammar parser&lt;span dir=&quot;RTL&quot;&gt;.&lt;/span&gt;&lt;br&gt;
When a part-of-speech (PoS) tagger assigns word class labels to tokens, it has to select from a set of possible labels whose size usually ranges from fifty to several hundred labels depending on the language. In this study, we have investigated the effect of fine and coarse grain POS tags and merging non-terminals on Persian PCFG parser.&lt;/strong&gt;&lt;/p&gt;</abstract>
	<keyword_fa>دستور مستقل از متن آماری, تجزیه‌گر, ترکیبات عطفی, نشانه‌گذاری قواعد, برچسب اجزای واژگانی کلام</keyword_fa>
	<keyword>Probabilistic context free grammar, parser, tree bank, conjunction phrases, parent annotation, child annotation, part of speech tags</keyword>
	<start_page>36</start_page>
	<end_page>23</end_page>
	<web_url>http://jsdp.rcisp.ac.ir/browse.php?a_code=A-10-794-1&amp;slc_lang=fa&amp;sid=1</web_url>


<author_list>
	<author>
	<first_name>mohammadbagher</first_name>
	<middle_name></middle_name>
	<last_name>sadeghzadeh</last_name>
	<suffix></suffix>
	<first_name_fa>محمدباقر</first_name_fa>
	<middle_name_fa></middle_name_fa>
	<last_name_fa>صادق زاده</last_name_fa>
	<suffix_fa></suffix_fa>
	<email>mohammadbaghersadeghzadeh@gmail.com</email>
	<code>10031947532846008234</code>
	<orcid>10031947532846008234</orcid>
	<coreauthor>Yes
</coreauthor>
	<affiliation>amirkabir university of technology</affiliation>
	<affiliation_fa>دانشگاه صنعتی امیرکبیر</affiliation_fa>
	 </author>


	<author>
	<first_name>mohammadreza</first_name>
	<middle_name></middle_name>
	<last_name>razzazi</last_name>
	<suffix></suffix>
	<first_name_fa>محمدرضا</first_name_fa>
	<middle_name_fa></middle_name_fa>
	<last_name_fa>رزازی</last_name_fa>
	<suffix_fa></suffix_fa>
	<email>mohammadbaghersadeghzadeh@gmail.com</email>
	<code>10031947532846008235</code>
	<orcid>10031947532846008235</orcid>
	<coreauthor>No</coreauthor>
	<affiliation>amirkabir university of technology</affiliation>
	<affiliation_fa>دانشگاه صنعتی امیرکبیر</affiliation_fa>
	 </author>


	<author>
	<first_name>Masood</first_name>
	<middle_name></middle_name>
	<last_name>ghayoomi</last_name>
	<suffix></suffix>
	<first_name_fa>مسعود</first_name_fa>
	<middle_name_fa></middle_name_fa>
	<last_name_fa>قیومی</last_name_fa>
	<suffix_fa></suffix_fa>
	<email>mohammadbaghersadeghzadeh@gmail.com</email>
	<code>10031947532846008236</code>
	<orcid>10031947532846008236</orcid>
	<coreauthor>No</coreauthor>
	<affiliation>free university of berlin</affiliation>
	<affiliation_fa>پژوهشگاه علوم انسانی و مطالعات فرهنگی</affiliation_fa>
	 </author>


</author_list>


	</article>
</articleset>
</journal>
