<?xml version="1.0" encoding="utf-8"?>
<journal>
<title>Signal and Data Processing</title>
<title_fa>پردازش علائم و داده‌ها</title_fa>
<short_title>JSDP</short_title>
<subject>Engineering &amp; Technology</subject>
<web_url>http://jsdp.rcisp.ac.ir</web_url>
<journal_hbi_system_id>1</journal_hbi_system_id>
<journal_hbi_system_user>admin</journal_hbi_system_user>
<journal_id_issn>2538-4201</journal_id_issn>
<journal_id_issn_online>2538-421X</journal_id_issn_online>
<journal_id_pii></journal_id_pii>
<journal_id_doi>10.61882/jsdp</journal_id_doi>
<journal_id_iranmedex></journal_id_iranmedex>
<journal_id_magiran></journal_id_magiran>
<journal_id_sid>1</journal_id_sid>
<journal_id_nlai>8888</journal_id_nlai>
<journal_id_science></journal_id_science>
<language>fa</language>
<pubdate>
	<type>jalali</type>
	<year>1400</year>
	<month>2</month>
	<day>1</day>
</pubdate>
<pubdate>
	<type>gregorian</type>
	<year>2021</year>
	<month>5</month>
	<day>1</day>
</pubdate>
<volume>18</volume>
<number>1</number>
<publish_type>online</publish_type>
<publish_edition>1</publish_edition>
<article_type>fulltext</article_type>
<articleset>
	<article>


	<language>fa</language>
	<article_id_doi></article_id_doi>
	<title_fa>ارائه مدلی برای تشخیص شایعات فارسی مبتنی بر تحلیل ویژگی‌های محتوایی در متن شبکه‌های اجتماعی</title_fa>
	<title>A Model for Detecting of Persian Rumors based on the Analysis of Contextual Features in the Content of Social Networks</title>
	<subject_fa>مقالات پردازش متن </subject_fa>
	<subject>Paper</subject>
	<content_type_fa>پژوهشي</content_type_fa>
	<content_type>Research</content_type>
	<abstract_fa>&lt;div style=&quot;text-align: justify;&quot;&gt;&lt;strong&gt;&lt;span style=&quot;font-family:B Nazanin;&quot;&gt;&lt;span style=&quot;font-size:10.0pt;&quot;&gt;شایعه یک تلاش جمعی است که در آن از قدرت واژگان برای تفسیر یک موقعیت مبهم&amp;rlm; ولی جذاب استفاده می&amp;shy;شود؛ بنابراین، شناسایی زبان شایعه می&amp;shy;تواند در تشخیص شایعات کمک&amp;shy;کننده باشد. پژوهش&#8204;های پیشین&amp;nbsp; برای حل مسأله تشخیص شایعه بیشتر بر روی اطلاعات متنی موجود در ریتوییت و توییت پاسخ کاربران و کمتر بر روی متن اصلی شایعه متمرکز شده&amp;shy;اند. اغلب این پژوهش&#8204;ها بر روی زبان انگلیسی بوده و کارهای محدودی در زبان فارسی انجام شده است؛ از این&amp;shy;رو، این مقاله تنها با تمرکز برروی متن اصلی شایعات فارسی و معرفی ویژگی&amp;shy;هایی با ارزش اطلاعات محتوایی بالا، مدلی مبتنی بر ویژگی&amp;shy;های محتوایی فیزیکی و غیرفیزیکی برای تشخیص شایعات فارسی منتشر&#8204;شده برروی توییتر و تلگرام ارائه می&#8204;کند. مدل پیشنهادی شایعات فارسی مجموعه&#8204;داده توییتر را با معیار-&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;strong&gt;&lt;span dir=&quot;LTR&quot;&gt;&lt;span style=&quot;font-size:8.0pt;&quot;&gt;F&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;strong&gt;&lt;span style=&quot;font-family:B Nazanin;&quot;&gt;&lt;span style=&quot;font-size:10.0pt;&quot;&gt;&amp;nbsp; 848/0، شایعات مجموعه&#8204;داده زلزله کرمانشاه را با معیار-&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;strong&gt;&lt;span dir=&quot;LTR&quot;&gt;&lt;span style=&quot;font-size:8.0pt;&quot;&gt;F&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;strong&gt;&lt;span style=&quot;font-family:B Nazanin;&quot;&gt;&lt;span style=&quot;font-size:10.0pt;&quot;&gt; 952/0&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt; &lt;strong&gt;&lt;span style=&quot;font-family:B Nazanin;&quot;&gt;&lt;span style=&quot;font-size:10.0pt;&quot;&gt;و شایعات تلگرامی را با معیار-&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;strong&gt;&lt;span dir=&quot;LTR&quot;&gt;&lt;span style=&quot;font-size:8.0pt;&quot;&gt;F&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;strong&gt;&lt;span style=&quot;font-family:B Nazanin;&quot;&gt;&lt;span style=&quot;font-size:10.0pt;&quot;&gt; 867/0 شناسایی کرده است؛ که نشان&#8204;دهنده توانمندی مدل پیشنهادی برای شناسایی شایعات تنها با تمرکز بر ویژگی&amp;shy;های محتوایی متن شایعه منبع است. &lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;strong&gt;&lt;span style=&quot;font-family:B Nazanin;&quot;&gt;&lt;span style=&quot;font-size:11.0pt;&quot;&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/div&gt;</abstract_fa>
	<abstract>&lt;p style=&quot;margin: 0px; text-align: justify; unicode-bidi: embed; direction: ltr;&quot;&gt;&lt;strong&gt;The rumor is a collective attempt to interpret a vague but attractive situation by using the power of words. Therefore, identifying the rumor language can be helpful in identifying it. The previous research has focused more on the contextual information to reply tweets and less on the content features of the original rumor to address the rumor detection problem. Most of the studies have been in the English language, but more limited work has been done in the Persian language to detect rumors. This study analyzed the content of the original rumor and introduced informative content features to early identify Persian rumors (i.e., when it is published on news media but has not yet spread on social media) on Twitter and Telegram. Therefore, the proposed model is based on physical and non-physical content features in three categories including, lexical, syntactic, and pragmatic. These features are a combination of the common content features along with the proposed new content-based features. Since no social context information is available at the time of posting rumors, the proposed model is independent of propagation-based features and relies on the content-based information of the original rumor. Although in the proposed model, much information (including user information, the user&amp;#39;s reaction to the rumor, and propagation structures) are ignored, but helpful content information can be obtained for classification by content analysis of the original rumor.&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;Several experiments have been performed on the various combinations of feature sets (i.e., common and proposed content features) to explore the capability of features in distinguishing rumors and non-rumors separately and jointly. To this end, three machine learning algorithms including, Random Forest (RF), AdaBoost, and Support Vector Machine (SVM) have been used as strong classifications to evaluate the accuracy of the proposed model. To achieve the best performance of classification algorithms on the training dataset, it is necessary to use feature selection techniques. In this study, the Sequential Forward Floating Search (SFFS) approach has been used to select valuable features. Also, the statistical results of the t-test on the P-value (&lt;=0.05) demonstrate that most of the new features proposed in this study reveal statistically significant differences between rumor and non-rumor documents. The experimental results are shown the performance of new proposed features to improve the accuracy of the rumor detection. The F-measure of the proposed model to detect Persian rumors on the Twitter dataset was 0.848, on the Kermanshah earthquake dataset was 0.952 and on the Telegram dataset was 0.867, which indicated the ability of the proposed method to identify rumors only by focusing on the content features of the original rumor text. The results of evaluating the proposed model on Twitter rumors show that, despite the short length of Twitter tweets and the extraction of limited content information from tweets, the proposed model can detect Twitter rumors with acceptable accuracy. Hence, the ability of content features to distinguish rumors from non-rumors is proven.&lt;/strong&gt;&lt;/p&gt;</abstract>
	<keyword_fa>تشخیص شایعات فارسی, تحلیل محتوی, ویژگی‌های محتوایی فیزیکی و غیرفیزیکی, پردازش متن</keyword_fa>
	<keyword>Persian rumors detection, Content analysis, Physical and non-physical content features, Text processing</keyword>
	<start_page>50</start_page>
	<end_page>29</end_page>
	<web_url>http://jsdp.rcisp.ac.ir/browse.php?a_code=A-10-1862-1&amp;slc_lang=fa&amp;sid=1</web_url>


<author_list>
	<author>
	<first_name>Zoleikha</first_name>
	<middle_name></middle_name>
	<last_name>Jahanbakhsh-Nagadeh</last_name>
	<suffix></suffix>
	<first_name_fa>زلیخا</first_name_fa>
	<middle_name_fa></middle_name_fa>
	<last_name_fa>جهانبخش نقده</last_name_fa>
	<suffix_fa></suffix_fa>
	<email>zoleikha.jahanbakhsh@srbiau.ac.ir</email>
	<code>10031947532846009881</code>
	<orcid>10031947532846009881</orcid>
	<coreauthor>No</coreauthor>
	<affiliation>Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran.</affiliation>
	<affiliation_fa>دانشگاه آزاد اسلامی واحد علوم و تحقیقات تهران</affiliation_fa>
	 </author>


	<author>
	<first_name>Mohammad-Reza</first_name>
	<middle_name></middle_name>
	<last_name>Feizi-Derakhshi</last_name>
	<suffix></suffix>
	<first_name_fa>محمد رضا</first_name_fa>
	<middle_name_fa></middle_name_fa>
	<last_name_fa>فیضی درخشی</last_name_fa>
	<suffix_fa></suffix_fa>
	<email>mfeizi@tabrizu.ac.ir</email>
	<code>10031947532846009882</code>
	<orcid>10031947532846009882</orcid>
	<coreauthor>Yes
</coreauthor>
	<affiliation>Department of Computer Engineering University of Tabriz, Tabriz, Iran.</affiliation>
	<affiliation_fa>دانشگاه تبریز</affiliation_fa>
	 </author>


	<author>
	<first_name>Arash</first_name>
	<middle_name></middle_name>
	<last_name>Sharifi</last_name>
	<suffix></suffix>
	<first_name_fa>آرش</first_name_fa>
	<middle_name_fa></middle_name_fa>
	<last_name_fa>شریفی</last_name_fa>
	<suffix_fa></suffix_fa>
	<email>a.sharifi@srbiau.ac.ir</email>
	<code>10031947532846009883</code>
	<orcid>10031947532846009883</orcid>
	<coreauthor>No</coreauthor>
	<affiliation>Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran.</affiliation>
	<affiliation_fa>دانشگاه آزاد اسلامی واحد علوم و تحقیقات تهران</affiliation_fa>
	 </author>


</author_list>


	</article>
</articleset>
</journal>
