การพัฒนาระบบตรวจจับข่าวเท็จบน Twitter

นภสินธุ์ ตันธุวนิตย์; นริศ ทากุ; บรรพต ดาราวรรณกุล

Please use this identifier to cite or link to this item: https://ir.swu.ac.th/jspui/handle/123456789/10055

Title:	การพัฒนาระบบตรวจจับข่าวเท็จบน Twitter
Other Titles:	A Development of System for DetectingFake News on Twitter
Advisor :	วีรยุทธ เจริญเรืองกิจ
Authors:	นภสินธุ์ ตันธุวนิตย์ นริศ ทากุ บรรพต ดาราวรรณกุล
Keywords:	Detecting fake news ระบบตรวจจับข่าวเท็จ ทวิตเตอร์ Twitter
Issue Date:	2562
Publisher:	ภาควิชาวิทยาการคอมพิวเตอร์ มหาวิทยาลัยศรีนครินทรวิโรฒ
Abstract:	งานวิจัยนี้มีวัตถุประสงค์เพื่อพัฒนาระบบสำหรับการตรวจจับข่าวเท็จบน Twitter โดยใช้เกี่ยวกับการวิเคราะห์ข้อความโดยข้อมูลด้านอารมณ์ความรู้สึกจากข้อความ มาใช้ร่วมกับการวัดความน่าเชื่อถือของโดเมน ซึ่งการดำเนินงานวิจัยนี้ จะใช้กระบวนการพัฒนาทั้งหมด 4 งาน โดยใช้ Long Short-Term Memory เป็นโมเดลในการ Classify ซึ่งงานที่ 1 คือ สร้างโมเดลสำหรับระบบตรวจจับข่าวลือ ซึ่งใช้หลักการ Natural Language Processing ในการสร้างฟีเจอร์ โดยใช้ Word2Vec,Sentiment Analysis,TF-IDF และ Part Of Speech ซึ่งมีค่าความแม่นยำของการตรวจจับสูงสุด คือ 86.18 % ในส่วนของงานที่ 2 คือ การเก็บรวบรวมข้อมูลทั้ง Tweet และ Reply จาก Twitter เพื่อนำไปเป็นข้อมูลที่ได้ไปวิเคราะห์ในงานถัดไป ในส่วนของงานที่ 3 คือ สร้างโมเดลสำหรับแยกแยะความคิดเห็นของ tweet และ reply ของ tweet นั้น ว่ามีความคิดเห็นไปในทิศทางใด เช่น สนับสนุน, ปฏิเสธ, ตั้งคำถาม หรือเป็นแค่เพียงการแสดงความคิดเห็นเท่านั้น เพื่อจะนำมาใช้เป็นฟีเจอร์ในการตรวจจับข่าวเท็จต่อไป ในส่วนของงานที่ 4 คือ สร้างโมเดลสำหรับตรวจจับข่าวเท็จ ซึ่งเราจะนำฟีเจอร์ที่ได้จากงานที่ 3 พร้อมด้วยฟีเจอร์อื่น ๆ มาใช้ โดยกลุ่มวิจัยจะนำเอาโมเดลที่ได้จากงานที่ 1, งานที่ 3 และ งานที่ 4 มาผสมกัน โดยใช้การเรียนรู้ที่เรียกว่า Multitask-Learning This research aims to develop fake news detection for Twitter using text analysis, sentiment analysis, and domain credibility by developing 4 modules. All modules implement Long Short-Term Memory (LSTM) to train data into a classification model. The first module is to create the LSTM model for rumour detection by using Natural Language Processing (NLP) technique to create features implementing Word2Vec, Sentiment Analysis, Part of Speech, and Term Frequency Inverse Document Frequency (TD-IDF). The results from the LSTM model achieve 86.18% accuracy. The second module is gathering data for tracking the tweets and replies from twitter in order to analyze them for the next module. The third module creates a LSTM model to classify the opinion direction on twitter such as supporting, denying or questioning, which is used as a feature in the fake news detection model. The fourth module is to create a LSTM model to detect fake news. Using features from the 3rd module with the feature from the source tweet. The classification results are obtained from the combination of all models of the 1st module, 3rd module and 4th module using Multitask-Learning.
URI:	https://ir.swu.ac.th/jspui/handle/123456789/10055
Appears in Collections:	ComSci-Senior Projects

Files in This Item:

File	Description	Size	Format
default.html	Report (SWU Only)	344 B	HTML	View/Open

Show full item record