Feature selection approach for twitter sentiment analysis and text classification based on chi-square and naive bayes

S. Paudel, P. W. C. Prasad, Abeer Alsadoon, MD. Rafiqul Islam, Amr Elchouemi

Research output: Chapter in Book / Conference PaperConference Paperpeer-review

5 Citations (Scopus)

Abstract

![CDATA[With the rapid growth of web and mobile technology, Social networking services like Twitter are widely used, resulting in large amounts of data being generated daily in social networking sites. Efficient Sentiment analysis of such data is very important for a range of applications and improvement of accuracy in detecting sentiment is the main aim of this research. This report examines the combination of a Chi-Squared feature selection algorithm, k-mean clustering and TF-IDF for attribute weighting based on Naive Bayes, for classification of text and sentiment in communications generated on Twitter. This approach is compared with other approaches based on Naive Bayes to give an account of their relative strengths and weaknesses. When running experiments on multi-domain twitter datasets, results indicate that the proposed method shows superior performance across a range of. The main aim of this research is to enhance the performance of the Naive Bayes classifier using a feature selection technique.]]
Original languageEnglish
Title of host publicationInternational Conference on Applications and Techniques in Cyber Security and Intelligence ATCI 2018: Applications and Techniques in Cyber Security and Intelligence
PublisherSpringer
Pages281-298
Number of pages18
ISBN (Print)9783319987750
DOIs
Publication statusPublished - 2019
EventInternational Conference on Applications and Techniques in Cyber Intelligence -
Duration: 22 Jun 2018 → …

Publication series

Name
ISSN (Print)2194-5357

Conference

ConferenceInternational Conference on Applications and Techniques in Cyber Intelligence
Period22/06/18 → …

Fingerprint

Dive into the research topics of 'Feature selection approach for twitter sentiment analysis and text classification based on chi-square and naive bayes'. Together they form a unique fingerprint.

Cite this