Opinion mining on newspaper headlines using SVM and NLP

Chaudhary Jashubhai Rameshbhai, Joy Paulose

Abstract


Opinion Mining also known as Sentiment Analysis, is a technique or procedure which uses Natural Language processing (NLP) to classify the outcome from text. There are various NLP tools available which are used for processing text data. Multiple research have been done in opinion mining for online blogs, Twitter, Facebook etc. This paper proposes a new opinion mining technique using Support Vector Machine (SVM) and NLP tools on newspaper headlines. Relative words are generated using Stanford CoreNLP, which is passed to SVM using count vectorizer. On comparing three models using confusion matrix, results indicate that Tf-idf and Linear SVM provides better accuracy for smaller dataset. While for larger dataset, SGD and linear SVM model outperform other models.


Keywords


newspaper; sentiment analysis; opinion mining; NLTK; stanford coreNLPm; SVM; SGDClassifier; Tf-idf; CountVectorizer;

Full Text:

PDF


DOI: http://doi.org/10.11591/ijece.v9i3.pp2152-2163
Total views : 450 times


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

ISSN 2088-8708, e-ISSN 2722-2578