Open Access   Article Go Back

Analysis of Indian Election using Random Forest Algorithm

Kirti Chouksey1 , Amit Ranjan2

Section:Research Paper, Product Type: Journal Paper
Volume-07 , Issue-10 , Page no. 50-57, May-2019

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v7si10.5057

Online published on May 05, 2019

Copyright © Kirti Chouksey, Amit Ranjan . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Kirti Chouksey, Amit Ranjan, “Analysis of Indian Election using Random Forest Algorithm,” International Journal of Computer Sciences and Engineering, Vol.07, Issue.10, pp.50-57, 2019.

MLA Style Citation: Kirti Chouksey, Amit Ranjan "Analysis of Indian Election using Random Forest Algorithm." International Journal of Computer Sciences and Engineering 07.10 (2019): 50-57.

APA Style Citation: Kirti Chouksey, Amit Ranjan, (2019). Analysis of Indian Election using Random Forest Algorithm. International Journal of Computer Sciences and Engineering, 07(10), 50-57.

BibTex Style Citation:
@article{Chouksey_2019,
author = {Kirti Chouksey, Amit Ranjan},
title = {Analysis of Indian Election using Random Forest Algorithm},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2019},
volume = {07},
Issue = {10},
month = {5},
year = {2019},
issn = {2347-2693},
pages = {50-57},
url = {https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=973},
doi = {https://doi.org/10.26438/ijcse/v7i10.5057}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i10.5057}
UR - https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=973
TI - Analysis of Indian Election using Random Forest Algorithm
T2 - International Journal of Computer Sciences and Engineering
AU - Kirti Chouksey, Amit Ranjan
PY - 2019
DA - 2019/05/05
PB - IJCSE, Indore, INDIA
SP - 50-57
IS - 10
VL - 07
SN - 2347-2693
ER -

           

Abstract

The proliferation of social media in the recent past has provided end users a powerful platform to voice their opinions. Businesses (or similar entities) need to identify the polarity of these opinions in order to understand user orientation and thereby make smarter decisions. One such application is in the field of politics, where political entities need to understand public opinion and thus determine their campaigning strategy. Sentiment analysis on social media data has been seen by many as an effective tool to monitor user preferences and inclination. Sentiment analysis is the computational study of opinions, sentiments, evaluations, attitudes, views and emotions expressed in text. It refers to a classification problem where the main focus is to predict the polarity of words and then classify them into positive or negative sentiment. Sentiment analysis over Twitter offers people a fast and effective way to measure the public’s feelings towards their party and politicians. The primary issues in previous sentiment analysis techniques are classification accuracy, as they incorrectly classify most of the tweets with the biasing towards the training data. We performed data (text) mining on thousands of tweets collected over a period of a month that referenced five national political parties in India, during the campaigning period for general state elections in 2018. We made use of both supervised and unsupervised approaches. We utilized Dictionary Based, Random Forest algorithm as the main algorithm to build our classifier and classified the test data as positive, negative and neutral. We identified the sentiment of Twitter users towards each of the considered Indian political parties. The result of the analysis was for the Bhartiya Janta Party. Proposed algorithm predicted a chance that the BJP would win more elections in the general election. Therefore, here we adopt a lexicon based sentiment analysis method, which will exploit the sense definitions, as semantic indicators of sentiment. Our method also uses a negation handling as a pre-processing step in order to achieve high accuracy.

Key-Words / Index Term

Negation Handling; Sentiment Analysis; WordNet; SentiWordNet; Word Sense Disambiguation

References

[1] Pak and P. Paroubek, Twitter as a corpus for sentiment analysis and opinion mining Proceedings of the 7th International Conference on Language Resources and Evaluation, 2010, pp.1320-1326.
[2] “Number of monthly active Twitter users worldwide from 1st quarter 2010 to 3rd quarter 2017 “, Availabe at “https://www.statista.com/statistics/282087/number-ofmonthly- active-twitter-users/. “ [Online; accessed 4-December-2017].
[3] K. Bannister, “Sentiment Analysis”. Available: “ https://www.brandwatch.com/blog/understanding-sentimentanalysis/ ”.
[4] J. Dean and S. Ghemawat, “MapReduce: Simplified data processing on large clusters,” in Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation, Volume 6, pp. 10–10, 2004.
[5] Xiaohui Yu, Yang Liu, Jimmy Xiangji Huang, Aijun An, “Mining Online Reviews for Predicting Sales Performance: A Case Study in the Movie Domain”, IEEE TRANSACTIONS ON KNOWLEDGE AND DATA, vol. 24, No.4, APRIL 2012.
[6] Andrei Oghina, Mathias Breuss, Manos Tsagkias&Maarten de Rijke. (2012) Predicting IMDB movie ratings using social media. Proceedings of the 34th European conference on Advances in Information Retrieval, pp. 503-507.
[7] Liu, Bing, and Lei Zhang. "A survey of opinion mining and sentiment analysis." In Mining text data, pp. 415-463. Springer US, 2012.
[8] P. Burnap, R. Gibson, L. Sloan, R. Southern, and M. Williams, 140 characters to victory?: Using Twitter to predict the UK 2015 General Election Journal of Electoral Studies, vol. 41, pp. 230-233, 2016.
[9] M.P. Cameron., P. Barrett, and B. Stewardson, Can social media predict election results? Evidence from New Zealand Journal of Political Marketing, vol. 15, pp. 416-432, 2016.
[10] D. Gayo- Limits of electoral predictions using Twitter Proceedings of the 5th ICWSM, 2011, pp. 178 18.
[11] H.T. Le, G.R. Boynton, Y. Mejova, Z. Shafiq, and P. Srinivasan, Revisiting The American Voter on Twitter Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017, pp. 4507-4519.
[12] A. Pak and P. Paroubek, Twitter as a corpus for sentiment analysis and opinion mining Proceedings of the 7th International Conference on Language Resources and Evaluation, 2010, pp.1320-1326.
[13] B.O. Connor, R. Balasubramanyan, B.R. Routledge, and N.A. Smith,” From tweets to Polls: Linking Text Sentiment to public opinion time series” Proceedings of the 4th ICWSM, 2010, pp 122 129.
[14] Rui Xia, FengXu, ChengqingZong, QianmuLi, Yong Qi, and Tao Li, August 2015,” Dual Sentiment Analysis: Considering Two Sides of One Review”, IEEE transactions on knowledge and data engineering, Vol.27, AnNo.8.
[15] Pang, Bo, Lillian Lee, and Shivakumar Vaithyanathan. "Thumbs up?: sentiment classification using machine learning techniques." Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10. Association for Computational Linguistics, 2002.
[16] Go, Alec, Richa Bhayani, and Lei Huang. "Twitter sentiment classification using distant supervision." CS224N Project Report, Stanford 1.2009 (2009).
[17] Mohammad, Saif M., Svetlana Kiritchenko, and Xiaodan Zhu. "NRC Canada: Building the state-of-the-art in sentiment analysis of tweets."arXiv preprint arXiv:1308.6242 (2013).
[18] Pontiki, Maria, et al. "SemEval-2016 task 5: Aspect based sentiment analysis." ProWorkshop on Semantic Evaluation (SemEval-2016). Association for Computational Linguistics, 2016.
[19] Rosenthal, Sara, Noura Farra, and Preslav Nakov. "SemEval-2017 task: Sentiment analysis in Twitter." Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017). 2017.
[20] Yang, Ang, et al. "Enhanced Twitter Sentiment Analysis by Using Feature Selection and Combination." Security and Privacy in Social Networks and Big Data (SocialSec), 2015 International Symposium on. IEEE, 2015.
[21] Amitava Das, SivajiBandopadaya, SentiWordnet for Bangla, Knowledge Sharing Event -4: Task, Volume 2, 2010.
[22] Amitava Das, SivajiBandopadaya, ”SentiWordnet for Indian Languages”, Proceedings of the 8th Workshop on Asian Language Resources, Pages 5663, Beijing, China, August 2010.
[23] Yakshi Sharma, VeenuMangat, MandeepKaur, A practical Approach to Semantic Analysis of Hindi tweets”, 1st International Conference on Next Generation Computing Technologies(NGCT-2015), Dehradun, India,Page No(677-680), September 4-5, 2015.
[24] Yu Huangfu, Guoshiwu, Yu Su Jing Li, Pengfei Sun Jie Hu, “Än Improved Sentiment Analysis Algorithm for Chinese news”, 12th International Conference on Fuzzy Systems and Knowedge Discovery(FSKD), Page No(1366-1371), 2015.
[25] Purtata Bhoir, Shilpa Kolte, “Sentiment Analysis of Movie Reviews using Lexicon approach”, IEEE International Conference on Computational Intelligence and Computing Research, 2015.