Open Access   Article Go Back

Hybrid Recommendation Engine with Web Scraping and Sentiment Analysis

N. Sontakke1 , abarinath S2 , S. Sangamnerkar3 , V. Iyer4

Section:Research Paper, Product Type: Journal Paper
Volume-07 , Issue-07 , Page no. 22-31, Mar-2019

Online published on Mar 30, 2019

Copyright © N. Sontakke, Sabarinath S, S. Sangamnerkar, V. Iyer . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: N. Sontakke, Sabarinath S, S. Sangamnerkar, V. Iyer, “Hybrid Recommendation Engine with Web Scraping and Sentiment Analysis,” International Journal of Computer Sciences and Engineering, Vol.07, Issue.07, pp.22-31, 2019.

MLA Style Citation: N. Sontakke, Sabarinath S, S. Sangamnerkar, V. Iyer "Hybrid Recommendation Engine with Web Scraping and Sentiment Analysis." International Journal of Computer Sciences and Engineering 07.07 (2019): 22-31.

APA Style Citation: N. Sontakke, Sabarinath S, S. Sangamnerkar, V. Iyer, (2019). Hybrid Recommendation Engine with Web Scraping and Sentiment Analysis. International Journal of Computer Sciences and Engineering, 07(07), 22-31.

BibTex Style Citation:
@article{Sontakke_2019,
author = {N. Sontakke, Sabarinath S, S. Sangamnerkar, V. Iyer},
title = {Hybrid Recommendation Engine with Web Scraping and Sentiment Analysis},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {3 2019},
volume = {07},
Issue = {07},
month = {3},
year = {2019},
issn = {2347-2693},
pages = {22-31},
url = {https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=898},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=898
TI - Hybrid Recommendation Engine with Web Scraping and Sentiment Analysis
T2 - International Journal of Computer Sciences and Engineering
AU - N. Sontakke, Sabarinath S, S. Sangamnerkar, V. Iyer
PY - 2019
DA - 2019/03/30
PB - IJCSE, Indore, INDIA
SP - 22-31
IS - 07
VL - 07
SN - 2347-2693
ER -

           

Abstract

In recent years a lot of data has been generated owing to the exponential increase in internet usage. People are overloaded with information, spanning over multiple domains. This helps people obtain knowledge and come to informed decisions. If we consider purchasing a product as a use case, the buyer can visit multiple websites to find strengths and weaknesses of the product as well as the opinions of other purchasers. To make this process easier and faster for the buyer we propose a robust and scalable hybrid recommendation system which is implemented as a combination of Content Based Recommendation System and Collaborative Filtering Techniques. As a test case, this system has been used to help purchase smartphones based on all features and sentiments of previous purchasers. System gets data using a self-learning web crawler that gathers data from a number of relevant websites irrespective of their different structures. The sentiments of users who purchased the same items previously have also been analyzed to aid the current buyers’ decisions. In this paper, we have provided detailed survey and comparison of multiple techniques we reviewed before implementing each module.

Key-Words / Index Term

Artificial intelligence, Decision support systems, Knowledge based systems, Hybrid intelligence systems, Data Mining, Web Mining, Supervised Learning, Recommender systems, Content based retrieval, Information retrieval, Clustering algorithms, Classification algorithms, Natural language processing, Sentiment analysis, Recurrent Neural Networks.

References

[1] P. Srinivasan,F. Menczer, G. Pant“A General Evaluation Framework for Topical Crawlers”, 2004.
[2] Ziyan Zhou and Muntasir Mashuq, “Web Content Extraction Through Machine Learning”, Stanford University.
[3] Amit Gupte, Sourabh Joshi, Pratik Gadgul, Akshay Kadam, “Comparative Study of Classification Algorithms used in Sentiment Analysis”, 2014.
[4] Yun Y., Hooshyar D., Jo J., Lim H., “Developing a hybrid collaborative filtering recommendation system with opinion mining on purchase review.”, 2017.
[5] Wenxian Wang, Xingshu Chen, Yongbin Zou, “A Focused Crawler Based on Naive Bayes Classifier”, Third International Symposium on Intelligent Information Technology and Security Informatics, 2010.
[6] Duygu Taylan, Mitat Poyraz, Selim Akyokuş, Murat Can Ganiz, “Intelligent Focused Crawler: Learning which Links to Crawl”, IEEE, 2011.
[7] Youngki Park, Sungchan Park, Sang-goo Lee, “Fast Collaborative Filtering with a k- Nearest Neighbor Graph”, Journal of Machine Learning Research, 2014.
[8] Claudio Adrian Levinas., “An Analysis of Memory Based Collaborative Filtering Recommender Systems with Improvement Proposals”, 2014.
[9] Koji Miyahara†, Michael J. Pazzani, “Improvement of Collaborative Filtering with the Simple Bayesian Classifier”, 2013.
[10] J. Herlocker, J. A. Konstan, J. Riedl,“An Empirical Analysis of Design Choices in Neighborhood-Based Collaborative Filtering Algorithms”, Information Retrieval, Vol 5, Issue 4, 2002.
[11] Smyth B, “Case-Based Recommendation”, The Adaptive Web. Lecture Notes in Computer Science, Vol. 4321, Springer, Berlin, 2007.
[12] Dima S. Mahmoud, Robert I. John, “Enhanced Content-based Filtering Algorithm using Artificial Bee Colony Optimisation”, SAI Intelligent Systems Conference, London, 2015.
[13] Shivaprasad T K, J. Shetty, “Sentiment Analysis of Product Reviews:A Review”, 2017.
[14] Suiki Li, “Sentiment analysis on Amazon Unlocked mobile phones”, 2017.
[15] Doaa Mohey, El-Din, “Enhancement Bag-of-Words Model for Solving the Challenges of Sentiment Analysis”, 2016.
[16] Xin Rong, “word2vec Parameter Learning Explained”, 2016.
[17] H. Palangi, Li Deng, Y. Shen, J. Gao, Xiaod, “Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval”, 2016.