This repository contains a module for building dataset for the task of predicting virality for online articles. This aims at developing tools to predict virality of the articles before publishing them.
Commands for installing newspaper library
sudo apt-get install python3-pip
sudo apt-get install python-dev
sudo apt-get install libxml2-dev libxslt-dev
sudo apt-get install libjpeg-dev zlib1g-dev libpng-dev
sudo apt install curl
curl https://raw.githubusercontent.com/codelucas/newspaper/master/download_corpora.py | python3
pip3 install newspaper3k
Commands for installing selenium library and setting up chromedriver on ubuntu.
pip install selenium
sudo apt-get update (to update ubuntu to correctly run apt install)
sudo apt install chromium-chromedriver
cp /usr/lib/chromium-browser/chromedriver /usr/bin
pip install python-dateutil
pip install webdriver-manager