Skip to content

Latest commit

 

History

History
19 lines (15 loc) · 1.03 KB

File metadata and controls

19 lines (15 loc) · 1.03 KB

web_data_mining

This project is an analysis performed on Twitter and Instagram data on the topic 'quarantine'. It features 7 main tasks in total, which are the following:

Project tasks
  • (task_1) Data collecting
  • (task_2) Pre-processing
  • (task_3) Emerging topics extraction
  • (task_3) Sentiment and emotion analysis
  • (all tasks) Visualization of data analysis
  • (task_5) User Profiling
  • (task_6) Like Prediction
  • (task_7) Web Application with Flask to showcase the visualization analysis

Website with analysis results hosted on HEROKU at: https://web-data-mining.herokuapp.com (up to 30sec. loading time)

Created by: Chrysovalantis Kontoulis, Orestis Lampridis and Petros Tzallas

How to read big json file into database: twitter_db and collection: twitter_collection of mongoDB (file: tweets_full.json)

mongoimport --db twitter_db --collection twitter_collection --file tweets_full.json --batchSize 1