You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This project aims to practice NLP using spark and process bigdata using Amazon Web Service.
The workflow showed below:
About
This project was run in DataBricks using spark to analyze the recent news in 'cancer' for sentiment evaluation. The goal of this project is to practice traditional NLP like tokenization, stopwords, CV and TF-IDF, N-grams. Also, this project applied tools like AWS S3, athena, QuickSight etc. to address big data.