SparkML-Streaming-Tweets

Uses SparkML to build several machine learning models, simulating big data management at a small scale.

Steps Performed:

Using the Twitter API to gather tweets

Filtering and preprocessing the tweets

Using Spark to create a DataFrame from the dataset

Using SparkML to build different models for binary classification of tweets (positive vs. negative sentiment)
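
As an illustration of the filtering and preprocessing step listed above, here is a minimal sketch in plain Python. The cleaning rules (lowercasing, removing URLs and @mentions, dropping retweets and empty results) and the function names `clean_tweet`, `is_usable`, and `preprocess` are assumptions made for this example, not necessarily what the project does.

```python
import re

# Hypothetical cleaning rules, assumed for illustration only.
URL_RE = re.compile(r"https?://\S+")      # links
MENTION_RE = re.compile(r"@\w+")          # @user mentions
NON_ALPHA_RE = re.compile(r"[^a-z\s]")    # anything that is not a lowercase letter or whitespace


def clean_tweet(text: str) -> str:
    """Lowercase a tweet and strip URLs, mentions, and non-letter characters."""
    text = text.lower()
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    text = NON_ALPHA_RE.sub(" ", text)    # also turns '#' and punctuation into spaces
    return " ".join(text.split())         # collapse repeated whitespace


def is_usable(text: str) -> bool:
    """Keep a tweet only if it is not a retweet and is non-empty after cleaning."""
    return not text.startswith("RT ") and bool(clean_tweet(text))


def preprocess(tweets):
    """Filter out unusable tweets and return the cleaned text of the rest."""
    return [clean_tweet(t) for t in tweets if is_usable(t)]


if __name__ == "__main__":
    sample = ["RT @a: hi", "Great day :)", "https://x.y"]
    print(preprocess(sample))
```

The cleaned strings could then be loaded into a Spark DataFrame and tokenized before model training; the exact features and models depend on the project's own code.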