SparkML-Streaming-Tweets

Uses SparkML to build several machine learning models on tweet data, simulating big data management at a small scale.

Steps Performed:

  • Gather tweets using the Twitter API
  • Filter and preprocess the tweets
  • Load the dataset into a Spark DataFrame
  • Build several SparkML models for binary classification of tweets as positive or negative
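
The preprocessing and modelling steps above could look roughly like the sketch below. It is an illustration under stated assumptions, not the repository's actual code: the helper names (`clean_tweet`, `filter_tweets`, `build_pipeline`), the column names (`clean_text`, `label`), the specific cleaning rules, and the choice of TF-IDF features with logistic regression are all hypothetical. The source only says that "different models" were built, so any other SparkML classifier could be swapped in.

```python
import re


def clean_tweet(text):
    """Normalize one raw tweet (hypothetical cleaning rules)."""
    text = re.sub(r"https?://\S+", " ", text)   # drop URLs
    text = re.sub(r"@\w+", " ", text)           # drop @mentions
    text = text.replace("#", " ")               # keep hashtag words, drop '#'
    text = re.sub(r"[^A-Za-z\s]", " ", text)    # drop digits and punctuation
    return " ".join(text.lower().split())       # lowercase, collapse whitespace


def filter_tweets(tweets):
    """Clean every tweet and discard those left empty after cleaning."""
    cleaned = (clean_tweet(t) for t in tweets)
    return [t for t in cleaned if t]


def build_pipeline():
    """Assumed SparkML pipeline: tokenize -> TF-IDF -> logistic regression.

    pyspark is imported here so the helpers above work without Spark installed.
    Expects a DataFrame with a string column 'clean_text' and a numeric
    column 'label' (assumed encoding: 1.0 = positive, 0.0 = negative).
    """
    from pyspark.ml import Pipeline
    from pyspark.ml.classification import LogisticRegression
    from pyspark.ml.feature import HashingTF, IDF, Tokenizer

    tokenizer = Tokenizer(inputCol="clean_text", outputCol="words")
    tf = HashingTF(inputCol="words", outputCol="tf", numFeatures=1 << 16)
    idf = IDF(inputCol="tf", outputCol="features")
    clf = LogisticRegression(featuresCol="features", labelCol="label", maxIter=20)
    return Pipeline(stages=[tokenizer, tf, idf, clf])
```

With a running `SparkSession` named `spark`, usage might be `df = spark.createDataFrame(rows, ["clean_text", "label"])` followed by `model = build_pipeline().fit(df)`, where `rows` is a list of `(cleaned_text, label)` tuples. Keeping the cleaning logic in plain Python makes it easy to check before it is applied to a Spark DataFrame.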