Skip to content

Latest commit

 

History

History
37 lines (32 loc) · 670 Bytes

File metadata and controls

37 lines (32 loc) · 670 Bytes

Text_Classification_Tweets

Sentiment Analysis and Text Classification NLP task with Fasttext and Neural Network

Table of Contents

  • Project Aim
  • Dataset
  • Libraries
  • Models Selection

1. Project Aim

This project is the tweet classification task to distinguish 3 categories of tweets:

  1. racist
  2. sexist
  3. neither

2. Dataset

The dataset was a set of over 11K tweets

3. Libraries

  • numpy
  • pandas
  • nltk
  • re
  • plotly
  • matplotlib
  • sklearn
  • json
  • tqdm
  • string
  • pprint
  • keras
  • tensorflow

4. Models

There two main architectures used in the project

  1. FastText Word Embeddings with SVM
  2. Deep Learning Model with FastText Word Embeddings