Skip to content

A hybrid Sentiment classifier using NLTK and scikit learn (Combining rule based, lexical and machine learning techniques)

License

Notifications You must be signed in to change notification settings

MechMaster95/Pipelined-Sentiment-Classifier

 
 

Repository files navigation

TwitterHybridClassifier

  • Authors: L V Raju Nadimpalli, Akhil Sharma
  • Date: 16/05/17

These python scripts provide the TwitterClassifier I used for my B.tech final year project

Dependencies :

I also used ark_twitter_nlp Pos tagger as well as the NLTK pos tagger.

The lexicon this library uses are:

Running the code :

  • To obtain the end output make an instance of the TwitterHybridClassifier() class providing your pos tagged and tokenzied dataset as arguments

Output :

The final output of would be the sentiment(i.e. 'positive', 'negative' or 'neutral') predicted by the classifer along with the accuracy and confusion matrix.

Instead of using the NLTK tokenizer I have used the tokenizer built by Brendan O'Connor(http://github.com/brendano/tweetmotif)

The datasets I have used in my codes are of SemEval 2014 which I cannot provide because of legal reasons. You can contact the organisers of the SemEval for the datasets.

About

A hybrid Sentiment classifier using NLTK and scikit learn (Combining rule based, lexical and machine learning techniques)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%