ITCS 6190 Cloud Computing For Data Analysis Project

Twitter Sentiment Analysis using Naive Bayes Algorithm

Below Technologies and frameworks have been used to develop the project:

Python: Pandas, Numpy, Nltk
Twitter API and Tweepy Library

Amazon Web Services(AWS): S3, EMR

Below Steps have been used to develop the project:

1. Data Scraping: Collection of Tweets using Twitter API and Tweepy library

2. Data Pre-processing: Multiple “.json” files concatenated to create datasets for Positive and Negative sentiments

3. Model Building: Splitting the data in Training and Testing and building Naïve Bayes model

4. Prediction: Predicting the sentiment of unseen tweets

ITCS 6190

- Cloud Computing for Data Analysis

Course Project by Sameer Kulkarni (MS CS at UNC Charlotte)

Twitter Sentiment Analysis

About

Twitter Sentiment Analysis using Naive Bayes Algorithm