Predict Customer Satisfaction with ML – Part 2

Posted in Uncategorized | Leave a comment

Predict Customer Satisfaction with ML

So as part of my data science course at General Assembly, i have the opportunity of doing a machine learning project, which is super awesome.

What problem am i solving?

Predicting customer satisfaction using machine learning.

Some basic stats about the data set;
import pandas as pd
df = pd.read_csv('train.csv')
print df.shape
(76,020, 361)

The journey thus far,

Splitting the data set into a new training and test set
from sklearn.cross_validation import train_test_split
train, test = train_test_split(df, train_size=46020, test_size=30000)

After running logistic regression on the new training set above, and using the trained model to predict values for in the test set,  the resulting confusion matrix looked like this.
[[28823 4][1173 0]]


TODO: Provide  some interpretation of the confusion matrix###

  • further tweaking of the train and test data set sizes did not yield any better predictions for the 1s(ones), the model predicted all the zeros accurately but wrong results for the ones.

Follow me on Part 2 of this journey


Posted in Data Science, Machine Learning | Leave a comment

Website under construction

website under construction

Posted in Uncategorized | Leave a comment