Kaggle Quora duplicate question analysis – a very naive approach with a Decision Tree

The aim of this Kaggle competition is to predict whether the question pairs in the data set, obtained from Quora, have the same meaning. There are currently many approaches in the Kaggle Kernels section, each with its own merits and drawbacks. In this analysis I hope to experiment with the most popular methods as described […]

My choice for OS in data science

Just sharing a nice comic posted by Duncan Hull. In my opinion, this comic accurately reflects the culture of different user groups! We all know that in many real-life situations there can be many solutions to a single problem. This is also true for a data scientist. Many junior or aspiring data scientists often wonder what […]