a tree in a sunlit forest
,

What is a Random Forest?

In my early journey into the murky depths of data science and machine learning I've come across the phrase Random Forest a few times, and been completely clueless as to what it actually referred to. Today I decided to dive in and explore…

Basic NLP Text Preprocessing

I’m in the midst of unsteadily lurching through my first real Kaggle Competition, leaving behind the false comfort of the MNIST Digit recognition and other “Hello World” style machine learning problems. During this process it was very quickly made clear to me just how much of a beginner I am, and that, even with my background in philosophy, I’ve just barely scratched the surface when it comes to Natural Language Processing (NLP) in a Machine Learning context. This was made abundantly clear to me when, after looking at some of the leading kernels for inspiration, found they all practiced some form of NLP text preprocessing and analysis prior to feeding their data into the almighty Algorithm (typically BERT, in this scenario).