Jonathan Law

I’m a Data Scientist and Business Analyst currently employed at The Chemours Company.
I have a passion for data, statistics, programming, and using all of those things to help people run a better business.

  • Blog
  • About
  • Contact

Document Class Comparison with TF-IDF & Python

February 06, 2019 by Jonathan Law in Data Science

There are many different techniques within the world of natural language processing, ranging from the very simple to the very complex. In this tutorial, we're going to be looking at one of the simpler techniques. Although it is simple, it is powerful. Using a concept called TF-IDF and a bit of linear algebra, we'll be able to compare any two documents or any two classes of documents for similarity.

Read More
February 06, 2019 /Jonathan Law
data science, tfidf, nlp, natural language, python
Data Science

Powered by Squarespace