- Topic Modeling with BERT - Nov 3, 2020.
Leveraging BERT and TF-IDF to create easily interpretable topics.
BERT, NLP, TF-IDF, Topic Modeling
- Content-Based Recommendation System using Word Embeddings - Aug 14, 2020.
This article explores how average Word2Vec and TF-IDF Word2Vec can be used to build a recommendation engine.
NLP, Recommendation Engine, Recommender Systems, TF-IDF, Word Embeddings, word2vec
An Introductory Guide to NLP for Data Scientists with 7 Common Techniques - Jan 9, 2020.
Data Scientists work with tons of data, and many times that data includes natural language text. This guide reviews 7 common techniques with code examples to introduce you the essentials of NLP, so you can begin performing analysis and building models from textual data.
Data Preparation, NLP, Sentiment Analysis, TF-IDF, Tokenization, Topic Modeling, Word Embeddings
- WTF is TF-IDF? - Aug 2, 2018.
Relevant words are not necessarily the most frequent words since stopwords like “the”, “of” or “a” tend to occur very often in many documents.
Information Retrieval, Python, Text Analytics, Text Mining, TF-IDF
- Recommender Engine - Under The Hood - Feb 21, 2018.
We examine two main types of recommender systems: Content based and Collaborative filtering. Both have their pros and cons depending upon the context in which you want to use them.
Recommendation Engine, Recommender Systems, TF-IDF
- Top KDnuggets tweets, Jun 25-26: 12 JavaScript Libraries for Data Viz; TF-IDF – key measure for Text Analytics - Jun 27, 2014.
Very useful: 12 JavaScript Libraries for Data Visualization; Introduction to TF-IDF - key measure for Text Analytics; UC Berkeley new online MS in #DataScience, 18 months, $60K; XLMiner solves Big Data Problems in Excel.
Data Visualization, Javascript, Text Analytics, TF-IDF, UC Berkeley, XLMiner