TF-IDF stands for “term frequency / inverse document frequency” and is a
method for emphasizing words that occur frequently in a given document,
while at the same time de-emphasising words that occur frequently in
many documents.
source: http://fastml.com/classifying-text-with-bag-of-words-a-tutorial/
No comments:
Post a Comment