A large pounds in tf–idf is attained by a superior term frequency (while in the supplied document) as well as a very low document frequency of your expression in The full collection of documents; the weights therefore have a tendency to filter out frequent terms. This probabilistic interpretation in turn requires exactly the same kind as that