Skip to content Skip to sidebar Skip to footer

Document Comparison / Similarity Using Whoosh Python Search Library

How do I get a similarity measure of a document using Whoosh? I want to create a 'Related' feature that ranks other previously indexed documents that have a high similarity to a d

Solution 1:

The Whoosh searcher class has a method called 'more_like()'.

It allows you to compare and indexed document to other indexed documents and returns a list of documents similar to the given document.

And the class whoosh.searching.Hit can give a rank and a score.

Post a Comment for "Document Comparison / Similarity Using Whoosh Python Search Library"