Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lack of reference to fuzzy matching. #114

Open
mrkkollo opened this issue Jun 25, 2020 · 3 comments
Open

Lack of reference to fuzzy matching. #114

mrkkollo opened this issue Jun 25, 2020 · 3 comments

Comments

@mrkkollo
Copy link

There has been a commit to add support for fuzzy matching using the "max_cost" argument in extract_keywords,
however there seems to be no reference to it in the README and the documentation. Currently it feels like many people
don't know such a feature is available.

@olgnaydn
Copy link

olgnaydn commented Jun 25, 2020 via email

@remiadon
Copy link
Contributor

Hi, I implemented the "fuzzyness" feature for flashtext
Benchmarks are not included, and I agree it's lacking of documentation.

Amongst other things, there is a need to make it "smarter", and, perhaps, faster.

@olgnaydn do you have an example to provide that makes you argue that fuzzywhuzzy is more suitable when performance matters ?
From what I know fuzzywhuzzy is not designed for multi-words matching, but I may be wrong

@shivampuri20
Copy link

hi where i can find max argument

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants