fuzzysearch | : pig : Tiny and fast fuzzy search in Go | Search Engine library

by lithammer Go Version: v1.1.8 License: MIT

X-Ray Key Features Code Snippets Community Discussions(5)Vulnerabilities Install Support

kandi X-RAY | fuzzysearch Summary

fuzzysearch is a Go library typically used in Database, Search Engine applications. fuzzysearch has no bugs, it has no vulnerabilities, it has a Permissive License and it has medium support. You can download it from GitHub.

:pig: Tiny and fast fuzzy search in Go

Support

Quality

Security

License

Reuse

Support

fuzzysearch has a medium active ecosystem.

It has 939 star(s) with 57 fork(s). There are 22 watchers for this library.

It had no major release in the last 12 months.

There are 5 open issues and 13 have been closed. On average issues are closed in 12 days. There are 1 open pull requests and 0 closed requests.

It has a neutral sentiment in the developer community.

The latest version of fuzzysearch is v1.1.8

Quality

fuzzysearch has 0 bugs and 0 code smells.

Security

fuzzysearch has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

fuzzysearch code analysis shows 0 unresolved vulnerabilities.

There are 0 security hotspots that need review.

License

fuzzysearch is licensed under the MIT License. This license is Permissive.

Permissive licenses have the least restrictions, and you can use them in most projects.

Reuse

fuzzysearch releases are available to install and integrate.

Installation instructions are not available. Examples and code snippets are available.

Top functions reviewed by kandi - BETA

kandi's functional review helps you automatically verify the functionalities of the libraries and avoid rework.
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of fuzzysearch

Get all kandi verified functions for this library.

fuzzysearch Key Features

No Key Features are available at this moment for fuzzysearch.

fuzzysearch Examples and Code Snippets

No Code Snippets are available at this moment for fuzzysearch.

Community Discussions

Trending Discussions on fuzzysearch

Lucene ignores / overwrite fuzzy edit distance in QueryParser

PostgreSQL: Match Common Name Variants (Nicknames)

In drag and drop selecting an item from list it reappears again on search

gatsby build fail when passing navigate state from react component to page

solr fuzzy vs wildcard vs stemmer

QUESTION

Lucene ignores / overwrite fuzzy edit distance in QueryParser

Asked 2021-Apr-14 at 20:41

Given the following QueryParser with a FuzzySearch term in the query string:

...

ANSWER

Answered 2021-Apr-14 at 20:41

This may cross the border into "not an answer" - but it is too long for a comment (or a few comments):

Why is this?

That was a design decision, it would seem. It's mentioned in the documentation here.

"The value is between 0 and 2"

There is an old article here which gives an explanation:

"Larger differences are far more expensive to compute efficiently and are not processed by Lucene.".

I don't know how official that is, however.

More officially, from the JavaDoc for the FuzzyQuery class, it states:

"At most, this query will match terms up to 2 edits. Higher distances (especially with transpositions enabled), are generally not useful and will match a significant amount of the term dictionary."

How can I correctly get the fuzzy edit distance I want into the query parser?

You cannot, unless you customize the source code.

The best (least worst?) alternative, I think, is probably the one mentioned in the above referenced FuzzyQuery Javadoc:

"If you really want this, consider using an n-gram indexing technique (such as the SpellChecker in the suggest module) instead."

In this case, one price to be paid will be a potentially much larger index - and even then, n-grams are not really equivalent to edit distances. I don't know if this would meet your needs.

Source https://stackoverflow.com/questions/67088386

QUESTION

PostgreSQL: Match Common Name Variants (Nicknames)

Asked 2021-Mar-21 at 20:49

Scenario

I have a number of enterprise datasets that I must find missing links between, and one of the ways I use for finding potential matches is joining on first and last name. The complication is that we have a significant number of people who use their legal name in one dataset (employee records), but they use either a nickname or (worse yet) their middle name in others (i.e., EAD, training, PIV card, etc.). I am looking for a way to match up these potentially disparate names across the various datasets.

Simplified Example

Here is an overly simplified example of what I am trying to do, but I think it conveys my thought process. I begin with the employee table:

Employees table employee_id first_name last_name 052451 Robert Armsden 442896 Jacob Craxford 054149 Grant Keeting 025747 Gabrielle Renton 071238 Margaret Seifenmacher

and try to find the matching data from the PIV card dataset:

Cards table card_id first_name last_name 1008571527 Bobbie Armsden 1009599982 Jake Craxford 1004786477 Gabi Renton 1000628540 Maggy Seifenmacher Desired Result

After trying to match these datasets on first name and last name, I would like to end up with the following:

Employees_Cards table emp_employee_id emp_first_name emp_last_name crd_card_id crd_first_name crd_last_name 052451 Robert Armsden 1008571527 Bobbie Armsden 442896 Jacob Craxford 1009599982 Jake Craxford 054149 Grant Keeting NULL NULL NULL 025747 Gabrielle Renton 1004786477 Gabi Renton 071238 Margaret Seifenmacher 1000628540 Maggy Seifenmacher

As you can see, I would like to make the following matches:

Gabrielle -> Gabi
Jacob -> Jacob
Margaret -> Maggy
Robert -> Bobbie

My initial thought was to find a common names dataset along the lines of:

Name_Aliases table name1 name2 name3 name4 Gabrielle Gabi NULL NULL Jacob Jake NULL NULL Margaret Maggy Maggie Meg Michael Mike Mikey Mick Robert Bobbie Bob Rob

and use something like this for the JOIN:

...

ANSWER

Answered 2021-Mar-20 at 01:10

How to structure and query and the aliases table is an interesting question. I'd suggest organizing it in pairs rather than wider rows, because you don't know in advance how many variations may eventually be needed in a group of connected names, and a two column structure gives you the ability to add to a given group indefinitely:

name1 name2 Jacob Jake Margaret Maggy Margaret Maggie Margaret Meg Maggy Maggie Maggy Meg Maggie Meg

Then you just check both columns in each JOIN in the query, something like this:

Source https://stackoverflow.com/questions/66716155

QUESTION

In drag and drop selecting an item from list it reappears again on search

Asked 2020-May-21 at 02:55

I have a dnd selection tab with search functionality to filter the items available. Once an item is selected from draggable section it should be removed from that section and dropped to the droppable section and vice versa. The link to my code is https://codesandbox.io/s/dnd-search-select-sort-xfdtn When an item say "Apple" is selected it is going to the droppable section but when I search for "Apple" again in the draggable section search bar it reappears again and I can again move that to the droppable section which should not be the case. Once it is selected it should not appear again on the list. Below is the corresponding code.

...

ANSWER

Answered 2020-May-21 at 02:55

In your state you create Fuse instance once with full list of items. In your handleItemSearch you always search against full list of items. Hence the issue.

To solve the issue, create fresh instance of Fuse in your handleItemSearch

Source https://stackoverflow.com/questions/61925994

QUESTION

gatsby build fail when passing navigate state from react component to page

Asked 2020-Apr-28 at 18:03

I'm building a help page with Gatsby and have a search bar (Searchbar.js) where I'm trying to pass the user's input in the field (search bar is always present within the page--think like Evernote's help page) to a component that conducts the search (search.js), which then passes that output to the actual results page (SearchResults.js).

When I do gatsby develop everything works as it should, but when I do a gatsby build I get an error where it says it cant read the property "query" because its undefined (line 63 of search.js: var search = location.state.query.trim()). Why is this failing on build?

Searchbar.js

...

ANSWER

Answered 2020-Apr-28 at 17:03

location is short for window.location, but at build-time your code is running in Node.js which does not have a window. Instead consider testing for the existence of window (typeof window !== "undefined") before running your location.state.query.trim call, and fall back to a default value in the case that window does not exist.

Source https://stackoverflow.com/questions/61484648

QUESTION

solr fuzzy vs wildcard vs stemmer

Asked 2020-Apr-10 at 10:41

I have couple of questions here.

I want to search a term jumps

With Fuzzy search, I can do jump~ With wild card search, I can do jump* With stemmer I can do, jump

My understanding is that, fuzzy search gives pump. Wildcard search gives jumping as well. Stemmer gives "jumper" also.

I totally agree with the results.

What is the performance of thes three?
- Wild card is not recommended if it is at the beginning of the term - my understanding as it has to match with all the tokens in the index - But in this case, it would be all the tokens which starts jump
- Fuzzy search gives me unpredicted results - It has to do something kind of spellcheck I assume.
- Stemmer suits only particular scenarios like it can;t match pumps.
How should I use these things which can give more relevant results?

I probably more confused about all these because of this section. Any suggestions please?

...

ANSWER

Answered 2020-Apr-10 at 08:08

For question 2 you can go strict to permissive.

Option one: Only give strict search result. If no result found give stemmer results. Continue with fuzzy or wildcard search if no result found previously.

Option two: Give all results but rank them by level (ie. first exact match, then stemmer result, ...)

Source https://stackoverflow.com/questions/61132899

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install fuzzysearch

You can download it from GitHub.

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: