-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Vectorize fuzzy_row_match
#489
Conversation
a171b33
to
beadcb4
Compare
20d2538
to
092ce3d
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @Scienfitz. Thx, this is great, finally some improvement to one of our legacy code parts 👍🏼 Overall, the logic is solid and the PR in good shape. There are a couple of issue to be fixed, though
979233d
to
36d34d4
Compare
4ce705a
to
198ac87
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thx again, I think looks very good now
5360b49
to
f73980f
Compare
Co-authored-by: AdrianSosic <[email protected]>
f73980f
to
1d0d922
Compare
Here a resulting test looking at the speedup:

left_df
large versusright_df
) approaches 4x from aboveleft_df
andright_df
comparable in size or overall very small) the speedup can even be 40x