Issues: dedupeio/dedupe
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
What is the system core and memory benchmark for dedupe library for bigdata
#1024
opened May 20, 2022 by
sarbaniAi
benchmark runs with training separately than runs that use settings file
#1006
opened May 5, 2022 by
fgregg
Change default sample size or change sampling scheme based Tahamont's, et. al.'s findings
#980
opened Mar 10, 2022 by
fgregg
scoring pairs is much slower after training then after loading settings file.
#977
opened Mar 1, 2022 by
fgregg
disk has reached capacity issue with moderate record size with >500 gb of free disk space
#965
opened Feb 18, 2022 by
zwarshavsky
consider using bisection in filtering of connected components size
#957
opened Feb 6, 2022 by
fgregg
Performance degrades when loading/training with large labeled training file to prepare_train()
#940
opened Jan 25, 2022 by
cbhower
deprecate recall argument for precision and expose argument for tree depth
enhancement
#934
opened Jan 19, 2022 by
fgregg
ergonomics for working with differently named fields in linkage and gazetteer mode
enhancement
#867
opened Dec 14, 2020 by
fgregg
Previous Next
ProTip!
Updated in the last three days: updated:>2022-06-03.

