Code "cleanup"
#1149
Replies: 3 comments 6 replies
-
i don's see most of these as clear improvements, so no. dedupe can already handle a much more than a million records a day, what is your bottleneck? |
Beta Was this translation helpful? Give feedback.
1 reply
-
did you use one of the bigger data recipes from dedupe-examples? |
Beta Was this translation helpful? Give feedback.
5 replies
-
fantastic! |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi @fgregg, as I am reading the code of this library (with the aim to reach a system able to incrementally digest one million records a day...) I could take the chance to apply some "code cleanup" and open a few PRs.
All changes in these PRs would be about coding style and possibly some minor performance improvements, no implementation or behavioural changes at all!
For example, here are some specimens of what I may change in
variables/base.py
.Let me know if you are interested in these kinds of changes.
Direct import of entities:
would become
Use new f-string syntax:
would become
Remove useless blank lines:
would become
Always take a reference to objects before looping (to avoid repeated attribute lookup):
would become
Use Python3 super() syntax:
would become
Use comprehension when it results in simpler code:
would become
Beta Was this translation helpful? Give feedback.
All reactions