Replies: 3 comments 5 replies
-
Silhuette score is O(n^2) algorithm and it is thus slow (and uses lots of memory) for big data sets. Users often tried to use it with data sets that were too big, and their Orange therefore crashed. Because it is used with K-Means, which you can use on big data sets precisely because it is fast, we limited silhouette to 5000 instances, otherwise silhuette score would make k-means useless in those cases. |
Beta Was this translation helpful? Give feedback.
-
@arunado Do you have a suggestion for a better error? |
Beta Was this translation helpful? Give feedback.
-
I'm with @arunado on this one. Crashing should be fixed by widget multiprocessing. Until then, can we just make widgets which would take a long time only run manually? As in, apply automatically is disabled, and a warning is thrown asking the user to press 'Apply changes' for processing to begin, knowing full well it may take a long time. |
Beta Was this translation helpful? Give feedback.
-
Please help me understand this error? cannot create Silhouette score > 5000 records
WHat should be done here?
Beta Was this translation helpful? Give feedback.
All reactions