You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Some segfaults arose in scikit-tree during the implementation of this PR: neurodata/treeple#249.
Some actionable items came to mind to improve what we have:
should ideally get rewritten, or at least more comments. Rn it is hard to parse what comprises initial_roots. Since this is part of the Cython codebase, it is thus a critical piece as segfaults are time-consuming and difficult to chase down.
Next, we prolly want to include a clear description for developers on what the differences are here:
Features, or documentation
Part of sklearn handles monotonic constraints and n_constant_features tracking. From first glance it is also not clear that these are actually tracked. I.e. is the monotonic constraint and n_constant_features if we do fit and then partial_fit for two subsets of the data different from if we just did fit for the entire dataset? If they are different, what does this imply?
In an ideal world the state is the same.
The text was updated successfully, but these errors were encountered:
Some segfaults arose in scikit-tree during the implementation of this PR: neurodata/treeple#249.
Some actionable items came to mind to improve what we have:
Documentation:
scikit-learn/sklearn/tree/_tree.pyx
Lines 211 to 273 in f360749
initial_roots
. Since this is part of the Cython codebase, it is thus a critical piece as segfaults are time-consuming and difficult to chase down.Next, we prolly want to include a clear description for developers on what the differences are here:
scikit-learn/sklearn/tree/_tree.pyx
Lines 302 to 320 in f360749
Features, or documentation
Part of sklearn handles monotonic constraints and n_constant_features tracking. From first glance it is also not clear that these are actually tracked. I.e. is the monotonic constraint and n_constant_features if we do
fit
and thenpartial_fit
for two subsets of the data different from if we just didfit
for the entire dataset? If they are different, what does this imply?In an ideal world the state is the same.
The text was updated successfully, but these errors were encountered: