[ENH] kNN Classifier and Regressor reimplementations #66
Conversation
fantastic, thanks @GuiArcencio. @chrisholder could you take a look please?
hey @GuiArcencio do you have those timing/memory graphs? If so, good to post them here
looks really good, thanks!
Resolved review comments on sktime/regression/distance_based/tests/test_time_series_neighbors.py
looks good to me
Reference Issues/PRs
What does this implement/fix? Explain your changes.
Reimplements `KNeighborsTimeSeriesClassifier` and `KNeighborsTimeSeriesRegressor` in order to fix memory leaks and replace the hard-coded "distances" string list. The reimplementation removes code in `fit` (which was useless in any case) and `predict`, as well as the `sklearn` `KNeighbors` instances contained within the models. The k-Neighbors algorithm is now implemented here, `np.argpartition`ing the distance vector into `[0..k-1]`, `[k]`, `[k+1..]`, which is cheaper than fully sorting it. Distance functions are obtained through `distance_factory`, uncoupling the selection of possible metrics from the model.
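As a rough illustration of the selection step (not the PR's actual code), the sketch below shows how `np.argpartition` can pick the k nearest neighbours from a precomputed distance vector without a full sort; the helper name `k_nearest_indices` and the toy data are hypothetical.

```python
import numpy as np

def k_nearest_indices(distances: np.ndarray, k: int) -> np.ndarray:
    """Return the indices of the k smallest entries of `distances`."""
    # argpartition places the k smallest distances in positions [0..k-1],
    # the next smallest at position [k], and the rest in [k+1..], in linear
    # time on average rather than O(n log n) for a full sort.
    nearest = np.argpartition(distances, k)[:k]
    # Order only the selected k neighbours by distance (cheap: k elements).
    return nearest[np.argsort(distances[nearest])]

# Toy example: distances from one query series to five training series.
dists = np.array([0.9, 0.2, 0.7, 0.1, 0.5])
print(k_nearest_indices(dists, k=3))  # -> [3 1 4]
```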
What should a reviewer concentrate their feedback on?
`n_jobs` is a parameter of the new classifier for compatibility purposes only; there is no parallelism implementation yet.

The following graphs (attached to the PR) show the results of benchmarks comparing the current and new implementations. The experiments were run on a regression dataset consisting of univariate time series of length 365. At each sample size, the data was split 70% / 30% between train and test, respectively. The distance was always set to 'euclidean'.
More benchmarks are underway: one using distance='dtw', and two others fixing the train size while varying the test size, and vice versa.
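For context, a minimal sketch of how such a timing run could be set up is shown below. It assumes the sktime-style import path visible in the review thread, the `distance="euclidean"` parameter, and synthetic data in place of the real regression dataset; the actual benchmarking scripts are not part of this PR description.

```python
import time

import numpy as np
from sklearn.model_selection import train_test_split
from sktime.regression.distance_based import KNeighborsTimeSeriesRegressor

# Synthetic stand-in for the benchmark data: univariate series of length 365.
rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 1, 365))  # (n_instances, n_channels, n_timepoints)
y = rng.standard_normal(1000)

# 70% / 30% train/test split, as in the experiments described above.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

reg = KNeighborsTimeSeriesRegressor(distance="euclidean")

start = time.perf_counter()
reg.fit(X_train, y_train)
y_pred = reg.predict(X_test)
print(f"fit + predict: {time.perf_counter() - start:.2f}s")
```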
Did you add any tests for the change?
Some tests were removed and/or changed due to being implementation-specific.
PR checklist
For all contributions