Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ENH] Terminology for multivariate classification problems: columns, dimensions or channels? #121

Closed
TonyBagnall opened this issue Feb 27, 2023 · 3 comments · Fixed by #162
Labels
classification Classification package documentation Improvements or additions to documentation enhancement New feature, improvement request or other non-bug code enhancement

Comments

@TonyBagnall
Copy link
Contributor

I am rewriting the classification notebook to focus more on numpy arrays. There are some terminology issues to fix. Using basic motions as an example, which has 40 instances, 6 dimensions and series length 100. We store this in an array shape (n_instances, n_dimensions, n_timepoints),

What do we call axis 1: columns, dimensions or channels? I do not like columns, it makes no sense with numpy 3D arrays. I have in the past mostly used dimensions, but then this creates a confusion with the array dimensions, n_dims. I am leaning to a complete switch to call them channels. I think channels works for basic motions, but it could be weird in some applications though.

Any thoughts?

@TonyBagnall TonyBagnall added documentation Improvements or additions to documentation enhancement New feature, improvement request or other non-bug code enhancement classification Classification package labels Feb 27, 2023
@haskarb
Copy link
Contributor

haskarb commented Mar 1, 2023

In our group, we discussed the same a few years back; if I remember correctly, we didn't choose the "dimension" because it confuses people from tabular data. The channel seemed ambiguous, yet better because it could be data from video, image, or sensors, where the term is prevalent.

@hadifawaz1999
Copy link
Member

I think the term "channels" is the best fit here, especially that in the literature they use "channel" term for multivariate TS.

@TonyBagnall
Copy link
Contributor Author

ok, so channels it is!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
classification Classification package documentation Improvements or additions to documentation enhancement New feature, improvement request or other non-bug code enhancement
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants