Understand the dataset dimension #28
Comments
Hi, any issue regarding the data challenge should be posted in the data challenge repo that you just linked, to keep things tidy. Now regarding your question,
Ah, I missed the tile function in this repo. Now I get the right dimension for R. In the train notebook of this repo, the x and y read from the dataset have dimensions (7500, 18, 691) and (7500, 8, 672) respectively. My understanding is that there are 7500 rows, 18 labels for Z, 672 + 19 (latent labels for R) = 691 entries in one time series, and 8 labels to predict. When it comes to calculating the loss at Btw, great work, and I appreciate that you open sourced it.
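For anyone else puzzled by the shapes, here is a minimal NumPy sketch of how a time-independent array like R can be repeated along the time axis so that it lines up with Z. It is only an illustration under assumptions: the variable names, the small sample count, and the choice of concatenation axis are mine, and the repo's own tile code may arrange the axes differently.

```python
import numpy as np

# Hypothetical sizes: 4 samples instead of 7500 to keep the example light.
n_samples, n_r, n_z, n_steps = 4, 19, 18, 672

rng = np.random.default_rng(0)
R = rng.normal(size=(n_samples, n_r))           # time-independent inputs
Z = rng.normal(size=(n_samples, n_z, n_steps))  # time-dependent inputs

# Add a trailing time axis to R, then repeat it for every time step.
R_tiled = np.tile(R[:, :, np.newaxis], (1, 1, n_steps))

# Assumption: stack R and Z along the label axis (axis 1).
x = np.concatenate([Z, R_tiled], axis=1)

print(R_tiled.shape)  # (4, 19, 672)
print(x.shape)        # (4, 37, 672)
```

With this layout every time step carries the same 19 values of R next to the 18 values of Z, which is why R itself only needs to be stored once per row.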
I think you misunderstood the role of To understand how the Transformer converts a tensor of dimension
Thanks. I figured out the issue.
@maxjcohen did you see any improvement during training by providing the time-independent sequences in
Hi, I added variables contained in I haven't yet looked at the weights, but I agree they should be uniform, although you could argue that some cyclic patterns could appear. For instance, the "window area" variable holds most of its value during sunny hours, which could appear in the Transformer's weights.
I am using the npz_check function to generate the npz file. Before it dumps the data to npz, I printed out the dimensions of R, Z and X. They are:
R: (7500, 19) X: (7500, 8, 672) Z: (7500, 18, 672)
There are 7500 rows and 672 entries for one time series, as described by the challenge. 19, 8 and 18 are the numbers of labels for R, X and Z defined in labels.JSON. But I am wondering why R is not defined with 672 entries. Is there any particular reason to define it like this? The npz_check function and these variables are calculated in this file: https://github.com/maxjcohen/ozechallenge_benchmark/blob/master/src/utils.py#L218