JAX time series classification #105
base: main
Conversation
Force-pushed 0e0f205 to a7d3a6f
It's ready for a review!
Looks good – it's a bit sparse on the description of what the code is doing, but we can probably address that in a later pass.
Please resolve the conflicts, and then we can merge this. Thanks!
Force-pushed a7d3a6f to fcc6b33
Rebased! I also added a couple of descriptions before some of the code cells, so there's more narrative.
@mtsokol - Thank you for opening this PR. I've added a few more suggestions for narration based on:
- Keras: Timeseries classification from scratch
- Tensorflow: TFDS for Jax and PyTorch
- Flax: MNIST tutorial
Please verify the accuracy of the statements. I've tried to maintain formatting, but these are still GitHub-suggestions, so you may need to update line breaks and such as needed.
# Time series classification with JAX

In this tutorial, we're going to perform time series classification with a Convolutional Neural Network.
We're going to use FordA dataset from the [UCR archive](https://www.cs.ucr.edu/%7Eeamonn/time_series_data_2018/).
Suggested change:
- We're going to use FordA dataset from the [UCR archive](https://www.cs.ucr.edu/%7Eeamonn/time_series_data_2018/).
+ We will use the FordA dataset from the [UCR archive](https://www.cs.ucr.edu/%7Eeamonn/time_series_data_2018/), which contains measurements of engine noise captured by a motor sensor.
Done!
The problem we're facing is to assess if an engine is malfunctioning based on recorded noises it generates.
Each sample is comprised of noise measurements across time, together with a "yes/no" label, so it's a binary classification problem.

Although convolution models are mainly associated with image processing, they are useful also for time series data as they're able to extract temporal structures.
Suggested change:
- Although convolution models are mainly associated with image processing, they are useful also for time series data as they're able to extract temporal structures.
+ Although convolution models are mainly associated with image processing, they are also useful for time series data because they can extract temporal structures.
Done!
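As a side note on the claim about temporal structures (this sketch is not part of the tutorial; the signal and kernel values are made up for illustration), a 1-D convolution slides a small kernel along the time axis and responds to local temporal patterns:

```python
import numpy as np

# A toy 1-D signal: flat, then a sudden step (e.g. a change in engine noise).
signal = np.array([0.0, 0.0, 0.0, 1.0, 1.0, 1.0])

# A difference kernel responds to local changes over time, much like a
# learned 1-D convolution filter can respond to temporal structure.
kernel = np.array([1.0, -1.0])

# "valid" cross-correlation: each output is a dot product of the kernel
# with a sliding window of the signal.
response = np.correlate(signal, kernel, mode="valid")
print(response)  # nonzero only where the signal changes: [ 0.  0. -1.  0.  0.]
```

A CNN learns many such kernels from data instead of hand-picking them.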
```{code-cell} ipython3
# Required packages
# !pip install -U jax flax optax
# !pip install -U grain tqdm requests matplotlib
```

## Tools overview

Here's a list of key packages that belong to JAX AI stack:

- [JAX](https://github.com/jax-ml/jax) will be used for array computations.
- [Flax](https://github.com/google/flax) for constructing neural networks.
- [Optax](https://github.com/google-deepmind/optax) for gradient processing and optimization.
- [Grain](https://github.com/google/grain/) will be used to define data sources.
- [tqdm](https://tqdm.github.io/) for a progress bar to monitor the training progress.
I think moving the installation cell after introducing the libraries has a nicer flow. :)
I've also modified the title and made the list have a consistent narrative.
Suggested change:
- ```{code-cell} ipython3
- # Required packages
- # !pip install -U jax flax optax
- # !pip install -U grain tqdm requests matplotlib
- ```
- ## Tools overview
- Here's a list of key packages that belong to JAX AI stack:
- - [JAX](https://github.com/jax-ml/jax) will be used for array computations.
- - [Flax](https://github.com/google/flax) for constructing neural networks.
- - [Optax](https://github.com/google-deepmind/optax) for gradient processing and optimization.
- - [Grain](https://github.com/google/grain/) will be used to define data sources.
- - [tqdm](https://tqdm.github.io/) for a progress bar to monitor the training progress.
+ ## Tools overview and setup
+ Here's a list of key packages that belong to the JAX AI stack required for this tutorial:
+ - [JAX](https://github.com/jax-ml/jax) for array computations.
+ - [Flax](https://github.com/google/flax) for constructing neural networks.
+ - [Optax](https://github.com/google-deepmind/optax) for gradient processing and optimization.
+ - [Grain](https://github.com/google/grain/) to define data sources.
+ - [tqdm](https://tqdm.github.io/) for a progress bar to monitor the training progress.
+ We'll start by installing and importing these packages.
+ ```{code-cell} ipython3
+ # Required packages
+ # !pip install -U jax flax optax
+ # !pip install -U grain tqdm requests matplotlib
+ ```
Done!
The problem we're facing is to assess if an engine is malfunctioning based on recorded noises it generates.
Each sample is comprised of noise measurements across time, together with a "yes/no" label, so it's a binary classification problem.
Suggested change:
- The problem we're facing is to assess if an engine is malfunctioning based on recorded noises it generates.
- Each sample is comprised of noise measurements across time, together with a "yes/no" label, so it's a binary classification problem.
+ We need to assess if an engine is malfunctioning based on the recorded noises it generates.
+ Each sample comprises noise measurements across time, together with a "yes/no" label, so this is a binary classification problem.
Done!
We load dataset files into NumPy arrays, add singleton dimention to take into
the account convolution features, and change `-1` label to `0` value:
Suggested change:
- We load dataset files into NumPy arrays, add singleton dimention to take into
- the account convolution features, and change `-1` label to `0` value:
+ We load dataset files into NumPy arrays, add a singleton dimension to take convolution features into account, and change the `-1` label to `0` (so that the expected values are `0` and `1`):
Done!
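For reference, the preprocessing described above can be sketched in plain NumPy; the array shapes and values here are illustrative stand-ins, not the real FordA data:

```python
import numpy as np

# Illustrative stand-in for loaded data: 4 samples, 8 time steps each,
# with labels in {-1, 1} as stored in the FordA files.
x = np.zeros((4, 8))
y = np.array([-1, 1, -1, 1])

# Add a trailing singleton channel dimension so that 1-D convolutions
# see arrays shaped (batch, time, features).
x = x[..., np.newaxis]

# Remap the -1 label to 0 so labels are {0, 1}.
y = np.where(y == -1, 0, y)

print(x.shape)  # (4, 8, 1)
print(y)        # [0 1 0 1]
```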
optimizer = nnx.Optimizer(model, optax.adam(learning_rate, momentum))
```
We'll define a loss and logits computation function using
Optax's [`losses.softmax_cross_entropy_with_integer_labels`](https://optax.readthedocs.io/en/latest/api/losses.html#optax.losses.softmax_cross_entropy_with_integer_labels).
Done!
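For readers unfamiliar with this loss, here is a NumPy sketch of the quantity that `softmax_cross_entropy_with_integer_labels` computes; it's an illustrative re-implementation with made-up logits, not Optax's actual code:

```python
import numpy as np

def softmax_cross_entropy_with_integer_labels(logits, labels):
    # Numerically stable log-softmax: subtract the per-row max before exp.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    # Negative log-probability assigned to the true class of each sample.
    return -log_probs[np.arange(len(labels)), labels]

# Two samples of a binary problem: logits per class, plus integer labels.
logits = np.array([[2.0, 0.5], [0.1, 1.9]])
labels = np.array([0, 1])
loss = softmax_cross_entropy_with_integer_labels(logits, labels).mean()
print(loss)  # small, since both samples favor the correct class
```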
  ).mean()
  return loss, logits
```
We'll now define the training and evaluation step functions.
The loss and logits from both functions will be used for calculating accuracy metrics.
For training, we'll use `nnx.value_and_grad` to compute the gradients, and then update the model's parameters using our optimizer.
Notice the use of [`nnx.jit`](https://flax.readthedocs.io/en/latest/api_reference/flax.nnx/transforms.html#flax.nnx.jit). This sets up the functions for just-in-time (JIT) compilation with [XLA](https://openxla.org/xla) for performant execution across different hardware accelerators like GPUs and TPUs.
Done!
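The tutorial's training step relies on Flax and Optax; as a framework-free illustration of what a value-and-grad update does, here is a hand-derived gradient step for a toy one-parameter model (the model, data, and learning rate are made up for this sketch):

```python
import numpy as np

# Toy data and a single-parameter linear model y_hat = w * x;
# the true relationship is y = 2 * x.
x = np.array([1.0, 2.0, 3.0])
y = np.array([2.0, 4.0, 6.0])
w = 0.0

def loss_and_grad(w):
    # Mean squared error and its analytic gradient w.r.t. w. This pairing
    # of (loss value, gradient) is what nnx.value_and_grad produces
    # automatically for the real model.
    residual = w * x - y
    loss = (residual ** 2).mean()
    grad = 2.0 * (residual * x).mean()
    return loss, grad

learning_rate = 0.1
for _ in range(50):
    loss, grad = loss_and_grad(w)
    w -= learning_rate * grad  # the optimizer's parameter update step
print(w)  # approaches the true slope, 2.0
```

In the notebook, Optax's Adam plays the role of the plain gradient-descent update shown here.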
  "test_accuracy": [],
}
```
We can now train the CNN model.
We'll evaluate the model's performance on the test set after each epoch,
and print the metrics: total loss and accuracy.
Done!
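As a small aside on the accuracy metric mentioned above, a NumPy sketch with made-up logits shows how predictions and accuracy are typically derived from model outputs:

```python
import numpy as np

# Logits for 5 samples of a binary problem, plus their true labels.
logits = np.array([[2.0, 0.1],
                   [0.2, 1.5],
                   [1.0, 0.9],
                   [0.1, 2.2],
                   [3.0, 0.5]])
labels = np.array([0, 1, 1, 1, 0])

# The predicted class is the argmax over logits; accuracy is the mean
# agreement between predictions and labels.
predictions = logits.argmax(axis=-1)
accuracy = (predictions == labels).mean()
print(accuracy)  # 0.8 here: 4 of 5 predictions match
```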
  train_one_epoch(epoch)
  evaluate_model(epoch)
```
Finally, let's visualize the loss and accuracy with Matplotlib.
Done!
For model early stopping and selecting best model there's [Orbax](https://github.com/google/orbax)
library which provides checkpointing and persistence utilities.
Suggested change:
- For model early stopping and selecting best model there's [Orbax](https://github.com/google/orbax)
- library which provides checkpointing and persistence utilities.
+ For model early stopping and selecting the best model, you can check out [Orbax](https://github.com/google/orbax),
+ a library which provides checkpointing and persistence utilities.
Done!
This PR adds a tutorial on JAX time series classification using the FordA dataset from the UCR/UEA archive. It follows one of the Keras tutorials. It's still missing the narrative.