Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed broken shakespeare_with_tpu_and_keras.ipynb example #1026

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

dcerisano
Copy link

@dcerisano dcerisano commented Aug 18, 2023

Not sure why this example is now broken on Colab.
Thinking version control problem between numpy and tensorflow.

Essentially, the shape of the numpy prediction array is inconsistent, and is now being rejected by tensorflow's prediction method.

In the current version of the example, prediction breaks on the second prediction because it is expecting an array of single ascii value arrays (like the first seeded input).

However the array that is appended for the next prediction is just an array of ascii values, which causes the predict function to throw an error.

Also, the printing section is expecting an array of ascii values instead of the array of single value ascii arrays used for storing predictions.

The fixes are minor, simply making sure each character is created as a single ascii value array, and then indexed that way during printing of results.

Side note: I got much better results by increasing (4X) the number of epochs, step size and embedding dimensions during training. Results in a much higher resolution weights file that produces much more cogent outputs.

Not sure why this example is now broken on Colab.
Thinking version control problem between numpy and tensorflow.

In the current version of the example, prediction breaks on the second prediction because it is expecting an array of single ascii value arrays (like the first seeded input).

However the array that is appended for the next prediction is just an array of ascii values, which causes the predict function to throw an error.

Also, the printing section is expecting an array of ascii values instead of the array of  single value ascii arrays used for storing predictions.

The fixes are minor, simply making sure each character is  created as a single ascii value array, and then indexed that way during printing of results.

Side note: I got much better results by increasing (4X)  the number of epochs, step size and embedding dimensions during training. Results in a much higher resolution weights file that produces much more cogent outputs.
@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant