-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Propose really easy reinforcement learning API for v3 #317
Comments
Proposal for basic feed forward style class: const net = new brain.Reinforce(); Proposal for "episodes" style class: const net = new brain.RNNReinforce();
const net = new brain.LSTMReinforce(); Inputs and outputs work like that of a standard net. Reward function: const net = new brain.Reinforce({
reward: (?) => {
return 1;
}
});
const net = new brain.LSTMReinforce({
reward: (?) => {
return 1;
}
}); |
In addition to I found an excellent article with step by step implementation here: https://medium.com/@quasimik/implementing-monte-carlo-tree-search-in-node-js-5f07595104df I want to find a good term for this that highlights the practice, over the algorithm, but will start to ponder over that based on these terms:
|
Just some idea... : const net = new brain.Reinforce(options)
.reward((some_args) => {
return some_value
}); @robertleeplummerjr Where can I find current V3 API proposal? |
Is there any ETA on when this will be a thing? This would be amazing if we could use reinforcement learning in brain.js! |
As someone who is pretty new to this, would this allow me to create a net that can learn over time (from running operations using According to the docs I need to train before running operations:
Maybe this isn't even correct? I seem to be able to switch between training and running without problem. |
Sorry everyone for my tardiness. I've been getting GPU.js, which is brain.js's v2 engine, ready, and am nearly done. I've been saying this for a while, but all I can say is: it will be right, and it will be revolutionary for javascript. I believe it (GPU.js v2) to be days from release. But I will take a break and answer this thread because it is very important to me, and I think about it frequently.
This is precisely what I mean.
v3 has not been fully proposed as of yet, but the latest is here:
I don't have one yet. Let's (all of us) get busy and start making it happen!
Yes. This proposal would be a continuous learning system that, rather than train towards an error rate of 0, it trains for highest rewards. But keep in mind this is a proposal for v3, so it is not yet even proof of concept. That being said, the GPU.js v2 release, when finally released, will make iteration much faster.
It isn't yet because we are talking about a hypothetical api that doesn't yet exist. |
Well the qlearn reinforcement models currently available on nodejs have simple api to reference from. I suggest taking a look at them |
This has gained some traction here: https://github.com/BrainJS/rl |
Any updates? |
Can you go ahead and try the project mentioned above? That will likely be the initial candidate. https://github.com/BrainJS/rl works fairly well and is fairly well tested. I want to find its weak area prior to analyzing if it is even possible for the API to fit here. |
TL;DR
TS;DR Consider this closed! |
What is wrong?
Where does it happen?
How do we replicate the issue?
How important is this (1-5)?
Expected behavior (i.e. solution)
Other Comments
The text was updated successfully, but these errors were encountered: