Propose really easy reinforcement learning API for v3 #317

robertleeplummerjr · 2018-12-13T21:39:13Z

What is wrong?

Where does it happen?

How do we replicate the issue?

How important is this (1-5)?

Expected behavior (i.e. solution)

Other Comments

robertleeplummerjr · 2018-12-24T15:01:35Z

Proposal for basic feed forward style class:

const net = new brain.Reinforce();

Proposal for "episodes" style class:

const net = new brain.RNNReinforce();
const net = new brain.LSTMReinforce();

Inputs and outputs work like that of a standard net. Reward function:

const net = new brain.Reinforce({
  reward: (?) => {
    return 1;
  }
});
const net = new brain.LSTMReinforce({
  reward: (?) => {
    return 1;
  }
});

robertleeplummerjr · 2018-12-29T11:47:13Z

In addition to Reinforce, there is another part of reinforcement learning that is super important for winning at games, like Chess, which is used for exploration. As far as I'm aware the state of the art best practice is the "Monte Carlo" algorithm and I'd like to at least consider how to generalize an implementation of this where the net can have this fit in like a puzzle piece that is super simple.

I found an excellent article with step by step implementation here: https://medium.com/@quasimik/implementing-monte-carlo-tree-search-in-node-js-5f07595104df

I want to find a good term for this that highlights the practice, over the algorithm, but will start to ponder over that based on these terms:

Monte Carlo
Exploitation
Tree
Strategy

DanielMazurkiewicz · 2019-01-16T18:31:26Z

Just some idea... :

const net = new brain.Reinforce(options)
   .reward((some_args) => {
      return some_value
   });

@robertleeplummerjr Where can I find current V3 API proposal?

samfultonjr · 2019-04-29T23:48:17Z

Is there any ETA on when this will be a thing? This would be amazing if we could use reinforcement learning in brain.js!

maximilliangeorge · 2019-05-14T07:31:48Z

As someone who is pretty new to this, would this allow me to create a net that can learn over time (from running operations using net.run())?

According to the docs I need to train before running operations:

Use train() to train the network with an array of training data. The network has to be trained with all the data in bulk in one call to train()

Maybe this isn't even correct? I seem to be able to switch between training and running without problem.

robertleeplummerjr · 2019-05-14T20:03:44Z

Sorry everyone for my tardiness. I've been getting GPU.js, which is brain.js's v2 engine, ready, and am nearly done. I've been saying this for a while, but all I can say is: it will be right, and it will be revolutionary for javascript. I believe it (GPU.js v2) to be days from release. But I will take a break and answer this thread because it is very important to me, and I think about it frequently.

const net = new brain.Reinforce(options)
 .reward((some_args) => {
     return some_value
  });

This is precisely what I mean.

@robertleeplummerjr Where can I find current V3 API proposal?

v3 has not been fully proposed as of yet, but the latest is here:
https://github.com/BrainJS/brain.js/wiki/Roadmap

Is there any ETA on when this will be a thing? This would be amazing if we could use reinforcement learning in brain.js!

I don't have one yet. Let's (all of us) get busy and start making it happen!

As someone who is pretty new to this, would this allow me to create a net that can learn over time (from running operations using net.run())?

Yes. This proposal would be a continuous learning system that, rather than train towards an error rate of 0, it trains for highest rewards. But keep in mind this is a proposal for v3, so it is not yet even proof of concept. That being said, the GPU.js v2 release, when finally released, will make iteration much faster.

Maybe this isn't even correct? I seem to be able to switch between training and running without problem.

It isn't yet because we are talking about a hypothetical api that doesn't yet exist.

myisaak · 2020-02-09T03:37:51Z

Well the qlearn reinforcement models currently available on nodejs have simple api to reference from. I suggest taking a look at them

robertleeplummerjr · 2022-11-10T18:04:47Z

This has gained some traction here: https://github.com/BrainJS/rl
I wanted to improve the initial design before "brainjsing" it.

BasToTheMax · 2023-04-13T15:18:05Z

Any updates?

robertleeplummerjr · 2023-04-13T15:34:13Z

Can you go ahead and try the project mentioned above? That will likely be the initial candidate. https://github.com/BrainJS/rl works fairly well and is fairly well tested. I want to find its weak area prior to analyzing if it is even possible for the API to fit here.

robertleeplummerjr · 2023-05-10T19:38:03Z

TL;DR
You can now install the experimental version from https://www.npmjs.com/package/@brain/rl via:

npm install @brain/rl

TS;DR
We secured the "brain" organization from NPM!!!! This took YEARS of work and patience, and communicating with them through their immense workload, and THEY FINALLY DID IT! The initial release of the reinforcement learning project based on work from Andrej Karpathy is now available from npm as https://www.npmjs.com/package/@brain/rl.

Consider this closed!

robertleeplummerjr changed the title ~~Propose really easy reinforcement learning API for vx~~ Propose really easy reinforcement learning API for v3 Dec 13, 2018

This was referenced Jan 3, 2019

Web machine learning @ W3 #337

Closed

LSTMTimeStep can't forecast a simple array of numbers... #342

Closed

robertleeplummerjr mentioned this issue Jan 10, 2019

Let's learn a game #345

Closed

mubaidr added 2 - Working <= 5 enhancement labels Apr 30, 2020

Vectorrent mentioned this issue Mar 21, 2023

Reinforcement Learning Feature Request #883

Closed

robertleeplummerjr closed this as completed May 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Propose really easy reinforcement learning API for v3 #317

Propose really easy reinforcement learning API for v3 #317

robertleeplummerjr commented Dec 13, 2018

robertleeplummerjr commented Dec 24, 2018 •

edited

Loading

robertleeplummerjr commented Dec 29, 2018 •

edited

Loading

DanielMazurkiewicz commented Jan 16, 2019

samfultonjr commented Apr 29, 2019

maximilliangeorge commented May 14, 2019 •

edited

Loading

robertleeplummerjr commented May 14, 2019

myisaak commented Feb 9, 2020

robertleeplummerjr commented Nov 10, 2022

BasToTheMax commented Apr 13, 2023

robertleeplummerjr commented Apr 13, 2023 •

edited

Loading

robertleeplummerjr commented May 10, 2023

Propose really easy reinforcement learning API for v3 #317

Propose really easy reinforcement learning API for v3 #317

Comments

robertleeplummerjr commented Dec 13, 2018

What is wrong?

Where does it happen?

How do we replicate the issue?

How important is this (1-5)?

Expected behavior (i.e. solution)

Other Comments

robertleeplummerjr commented Dec 24, 2018 • edited Loading

robertleeplummerjr commented Dec 29, 2018 • edited Loading

DanielMazurkiewicz commented Jan 16, 2019

samfultonjr commented Apr 29, 2019

maximilliangeorge commented May 14, 2019 • edited Loading

robertleeplummerjr commented May 14, 2019

myisaak commented Feb 9, 2020

robertleeplummerjr commented Nov 10, 2022

BasToTheMax commented Apr 13, 2023

robertleeplummerjr commented Apr 13, 2023 • edited Loading

robertleeplummerjr commented May 10, 2023

robertleeplummerjr commented Dec 24, 2018 •

edited

Loading

robertleeplummerjr commented Dec 29, 2018 •

edited

Loading

maximilliangeorge commented May 14, 2019 •

edited

Loading

robertleeplummerjr commented Apr 13, 2023 •

edited

Loading