Update report #448

pilgrimygy · 2021-08-12T09:45:08Z

PR Checklist

Update NEWS.md?

findmyway · 2021-08-13T03:27:45Z

Could you update the format a bit to make the blog correctly rendered?

https://github.com/JuliaReinforcementLearning/ReinforcementLearning.jl/pull/449/files#diff-e7c24e4b1cefc93111f19774edad27c4dece4bdc3423f5b6762b11cc98396674R1-R23

findmyway · 2021-08-13T03:29:23Z

For citation, use the \dcite{dayan2009dopamine} format to include entries in the bib file.

For figures, use \dfig{body;2021-02-20_17_41_54-draft.pptx_-_PowerPoint.png; A general workflow between policy and environment.} this format.

You may refer https://github.com/JuliaReinforcementLearning/ReinforcementLearning.jl/blob/master/docs/homepage/blog/an_introduction_to_reinforcement_learning_jl_design_implementations_thoughts/index.md

pilgrimygy · 2021-08-13T03:30:13Z

This is my mistake. I will update it as soon as possible.

findmyway · 2021-08-15T05:47:33Z

I'll merge this first to help you sync your progress. We can address the gpu related issue later.

findmyway · 2021-08-15T05:55:24Z

Oops...

I merged the wrong PR

This reverts commit e05ed4e.

findmyway · 2021-08-15T05:58:19Z

docs/homepage/blog/index.md

+  - [Chapter13 Short Corridor.jl](/blog/notebooks_for_reinforcement_learning_an_introduction/Chapter13_Short_Corridor.jl)
+
+- [Phase 1 Technical Report of Enriching Offline Reinforcement Learning Algorithms in ReinforcementLearning.jl](/blog/offline_reinforcement_learning_algorithm_phase1)


Move this line to the top of this paragraph.

findmyway · 2021-08-15T06:03:37Z

docs/homepage/blog/offline_reinforcement_learning_algorithm_phase1/index.md

+    - `Project Information`
+    - `Project Schedule`
+    - `Future Plan`


Change this line into plain text to avoid the formatting issue

findmyway · 2021-08-15T06:03:53Z

docs/homepage/blog/offline_reinforcement_learning_algorithm_phase1/index.md

+        "authors": [
+            "author":"Guoyu Yang",
+            "authorURL":"https://github.com/pilgrimygy"
+            "affiliation":"",


Add a link to your university here.

findmyway · 2021-08-15T06:04:35Z

docs/homepage/blog/offline_reinforcement_learning_algorithm_phase1/index.md

+This technical report is the first evaluation report of Project "Enriching Offline Reinforcement Learning Algorithms in ReinforcementLearning.jl" in OSPP. It includes three components: project information, project schedule, future plan.
+## Project Information
+- Project name: Enriching Offline Reinforcement Learning Algorithms in ReinforcementLearning.jl
+- Scheme Description: Recent advances in offline reinforcement learning make it possible to turn reinforcement learning into a data-driven discipline, such that many effective methods from the supervised learning field could be applied. Until now, the only offline method provided in ReinforcementLearning.jl is behavior cloning. We'd like to have more algorithms added like BCQ, CQL. It is expected to implement at least three to four modern offline RL algorithms.


Add link to those concepts like BCQ, CQL,

findmyway · 2021-08-15T06:09:32Z

docs/homepage/blog/offline_reinforcement_learning_algorithm_phase1/index.md

+##### Variational Auto-Encoder (VAE)
+In offline reinforcement learning tasks, VAE is often used to learn from datasets to approximate behavior policy.
+
+VAE\dcite{DBLP:journals/corr/KingmaW13} ([link](https://github.com/pilgrimygy/ReinforcementLearning.jl/blob/framework/src/ReinforcementLearningCore/src/policies/q_based_policies/learners/approximators/neural_network_approximator.jl)) consists of two neural network: `encoder` and `decoder`.


Move the link to the first reference in the above line?

pilgrimygy added 3 commits August 12, 2021 17:44

update report

08c32df

update report

7da52f8

update report

5d97cbe

pilgrimygy and others added 4 commits August 14, 2021 16:00

update report

ff96dad

update report

389bc4d

Merge branch 'JuliaReinforcementLearning:master' into report

4e8acf8

Merge branch 'master' into report

c9c68c8

findmyway merged commit e05ed4e into JuliaReinforcementLearning:master Aug 15, 2021

findmyway added a commit that referenced this pull request Aug 15, 2021

Revert "Update report (#448)"

95db71f

This reverts commit e05ed4e.

findmyway mentioned this pull request Aug 15, 2021

Revert "Update report" #456

Merged

findmyway added a commit that referenced this pull request Aug 15, 2021

Revert "Update report (#448)" (#456)

50756bb

This reverts commit e05ed4e.

findmyway reviewed Aug 15, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Update report #448

Update report #448

Uh oh!

pilgrimygy commented Aug 12, 2021

Uh oh!

findmyway commented Aug 13, 2021

Uh oh!

findmyway commented Aug 13, 2021 •

edited

Loading

Uh oh!

pilgrimygy commented Aug 13, 2021

Uh oh!

findmyway commented Aug 15, 2021

Uh oh!

findmyway commented Aug 15, 2021

Uh oh!

findmyway Aug 15, 2021

Uh oh!

findmyway Aug 15, 2021

Uh oh!

findmyway Aug 15, 2021

Uh oh!

findmyway Aug 15, 2021

Uh oh!

findmyway Aug 15, 2021

Uh oh!

Uh oh!

		- [Chapter13 Short Corridor.jl](/blog/notebooks_for_reinforcement_learning_an_introduction/Chapter13_Short_Corridor.jl)

		- [Phase 1 Technical Report of Enriching Offline Reinforcement Learning Algorithms in ReinforcementLearning.jl](/blog/offline_reinforcement_learning_algorithm_phase1)

Uh oh!

Update report #448

Update report #448

Uh oh!

Conversation

pilgrimygy commented Aug 12, 2021

Uh oh!

findmyway commented Aug 13, 2021

Uh oh!

findmyway commented Aug 13, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pilgrimygy commented Aug 13, 2021

Uh oh!

findmyway commented Aug 15, 2021

Uh oh!

findmyway commented Aug 15, 2021

Uh oh!

findmyway Aug 15, 2021

Choose a reason for hiding this comment

Uh oh!

findmyway Aug 15, 2021

Choose a reason for hiding this comment

Uh oh!

findmyway Aug 15, 2021

Choose a reason for hiding this comment

Uh oh!

findmyway Aug 15, 2021

Choose a reason for hiding this comment

Uh oh!

findmyway Aug 15, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

findmyway commented Aug 13, 2021 •

edited

Loading