Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Guidance #3

Open
Hamza-101 opened this issue Feb 24, 2024 · 2 comments
Open

Guidance #3

Hamza-101 opened this issue Feb 24, 2024 · 2 comments

Comments

@Hamza-101
Copy link

Hello Andre,

I was wondering if you had a mathematical equation available for the reward function or could point it out to me in the code file. Much appreciated.

Also why did you only use only orientation based rewards (as mentioned in the repo?

Regards,
Hamza

@andredelft
Copy link
Owner

andredelft commented Feb 26, 2024

Hello Hamza,

Thanks for showing interest in my code!

The reward function is quite simple: the Bird class is initialized with a reward signal self.reward_signal, which is just a constant (by default it is 5). When in a given step the bird flies in the prefered direction (eastward), it is given this reward, otherwise it does not receive any reward. This is done in the function Bird.reward().

As for your question why I used orientation based rewards: I did that, because that has not been done in literature (at least at that time), and because I thought it was a suitable reward system to stimulate flock behaviour, without explicitly rewarding it, or implementing it in the system (as the Vicsek model does).

You can read more about the model in my thesis, which is publicly available herre: https://studenttheses.universiteitleiden.nl/access/item%3A2711425/view.

Best,

André

@Hamza-101
Copy link
Author

Hamza-101 commented Mar 3, 2024

I am trying to replicate your reward function for an open ended environment where I vary acceleration. 1. As I understand it you are just using alignment right? 2. Do you terminate an episode and if so with what conditions? 3. How do you ensure sufficient separation?

Thanks for the help.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants