
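Section 2.2.2, expectation derivation for the Bellman equation: hard to tell the random variables apart from the values being conditioned on #168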

Open
LiuShisan23 opened this issue Dec 11, 2024 · 1 comment

Comments

@LiuShisan23

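I'm having trouble telling which quantities are random variables and which are given values, and it gets even harder once the simplified letters are substituted in.
It may just be me, but could the derivation mark which values are given, or be spelled out in a bit more detail? Thanks for your work!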


LiuShisan23 commented Dec 11, 2024

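As stated above, once $s$ and $s'$ are given, $\sum_{g'} g' p(g' \mid s')$ is a fixed (non-random) value, so the conditional expectation is simply that value:

$\mathbb{E} \left[ \sum_{g'} g' p(g' \mid s') \mid s, s' \right] = \sum_{g'} g' p(g' \mid s') = \sum_{g'} g' p(g' \mid s', s).$

The last equality uses the Markov property: $p(g' \mid s') = p(g' \mid s', s)$.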
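For anyone else stuck on the same point, here is one way to write the whole chain with the random variables and the given values kept visually separate. This is only a sketch under my own notational assumptions, not the book's notation: uppercase $S$, $S'$, $G'$ stand for the random variables (current state, next state, next return), lowercase $s$, $s'$, $g'$ stand for particular values, and all spaces are assumed discrete so the expectations are sums.

```latex
\begin{align*}
\mathbb{E}\big[\,\mathbb{E}[G' \mid S']\;\big|\;S = s\big]
  &= \sum_{s'} p(s' \mid s)\,\mathbb{E}[G' \mid S' = s']
     && \text{($S'$ is random, $s$ is given)} \\
  &= \sum_{s'} p(s' \mid s) \sum_{g'} g'\, p(g' \mid s')
     && \text{(inner sum is a fixed number for each $s'$)} \\
  &= \sum_{s'} p(s' \mid s) \sum_{g'} g'\, p(g' \mid s', s)
     && \text{(Markov property)} \\
  &= \sum_{g'} g' \sum_{s'} p(g', s' \mid s)
     && \text{(product rule: $p(g' \mid s', s)\,p(s' \mid s) = p(g', s' \mid s)$)} \\
  &= \sum_{g'} g'\, p(g' \mid s)
     && \text{(marginalize over $s'$)} \\
  &= \mathbb{E}[G' \mid S = s].
\end{align*}
```

Read this way, the only random quantity inside the outer expectation is $S'$; $s$ is fixed throughout, and $s'$ and $g'$ are just summation indices.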
