mirror of
https://github.com/ArthurDanjou/handson-ml3.git
synced 2026-01-14 20:19:29 +01:00
Fix equation 16-6 (max_alpha'=>max_a')
This commit is contained in:
@@ -974,7 +974,7 @@
|
||||
"**Equation 16-6: Q-Learning using an exploration function**\n",
|
||||
"\n",
|
||||
"$\n",
|
||||
"Q(s, a) \\gets (1-\\alpha)Q(s,a) + \\alpha\\left(r + \\gamma . \\underset{\\alpha'}{\\max}f(Q(s', a'), N(s', a'))\\right)\n",
|
||||
"Q(s, a) \\gets (1-\\alpha)Q(s,a) + \\alpha\\left(r + \\gamma \\, \\underset{a'}{\\max}f(Q(s', a'), N(s', a'))\\right)\n",
|
||||
"$\n",
|
||||
"\n",
|
||||
"**Equation 16-7: Target Q-Value**\n",
|
||||
|
||||
Reference in New Issue
Block a user