handson-ml3/18_reinforcement_learning.ipynb at 64f0e05a941897d3475c2f4cce7ca573f68daac4

mirror of https://github.com/ArthurDanjou/handson-ml3.git synced 2026-01-14 12:14:36 +01:00

Files

B D 64f0e05a94 Minor change on greedy policy variable usage

Chap 18, why not using directly the 'n_outputs' variable defined earlier, instead of hardcoded '2'

2021-02-28 12:02:23 +01:00

View Raw