handson-ml3/18_reinforcement_learning.ipynb
B D 64f0e05a94 Minor change on greedy policy variable usage
Chapter 18: use the 'n_outputs' variable defined earlier instead of the hardcoded '2'.
2021-02-28 12:02:23 +01:00

81 KiB
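
The commit message above describes replacing a hardcoded action count with the 'n_outputs' variable in the greedy policy. The following is a minimal sketch of what that kind of change looks like, not the notebook's actual code: the function name `epsilon_greedy_policy`, the `q_values` argument, and the `rng` parameter are assumptions made for illustration, and Q-values are passed in directly so the example runs without a trained model.

```python
import numpy as np

# Assumed value: number of discrete actions (2 would match the hardcoded '2').
n_outputs = 2

def epsilon_greedy_policy(q_values, epsilon=0.0, rng=None):
    """Return a random action with probability epsilon, else the greedy one.

    Hypothetical helper: q_values is a 1-D array holding one estimated
    Q-value per action, with length n_outputs.
    """
    rng = np.random.default_rng() if rng is None else rng
    if rng.random() < epsilon:
        # Before the change this bound would have been the literal 2;
        # using n_outputs keeps the policy in sync with the action count.
        return int(rng.integers(n_outputs))
    return int(np.argmax(q_values))

# Usage: purely greedy choice, then purely random choice.
greedy_action = epsilon_greedy_policy(np.array([0.1, 0.9]), epsilon=0.0)
random_action = epsilon_greedy_policy(np.array([0.1, 0.9]), epsilon=1.0)
```

Referring to `n_outputs` rather than a literal means the exploration branch still samples from the full action set if the number of actions changes.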