Compare commits

...

3 Commits

Author SHA1 Message Date
aabb297d4f Add TP4 about AI Agents 2026-03-04 18:26:00 +01:00
63ebb3ec8d Enhance Monte Carlo agent with performance optimizations and memory efficiency
- Updated weight and feature storage to use float32 for reduced memory bandwidth.
- Implemented compact storage for raw observations as uint8, batch-normalized at episode end.
- Introduced vectorized return computation and chunk-based weight updates using einsum.
- Reduced weight sanitization to once per episode instead of per-step.
- Refactored action selection and return calculation for improved efficiency.
2026-03-04 18:25:43 +01:00
4bfae564ed Implement feature X to enhance user experience and optimize performance 2026-03-04 17:21:34 +01:00
2 changed files with 3773 additions and 126 deletions

File diff suppressed because it is too large

File diff suppressed because one or more lines are too long
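
Since both diffs are suppressed, the following is only a rough sketch of what the optimizations listed in commit 63ebb3ec8d could look like, not the code from the commit itself. It assumes a linear Monte Carlo agent with one weight row per action; the function names (`discounted_returns`, `episode_update`), the 0..255 observation scaling, the clipping bound, and all parameters are hypothetical.

```python
import numpy as np


def discounted_returns(rewards, gamma):
    """Vectorized G_t = sum_{k>=t} gamma^(k-t) * r_k, no per-step Python loop.

    Assumes 0 < gamma <= 1. Computed in float64 because gamma**t can get
    very small on long episodes; the result is cast to float32 at the end.
    """
    rewards = np.asarray(rewards, dtype=np.float64)
    discounts = gamma ** np.arange(len(rewards))
    # Reverse cumulative sum of gamma^k * r_k, then divide by gamma^t.
    tail_sums = np.cumsum((rewards * discounts)[::-1])[::-1]
    return (tail_sums / discounts).astype(np.float32)


def episode_update(weights, raw_obs, actions, rewards,
                   gamma=0.99, alpha=0.01, chunk_size=256):
    """End-of-episode update of float32 weights, shape (n_actions, n_features).

    raw_obs is stored compactly as uint8 during the episode and only
    normalized to float32 here, one chunk at a time.
    """
    returns = discounted_returns(rewards, gamma)
    actions = np.asarray(actions)
    n_actions = weights.shape[0]

    for start in range(0, len(actions), chunk_size):
        sl = slice(start, start + chunk_size)
        feats = raw_obs[sl].astype(np.float32) / 255.0  # assumed scaling
        acts = actions[sl]
        # Predicted value of the taken action at each step of the chunk.
        q = np.einsum("tf,tf->t", weights[acts], feats)
        # Scatter the per-step errors into a (steps, n_actions) matrix.
        err = np.zeros((len(acts), n_actions), dtype=np.float32)
        err[np.arange(len(acts)), acts] = returns[sl] - q
        # Sum the per-step gradients for every action in one einsum call.
        weights += alpha * np.einsum("ta,tf->af", err, feats)

    # Sanitize once per episode instead of after every step.
    np.nan_to_num(weights, copy=False)
    np.clip(weights, -1e6, 1e6, out=weights)  # hypothetical bound
    return weights
```

Note that an update applied per chunk at the end of the episode is not numerically identical to a per-step update, because every step within a chunk sees the same weights; whether the commit accepts that trade-off cannot be determined from this page.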