
Policy

A policy defines an agent's behavior by mapping states to actions. It is the central object that a reinforcement learning agent learns.
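
To make the state-to-action mapping concrete, here is a minimal Python sketch. It assumes a toy problem with two states and two actions; the state names, action names, and probabilities are hypothetical and chosen only for illustration. It shows a deterministic policy (one action per state) and a stochastic policy (a probability distribution over actions per state), both of which are common ways to represent a policy.

```python
import random

# Hypothetical toy states and actions, used only for illustration.
STATES = ["low_battery", "high_battery"]
ACTIONS = ["recharge", "search"]

# Deterministic policy: each state maps to exactly one action.
deterministic_policy = {
    "low_battery": "recharge",
    "high_battery": "search",
}

# Stochastic policy: each state maps to a probability distribution over actions.
# The probabilities are made-up values that sum to 1 for each state.
stochastic_policy = {
    "low_battery": {"recharge": 0.9, "search": 0.1},
    "high_battery": {"recharge": 0.2, "search": 0.8},
}


def act_deterministic(state):
    """Return the single action the deterministic policy assigns to `state`."""
    return deterministic_policy[state]


def act_stochastic(state, rng):
    """Sample an action for `state` from the stochastic policy's distribution."""
    distribution = stochastic_policy[state]
    actions = list(distribution.keys())
    weights = list(distribution.values())
    return rng.choices(actions, weights=weights, k=1)[0]


if __name__ == "__main__":
    rng = random.Random(0)  # seeded generator so runs are repeatable
    for state in STATES:
        print(state, "->", act_deterministic(state), "|", act_stochastic(state, rng))
```

In this sketch, learning would amount to adjusting the table entries (or the probabilities) so that the chosen actions lead to higher reward; the tables here are fixed by hand and nothing is learned.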