Beta Phase: Square45 is currently in beta testing. Expect some features or content to be incomplete or missing.
45

Reward Function

A reward function assigns a numerical value to a state or state-action pair, guiding the agent's learning process towards desired outcomes.