the actual reward value received at a time (denoted ); random due to the stochastic nature of the environment; expected value \gls{fnReward}
1 min read
the actual reward value received at a time t (denoted \glsrvRewardt); random due to the stochastic nature of the environment; expected value \gls{fnReward}