Peter's wiki

Return (RV)

Apr 30, 2025

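Total discounted reward obtained from time $t$ onwards, given a Markov decision process (MDP) and a policy. Because the rewards depend on the stochastic transitions of the MDP and on the actions chosen by the policy, the return is a random variable.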
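As a minimal sketch of the definition, the return for one sampled trajectory can be computed by summing rewards weighted by powers of a discount factor. The notation here is an assumption, not taken from this page: $\gamma \in [0, 1]$ is the discount factor, `rewards[k]` is the reward received after acting at time step $k$, and the return from time $t$ is $\sum_{k \ge 0} \gamma^k \cdot \texttt{rewards}[t + k]$. The function name `discounted_return` is hypothetical.

```python
from typing import Sequence


def discounted_return(rewards: Sequence[float], gamma: float, t: int = 0) -> float:
    """Discounted return of one finite trajectory from time step t onwards.

    Assumed convention (not from the source page): rewards[k] is the reward
    received after acting at time step k, and gamma is the discount factor.
    """
    if not 0.0 <= gamma <= 1.0:
        raise ValueError("gamma must lie in [0, 1]")
    g = 0.0
    # Accumulate backwards using the recursion G_k = r_k + gamma * G_{k+1}.
    for r in reversed(rewards[t:]):
        g = r + gamma * g
    return g


if __name__ == "__main__":
    # One sampled trajectory; a different rollout would give a different return.
    print(discounted_return([1.0, 1.0, 1.0], gamma=0.5))
    print(discounted_return([1.0, 1.0, 1.0], gamma=0.5, t=1))
```

The value computed this way is a single realization of the return; averaging it over many trajectories sampled under the same policy gives an estimate of its expectation.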
