expected \gls{DEF_return} when starting from state x at time t, taking action a, and then following policy \gls{policy}