Skip to content

Latest commit

 

History

History
4 lines (2 loc) · 162 Bytes

File metadata and controls

4 lines (2 loc) · 162 Bytes

Write out the parameter update equations for TD learning with $$\hat{U}(x,y) = \theta_0 + \theta_1 x + \theta_2 y + \theta_3,\sqrt{(x-x_g)^2 + (y-y_g)^2}\ .$$