Dear author, hello!
Seeing this loss function, I have some questions. S uses the value at time t0, and the value of Q at time t is copied from time t0. So the part of the time that actually has an effect is v_t_i. So, is the purpose of introducing the "track" just to weight the loss?
Dear author, hello!
Seeing this loss function, I have some questions. S uses the value at time t0, and the value of Q at time t is copied from time t0. So the part of the time that actually has an effect is v_t_i. So, is the purpose of introducing the "track" just to weight the loss?