Why are Q network and target network the same? #31

QionghuaLiao · 2021-12-27T12:08:14Z

Usually, the Q Network is trained while the parameters of target network are fixed. And every certain steps, the parameters of Q Network will be copied to Target Network. But when I check your code, I find that the Q Network and the Target Network are the same neural network, which confuses me. Could you please help me out?

wxwmd · 2022-08-09T10:04:37Z

你说的机制是为了减少训练的震荡，这个demo项目就没采用这个机制，直接每步都让q网络更新呗，这有啥confuse的。。。。。。。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why are Q network and target network the same? #31

Why are Q network and target network the same? #31

QionghuaLiao commented Dec 27, 2021

wxwmd commented Aug 9, 2022

Why are Q network and target network the same? #31

Why are Q network and target network the same? #31

Comments

QionghuaLiao commented Dec 27, 2021

wxwmd commented Aug 9, 2022