Random Network Distillation in Pytorch

Kimiya Saadat

2025/03/04

Categories: projects Tags: Juicy

An example of a improvement of the reward when transfer is done. I developed a pytorch version of OpenAI’s RND (Random Network Distillation with Proximal Policy Optimization) and trained it on Montezuma’s Revenge to address the challenge of sparse rewards.

See the source code.

>> Home