-
Notifications
You must be signed in to change notification settings - Fork 578
Open
Labels
questionFurther information is requestedFurther information is requested
Description
❓ Question
I wanted to train a DQN agent for 10M, given the time limit of the cluster I'm using I have to continue training the agent by loading the last saved model.I tried to follow the documentation (see the command bellow) but I seems like the model is not properly loaded: the last model achieves a score of 20 but when I continue training it has a score of -21 and other parameters (e.g exploration rate) don't continue annealing.
I just wanted to make sure that I'm not missing something. Is this the right way to do it?
python train.py --algo dqn --env PongNoFrameskip-v4 -i agents/dqn/PongNoFrameskipv4_1/rl_model_60000_steps.zip -n 1000000
Checklist
- I have checked that there is no similar issue in the repo
- I have read the SB3 documentation
- I have read the RL Zoo documentation
- If code there is, it is minimal and working
- If code there is, it is formatted using the markdown code blocks for both code and stack traces.
Metadata
Metadata
Assignees
Labels
questionFurther information is requestedFurther information is requested