
I want to continue training a model using new data.

I understand that you can continue training a PyTorch Lightning model from a checkpoint, e.g.

pl.Trainer(max_epochs=10, resume_from_checkpoint='./checkpoints/blahblah.ckpt')

if your last checkpoint was saved at epoch 5. But is there a way to continue training by adding different data?

2 Answers


For new users of PyTorch Lightning, the new syntax looks something like this:

trainer = pl.Trainer()
trainer.fit(model, data, ckpt_path="./path/to/checkpoint")

Also, since I don't have enough reputation to comment: if you have already trained for 10 epochs and want to train for 5 more, pass the following parameter to the Trainer (max_epochs is the total number of epochs, so 10 + 5 = 15):

trainer = pl.Trainer(max_epochs=15)
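
For context, here is a minimal, self-contained sketch of resuming with the newer ckpt_path argument. MyModel and new_dataset are placeholders for your own LightningModule and Dataset, and the checkpoint path is illustrative:

import pytorch_lightning as pl
from torch.utils.data import DataLoader

model = MyModel()  # your LightningModule subclass (placeholder)
new_train_loader = DataLoader(new_dataset, batch_size=32, shuffle=True)

# max_epochs is the total epoch count, so 15 means 5 more epochs
# on top of the 10 already recorded in the checkpoint.
trainer = pl.Trainer(max_epochs=15)
trainer.fit(model, new_train_loader, ckpt_path="./path/to/checkpoint")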



Yes. When you resume from a checkpoint you can provide a new DataLoader or DataModule to trainer.fit(), and training will resume from the last epoch with the new data.

trainer = pl.Trainer(max_epochs=10, resume_from_checkpoint='./checkpoints/blahblah.ckpt')

trainer.fit(model, new_train_dataloader)
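
To show how a DataModule fits in when resuming, here is a hedged sketch along the lines of this answer. NewDataModule, MyModel and new_dataset are placeholders for your own classes and data, and it uses the same older resume_from_checkpoint argument shown above:

import pytorch_lightning as pl
from torch.utils.data import DataLoader, Dataset

class NewDataModule(pl.LightningDataModule):
    # Wraps the new training data so it can be handed to trainer.fit().
    def __init__(self, new_dataset: Dataset, batch_size: int = 32):
        super().__init__()
        self.new_dataset = new_dataset
        self.batch_size = batch_size

    def train_dataloader(self):
        return DataLoader(self.new_dataset, batch_size=self.batch_size, shuffle=True)

model = MyModel()  # same LightningModule class the checkpoint was saved from (placeholder)
datamodule = NewDataModule(new_dataset)

trainer = pl.Trainer(max_epochs=10, resume_from_checkpoint='./checkpoints/blahblah.ckpt')
# Weights, optimizer state and the epoch counter are restored from the
# checkpoint when fit() starts; only the data is new.
trainer.fit(model, datamodule=datamodule)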

2 Comments

If the model has previously been trained for 10 epochs and I want to train for 5 more epochs, should I keep max_epochs=5 or max_epochs=10? Reference: lightning.ai/forums/t/how-to-resume-training/432/8
How on earth can the DataModule be provided during training if we just load the model?
