I'm following the book "Deep Learning with PyTorch Step By Step" and I have a question about the predict method in the StepByStep class (from this repository: GitHub).
The current implementation is:
def predict(self, x):
    self.model.eval()
    x_tensor = torch.as_tensor(x).float()
    y_hat_tensor = self.model(x_tensor.to(self.device))
    self.model.train()
    return y_hat_tensor.detach().cpu().numpy()
My question is: should I wrap the forward pass in torch.inference_mode() or torch.no_grad() even though I'm already calling self.model.eval(), like this:
def predict(self, x):
    self.model.eval()
    x_tensor = torch.as_tensor(x).float()
    with torch.inference_mode():
        y_hat_tensor = self.model(x_tensor.to(self.device))
    self.model.train()
    return y_hat_tensor.detach().cpu().numpy()  # detach() is superfluous here
I understand that model.eval() changes the behavior of layers like dropout and batch normalization, but I believe the context manager should still be added for better performance and memory efficiency.
However, I'm not sure why the original implementation doesn't include it. What are the best practices here?
model.eval() and no_grad() do different things, and you should use both. inference_mode() has similar functionality to no_grad(), so if you use it instead, you should still call model.eval(). As for why the book doesn't use it, who knows. I don't know the book and make no assumptions about the author, but technically anyone can write a book, whether they fully grasp the material or not.
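To see the difference concretely, here's a minimal sketch (with a toy nn.Linear model, not the book's model): model.eval() only switches layer behavior (dropout, batch norm), while inference_mode() is what actually stops autograd from building a graph for the forward pass.

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(3, 1))
model.eval()  # switches dropout/batchnorm to eval behavior only

x = torch.randn(4, 3)

# With eval() alone, autograd still records the forward pass,
# so the output requires grad and holds a graph in memory.
y1 = model(x)
print(y1.requires_grad)  # True

# inference_mode() disables autograd entirely: no graph is built,
# which saves memory and speeds up the forward pass.
with torch.inference_mode():
    y2 = model(x)
print(y2.requires_grad)  # False
```

This is also why detach() becomes superfluous inside inference_mode(): the output was never attached to a graph in the first place.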