How do I display a single image in PyTorch?

Question

How do I display a PyTorch Tensor of shape (3, 224, 224) representing a 224x224 RGB image? Using plt.imshow(image) gives the error:

TypeError: Invalid dimensions for image data

fecavy · Accepted Answer · 2025-01-27 15:32:24Z

174

Given a Tensor representing the image, use .permute() to put the channels as the last dimension when passing them to matplotlib:

plt.imshow(tensor_image.permute(1, 2, 0))

Note: permute does not copy or allocate memory, and from_numpy() doesn't either.

edited Jan 27 at 15:32

fecavy

1892 silver badges12 bronze badges

answered Mar 16, 2019 at 11:41

Tom Hale

48.1k43 gold badges207 silver badges275 bronze badges

Sign up to request clarification or add additional context in comments.

5 Comments

Devashish Prasad Over a year ago

Wow thank you... This worked for me... I was trying to do tensor_image.numpy().reshape([224,224,3]) and visualize it using cv2.imshow() But i was not getting the actual image... whats going wrong here??

Sophie Swett Over a year ago

@DevashishPrasad The problem is that reshape([224,224,3]) doesn't do the same thing that permute(1, 2, 0) does. The permute function is similar to transposing a matrix, where rows become columns and columns become rows. The reshape function does something totally unrelated that I don't know how to describe concisely. In short, reshape is the wrong function.

Charlie Parker Over a year ago

what is the shape of tensor_image ?

rusheb Over a year ago

An arguably more readable alternative is plt.imshow(torch.einsum('cwh->whc', tensor_image))

Anson Savage Oct 14 at 3:07

And I also had to make sure it's on the cpu (.to('cpu'))

trsvchn · Accepted Answer · 2018-12-14 16:16:46Z

21

As you can see matplotlib works fine even without conversion to numpy array. But PyTorch Tensors ("Image tensors") are channel first, so to use them with matplotlib you need to reshape it:

Code:

from scipy.misc import face
import matplotlib.pyplot as plt
import torch

np_image = face()
print(type(np_image), np_image.shape)
tensor_image = torch.from_numpy(np_image)
print(type(tensor_image), tensor_image.shape)
# reshape to channel first:
tensor_image = tensor_image.view(tensor_image.shape[2], tensor_image.shape[0], tensor_image.shape[1])
print(type(tensor_image), tensor_image.shape)

# If you try to plot image with shape (C, H, W)
# You will get TypeError:
# plt.imshow(tensor_image)

# So we need to reshape it to (H, W, C):
tensor_image = tensor_image.view(tensor_image.shape[1], tensor_image.shape[2], tensor_image.shape[0])
print(type(tensor_image), tensor_image.shape)

plt.imshow(tensor_image)
plt.show()

Output:

<class 'numpy.ndarray'> (768, 1024, 3)
<class 'torch.Tensor'> torch.Size([768, 1024, 3])
<class 'torch.Tensor'> torch.Size([3, 768, 1024])
<class 'torch.Tensor'> torch.Size([768, 1024, 3])

edited Dec 14, 2018 at 16:16

answered Dec 5, 2018 at 13:04

trsvchn

9,1113 gold badges26 silver badges35 bronze badges

1 Comment

Tom Hale Over a year ago

Hmm, doesn't work for me, see updated question with the tensor's shape.

amirhe · Accepted Answer · 2023-07-06 08:09:39Z

13

PyTorch modules processing image data expect tensors in the format C × H × W.¹
Whereas PILLow and Matplotlib expect image arrays in the format H × W × C.²

You can easily convert tensors to/from this format with a TorchVision transform:

from torchvision.transforms import functional as F

F.to_pil_image(image_tensor)

Or by directly permuting the axes:

image_tensor.permute(1,2,0)

^{PyTorch modules dealing with image data require tensors to be laid out as C × H × W : channels, height, and width, respectively.

Note how we have to use permute to change the order of the axes from C × H × W to H × W × C to match what Matplotlib expects.

Deep Learning with PyTorch}

edited Jul 6, 2023 at 8:09

amirhe

2,3511 gold badge18 silver badges32 bronze badges

answered Mar 15, 2021 at 16:30

End genocide - save Gaza

25k10 gold badges113 silver badges133 bronze badges

Comments

End genocide - save Gaza · Accepted Answer · 2021-03-16 10:11:17Z

10

Given the image is loaded as described and stored in the variable image:

plt.imshow(transforms.ToPILImage()(image), interpolation="bicubic")
#transforms.ToPILImage()(image).show() # Alternatively

Or as Soumith suggested:

def show(img):
    npimg = img.numpy()
    plt.imshow(np.transpose(npimg, (1, 2, 0)), interpolation='nearest')

edited Mar 16, 2021 at 10:11

End genocide - save Gaza

25k10 gold badges113 silver badges133 bronze badges

answered Dec 5, 2018 at 0:33

Tom Hale

48.1k43 gold badges207 silver badges275 bronze badges

1 Comment

Fuji Over a year ago

import torchvision.transforms # maybe add the import to the code

End genocide - save Gaza · Accepted Answer · 2021-03-15 16:32:01Z

4

A complete example given an image pathname img_path:

from PIL import Image
image = Image.open(img_path)
plt.imshow(transforms.ToPILImage()(transforms.ToTensor()(image)), interpolation="bicubic")

Note that transforms.* return a class, which is why the funky bracketing.

edited Mar 15, 2021 at 16:32

End genocide - save Gaza

25k10 gold badges113 silver badges133 bronze badges

answered Mar 13, 2019 at 9:59

Tom Hale

48.1k43 gold badges207 silver badges275 bronze badges

Comments

TheExorcist · Accepted Answer · 2022-07-29 11:23:09Z

3

Torch is in shape of channel,height,width need to convert it into height,width, channel so permute.

plt.imshow(white_torch.permute(1, 2, 0))

Or directly if you want

import torch
import torchvision
from torchvision.io import read_image
import torchvision.transforms as T

!wget 'https://images.unsplash.com/photo-1553284965-83fd3e82fa5a?ixlib=rb-1.2.1&ixid=MnwxMjA3fDB8MHxleHBsb3JlLWZlZWR8NHx8fGVufDB8fHx8&w=1000&q=80'  -O white_horse.jpg

white_torch = torchvision.io.read_image('white_horse.jpg')

T.ToPILImage()(white_torch)

answered Jul 29, 2022 at 11:23

TheExorcist

2,0341 gold badge22 silver badges25 bronze badges

Comments

aravinda_gn · Accepted Answer · 2021-05-19 04:33:03Z

0

Use show_image from fastai

from fastai.vision.all import show_image

answered May 19, 2021 at 4:33

aravinda_gn

1,3801 gold badge13 silver badges21 bronze badges

Comments

catFood · Accepted Answer · 2021-10-18 08:22:18Z

I've written a simple function to visualize the pytorch tensor using matplotlib.

import numpy as np
import matplotlib.pyplot as plt
import torch

def show(*imgs):
    '''
     input imgs can be single or multiple tensor(s), this function uses matplotlib to visualize.
     Single input example:
     show(x) gives the visualization of x, where x should be a torch.Tensor
        if x is a 4D tensor (like image batch with the size of b(atch)*c(hannel)*h(eight)*w(eight), this function splits x in batch dimension, showing b subplots in total, where each subplot displays first 3 channels (3*h*w) at most. 
        if x is a 3D tensor, this function shows first 3 channels at most (in RGB format)
        if x is a 2D tensor, it will be shown as grayscale map
     
     Multiple input example:      
     show(x,y,z) produces three windows, displaying x, y, z respectively, where x,y,z can be in any form described above.
    '''
    img_idx = 0
    for img in imgs:
        img_idx +=1
        plt.figure(img_idx)
        if isinstance(img, torch.Tensor):
            img = img.detach().cpu()

            if img.dim()==4: # 4D tensor
                bz = img.shape[0]
                c = img.shape[1]
                if bz==1 and c==1:  # single grayscale image
                    img=img.squeeze()
                elif bz==1 and c==3: # single RGB image
                    img=img.squeeze()
                    img=img.permute(1,2,0)
                elif bz==1 and c > 3: # multiple feature maps
                    img = img[:,0:3,:,:]
                    img = img.permute(0, 2, 3, 1)[:]
                    print('warning: more than 3 channels! only channels 0,1,2 are preserved!')
                elif bz > 1 and c == 1:  # multiple grayscale images
                    img=img.squeeze()
                elif bz > 1 and c == 3:  # multiple RGB images
                    img = img.permute(0, 2, 3, 1)
                elif bz > 1 and c > 3:  # multiple feature maps
                    img = img[:,0:3,:,:]
                    img = img.permute(0, 2, 3, 1)[:]
                    print('warning: more than 3 channels! only channels 0,1,2 are preserved!')
                else:
                    raise Exception("unsupported type!  " + str(img.size()))
            elif img.dim()==3: # 3D tensor
                bz = 1
                c = img.shape[0]
                if c == 1:  # grayscale
                    img=img.squeeze()
                elif c == 3:  # RGB
                    img = img.permute(1, 2, 0)
                else:
                    raise Exception("unsupported type!  " + str(img.size()))
            elif img.dim()==2:
                pass
            else:
                raise Exception("unsupported type!  "+str(img.size()))


            img = img.numpy()  # convert to numpy
            img = img.squeeze()
            if bz ==1:
                plt.imshow(img, cmap='gray')
                # plt.colorbar()
                # plt.show()
            else:
                for idx in range(0,bz):
                    plt.subplot(int(bz**0.5),int(np.ceil(bz/int(bz**0.5))),int(idx+1))
                    plt.imshow(img[idx], cmap='gray')

        else:
            raise Exception("unsupported type:  "+str(type(img)))

maverik89 · Accepted Answer · 2024-07-26 11:31:43Z

0

If you run it locally, you can use the cudacanvas package, it's specially useful if you have your tensor on cuda and not on cpu, then you won't need to do permute or cpu transfer and can just

import torch
import cudacanvas


#REPLACE THIS with you training loop
while (True):

    #REPLACE THIS with you training code and generation of data
    noise_image = torch.rand((4, 500, 500), device="cuda")

    #Visualise your data in real-time
    cudacanvas.im_show(noise_image)

    #OPTIONAL: Terminate training when the window is closed
    if cudacanvas.should_close():
        cudacanvas.clean_up()
        #end process if the window is closed
        break

answered Jul 26, 2024 at 11:31

maverik89

435 bronze badges

Collectives™ on Stack Overflow

How do I display a single image in PyTorch?

9 Answers 9

5 Comments

1 Comment

Comments

1 Comment

Comments

Comments

Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

9 Answers 9

5 Comments

1 Comment

Comments

1 Comment

Comments

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related