Puzzled by odd NumPy error when creating array

Question

With

foo_ok = [(30, 784), (10, 30)]
foo_bad = [(10, 784), (10, 10)]

why does

np.array([np.zeros(foo_ok[0]),np.zeros(foo_ok[1])])

work while

np.array([np.zeros(foo_bad[0]),np.zeros(foo_bad[1])])

results in

ValueError: could not broadcast input array from shape (10,784) into shape (10)

Basically I need things that work with the form foo = [(X, Z), (Y, X)] where it might be the case that Y==X; but having Y==X causes things to fail.

Imanol Luengo · Accepted Answer · 2016-03-31 16:09:37Z

Edited the answer according to the edited question.

Basically, the problem arises when the first axis of the 2 arrays matches. Find below a reproducible example:

import numpy as np

foo_ok = [(30, 784), (10, 30)]
foo_ok2 = [(30, 784), (30, 784)]
foo_bad = [(10, 784), (10, 10)]

If we construct the first 2 arrays:

a = np.array([np.zeros(foo_ok[0]),np.zeros(foo_ok[1])])
b = np.array([np.zeros(foo_ok2[0]),np.zeros(foo_ok2[1])])

c = np.array([np.zeros(foo_bad[0]),np.zeros(foo_bad[1])]) # ERROR

we can see that the resulting arrays are not the same:

>>> print(a.shape, a.dtype, a[0].shape, a[1].shape)
(2,) object (30, 784) (10, 30)

>>> print(b.shape, b.dtype, b[0].shape, b[1].shape)
(2, 30, 784) float64 (30, 784) (30, 784)

Here foo_ok2[0] and foo_ok2[1] are identical, so the 2 arrays have the same shape. When every input array has the same shape, NumPy stacks them along a new leading axis, and the resulting b has shape (2, 30, 784). The resulting a, however, is just an array of type object with 2 elements. Each element is a separate array (as if it were a plain Python list).
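To make the same-shape case concrete, here is a minimal sketch. The names x, y, stacked and explicit are only for illustration; np.stack is used as a comparison, on the assumption that it is a fair way to spell out what np.array does with equally shaped inputs.

```python
import numpy as np

x = np.zeros((30, 784))
y = np.zeros((30, 784))

# Identical shapes: np.array builds one regular float array
# with a new leading axis of length 2.
stacked = np.array([x, y])

# The same result written explicitly with np.stack.
explicit = np.stack([x, y])

print(stacked.shape, stacked.dtype)        # (2, 30, 784) float64
print(np.array_equal(stacked, explicit))   # True
```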

NumPy is not optimized to deal with object arrays, so whenever possible it tries to cast the inputs to a numerical data type.

That is what happens when the first dimension of the 2 arrays matches, as in c. NumPy then tries to build a regular numerical array, expects all the remaining dimensions to match as well, and raises the "could not broadcast" ValueError when they do not.

Although I would still discourage using NumPy arrays with object types, there is a dirty way you can create one even when the first axis matches while the arrays have different shapes:

>>> c = np.array([np.zeros(foo_bad[0]), None], dtype=object)
>>> c[1] = np.zeros(foo_bad[1])

>>> print(c.shape, c.dtype, c[0].shape, c[1].shape)
(2,) object (10, 784) (10, 10)

And another version of it (closely related to your syntax):

>>> c = np.empty((2,), dtype=object)
>>> c[:] = [np.zeros(foo_bad[0]), np.zeros(foo_bad[1])]

>>> print(c.shape, c.dtype, c[0].shape, c[1].shape)
(2,) object (10, 784) (10, 10)
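The two tricks above can be combined into a small helper. This is only a sketch: as_object_array is a hypothetical name, not a NumPy function. It fills the object array one slot at a time, on the assumption that integer-index assignment (as in c[1] = ... above) stores each array as-is, so the shapes of the inputs never get compared.

```python
import numpy as np

def as_object_array(arrays):
    """Hypothetical helper: put each array into one slot of a 1-D object array."""
    out = np.empty(len(arrays), dtype=object)
    for i, arr in enumerate(arrays):
        # Integer indexing stores the array object itself; nothing is broadcast.
        out[i] = arr
    return out

foo_bad = [(10, 784), (10, 10)]
c = as_object_array([np.zeros(shape) for shape in foo_bad])
print(c.shape, c.dtype, c[0].shape, c[1].shape)  # (2,) object (10, 784) (10, 10)

# Also keeps a 1-D object array when every input has the same shape.
same = as_object_array([np.zeros((3, 4)), np.zeros((3, 4))])
print(same.shape, same[0].shape)                 # (2,) (3, 4)
```

Filling slot by slot is slower than a single assignment, but for a handful of arrays that cost is negligible.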

edited Mar 31, 2016 at 16:09

answered Mar 31, 2016 at 15:18

Imanol Luengo

14 Comments

orome Over a year ago

Hold on a sec. My error may lie elsewhere. Let me review before you invest more in this.

Imanol Luengo Over a year ago

@raxacoricofallapatorius just edited something quick that might help. Maybe zeros -> zeros_like?

orome Over a year ago

Edited to simplify and make the question clearer. zeros_like doesn't do the trick. I'm looking for the equivalent of what I get with foo_ok, just with the relevant dimensions changed from 30 to 10.

orome Over a year ago

Basically I need things that work with the form foo = [(X, Z), (Y, X)] where it might be the case that Y==X. But I don't see why having Y==X should cause things to fail.

Imanol Luengo Over a year ago

@raxacoricofallapatorius In short, you should not have numpy arrays of such type. Numpy is not optimized to deal with object arrays. Use raw python lists for that purpose.
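The plain-list approach suggested in this comment could look like the sketch below. The name weights and the "+ 1.0" update are assumptions made for illustration; the question does not say what the arrays are used for.

```python
import numpy as np

foo_bad = [(10, 784), (10, 10)]

# A plain Python list holds arrays of any shape, whether or not Y == X.
weights = [np.zeros(shape) for shape in foo_bad]
print([w.shape for w in weights])   # [(10, 784), (10, 10)]

# Per-array operations become list comprehensions.
weights = [w + 1.0 for w in weights]
print([float(w.sum()) for w in weights])   # [7840.0, 100.0]
```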
