
I'm trying to efficiently map an N × 1 numpy array of ints to an N × 3 numpy array of floats using a ufunc.

What I have so far:

import numpy

map = {1: (0, 0, 0), 2: (0.5, 0.5, 0.5), 3: (1, 1, 1)}
ufunc = numpy.frompyfunc(lambda x: numpy.array(map[x], numpy.float32), 1, 1)

input = numpy.array([1, 2, 3], numpy.int32)

ufunc(input) gives a 3 * 3 array with dtype object. I'd like this array but with dtype float32.

    map and input are Python builtin functions. It is best not to assign new values to these names, since it makes it hard to access the Python builtins. Commented Aug 31, 2012 at 1:16
  • The documentation of frompyfunc says that "The returned ufunc always returns PyObject arrays". Whatever the evil reason for this is, there is a fairly easy workaround: submit an output matrix of appropriate entry type as out argument. Commented Mar 14, 2016 at 16:47

4 Answers


You could use np.hstack:

import numpy as np
mapping = {1: (0, 0, 0), 2: (0.5, 0.5, 0.5), 3: (1, 1, 1)}
ufunc = np.frompyfunc(lambda x: np.array(mapping[x], np.float32), 1, 1)

data = np.array([1, 2, 3], np.int32)
result = np.hstack(ufunc(data))
print(result)
# [ 0.   0.   0.   0.5  0.5  0.5  1.   1.   1. ]
print(result.dtype)
# float32
print(result.shape)
# (9,)
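The hstack result is flat with shape (9,); if the N × 3 shape from the question is needed, a reshape restores it. A minimal sketch (note that frompyfunc itself takes no dtype argument — the float32 dtype comes from the arrays the lambda returns):

```python
import numpy as np

mapping = {1: (0, 0, 0), 2: (0.5, 0.5, 0.5), 3: (1, 1, 1)}
# frompyfunc returns an object array whose elements are float32 arrays
ufunc = np.frompyfunc(lambda x: np.array(mapping[x], np.float32), 1, 1)

data = np.array([1, 2, 3], np.int32)
# hstack concatenates the per-element float32 arrays; reshape back to N x 3
result = np.hstack(ufunc(data)).reshape(-1, 3)
```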



If your mapping is a numpy array, you can just use fancy indexing for this:

>>> valmap = numpy.array([(0, 0, 0), (0.5, 0.5, 0.5), (1, 1, 1)])
>>> input = numpy.array([1, 2, 3], numpy.int32)
>>> valmap[input-1]
array([[ 0. ,  0. ,  0. ],
       [ 0.5,  0.5,  0.5],
       [ 1. ,  1. ,  1. ]])
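The array above defaults to float64; since the question asks for float32, the lookup table can be built with an explicit dtype so the fancy-indexed result inherits it — a small variant of the snippet above:

```python
import numpy as np

# declaring the table as float32 makes the fancy-indexed result float32 too
valmap = np.array([(0, 0, 0), (0.5, 0.5, 0.5), (1, 1, 1)], dtype=np.float32)
inp = np.array([1, 2, 3], np.int32)
result = valmap[inp - 1]  # keys are 1-based, so shift to 0-based row indices
```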



You can use ndarray fancy indexing to get the same result; it should be faster than frompyfunc:

map_array = np.array([[0, 0, 0], [0, 0, 0], [0.5, 0.5, 0.5], [1, 1, 1]], dtype=np.float32)  # row 0 is padding so keys 1..3 index directly
index = np.array([1,2,3,1])
map_array[index]

Or you can just use a list comprehension:

map = {1: (0, 0, 0), 2: (0.5, 0.5, 0.5), 3: (1, 1, 1)}
np.array([map[i] for i in [1, 2, 3, 1]], dtype=np.float32)
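If the mapping starts out as a dict, the lookup table can also be built from it once, rather than padded by hand — a sketch assuming small, non-negative integer keys:

```python
import numpy as np

mapping = {1: (0, 0, 0), 2: (0.5, 0.5, 0.5), 3: (1, 1, 1)}

# allocate one row per possible key; unused rows (here row 0) stay zero
table = np.zeros((max(mapping) + 1, 3), dtype=np.float32)
for key, value in mapping.items():
    table[key] = value

index = np.array([1, 2, 3, 1])
result = table[index]  # one float32 row per index, no intermediate lists
```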

1 Comment

The input list is very large so I'm trying to avoid creating intermediate lists or arrays.

Unless I misread the docs, the output of np.frompyfunc on a scalar is indeed an object: when using an ndarray as input, you'll get an ndarray with dtype=object.

A workaround is to use the np.vectorize function:

F = np.vectorize(lambda x: mapper.get(x), 'fff')

Here, we force the dtype of F's output to be 3 floats (hence the 'fff').

>>> mapper = {1: (0, 0, 0), 2: (0.5, 0.5, 0.5), 3: (1, 1, 1)}
>>> inp = [1, 2, 3]
>>> F(inp)
(array([ 0. ,  0.5,  1. ], dtype=float32), array([ 0.,  0.5,  1.], dtype=float32), array([ 0. ,  0.5,  1. ], dtype=float32))

OK, not quite what we want: it's a tuple of three float arrays (as we gave 'fff'), the first array being equivalent to [mapper[i][0] for i in inp]. So, with a bit of manipulation:

>>> np.array(F(inp)).T
array([[ 0. ,  0. ,  0. ],
       [ 0.5,  0.5,  0.5],
       [ 1. ,  1. ,  1. ]], dtype=float32)
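On NumPy 1.12 and later, np.vectorize also accepts a signature argument, which avoids the tuple-of-arrays output and the transpose entirely — a sketch under that version assumption:

```python
import numpy as np

mapper = {1: (0, 0, 0), 2: (0.5, 0.5, 0.5), 3: (1, 1, 1)}

# signature='()->(n)' declares a gufunc mapping each scalar to a length-n vector
F = np.vectorize(lambda x: np.asarray(mapper[x], np.float32), signature='()->(n)')
result = F([1, 2, 3])  # one row per input element
```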

