Search elements of one array in another, row-wise - Python / NumPy

Question

For example, I have a matrix of unique elements,

a=[
    [1,2,3,4],
    [7,5,8,6]
]

and another unique matrix filled with elements which has appeard in the first matrix.

b=[
    [4,1],
    [5,6]
]

And I expect the result of

[
    [3,0],
    [1,3]
].

That is to say, I want to find each row elements of b which equals to some elements of a in the same row, return the indices of these elements in a. How can i do that? Thanks.

You just need to consider each row independently and stackoverflow.com/questions/432112/… — sshashank124
– sshashank124, Commented Dec 28, 2019 at 8:20
Welcome to SO! It seems to not to have any logic in your question? Could you explain which operation/process you wnat to do on it? — David García Bodego
– David García Bodego, Commented Dec 28, 2019 at 8:34

Divakar · Accepted Answer · 2019-12-28 09:35:55Z

2

Here's a vectorized approach -

# https://stackoverflow.com/a/40588862/ @Divakar
def searchsorted2d(a,b):
    m,n = a.shape
    max_num = np.maximum(a.max() - a.min(), b.max() - b.min()) + 1
    r = max_num*np.arange(a.shape[0])[:,None]
    p = np.searchsorted( (a+r).ravel(), (b+r).ravel() ).reshape(m,-1)
    return p - n*(np.arange(m)[:,None])

def search_indices(a, b):
    sidx = a.argsort(1)
    a_s = np.take_along_axis(a,sidx,axis=1)
    return np.take_along_axis(sidx,searchsorted2d(a_s,b),axis=1)

Sample run -

In [54]: a
Out[54]: 
array([[1, 2, 3, 4],
       [7, 5, 8, 6]])

In [55]: b
Out[55]: 
array([[4, 1],
       [5, 6]])

In [56]: search_indices(a, b)
Out[56]: 
array([[3, 0],
       [1, 3]])

Another vectorized one leveraging broadcasting -

In [65]: (a[:,None,:]==b[:,:,None]).argmax(2)
Out[65]: 
array([[3, 0],
       [1, 3]])

edited Dec 28, 2019 at 9:35

answered Dec 28, 2019 at 9:15

Divakar

222k19 gold badges273 silver badges374 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Divakar Over a year ago

@Navaro Well the usual NumPy way (under the hoods i.e. and the one that is performant) is with implementation in C. So, yes, it would be nice to see these being implemented in native NumPy. For now, I am just building up on existing tools.

Mercury · Accepted Answer · 2019-12-28 09:01:41Z

0

If you don't mind using loops, here's a quick solution using np.where:

import numpy as np

a=[[1,2,3,4],
   [7,5,8,6]]
b=[[4,1],
   [5,6]]

a = np.array(a)
b = np.array(b)
c = np.zeros_like(b)

for i in range(c.shape[0]):
    for j in range(c.shape[1]):
        _, pos = np.where(a==b[i,j])
        c[i,j] = pos

print(c.tolist())

answered Dec 28, 2019 at 9:01

Mercury

4,1811 gold badge15 silver badges43 bronze badges

1 Comment

AgaigetS AgaigetS Over a year ago

Thanks. But when I process on huge data, there will be much slow. I want it solve in the matrix way, just because of the demand of faster processing speed.

oppressionslayer · Accepted Answer · 2019-12-28 10:48:40Z

0

You can do it this way:

np.split(pd.DataFrame(a).where(pd.DataFrame(np.isin(a,b))).T.sort_values(by=[0,1])[::-1].unstack().dropna().reset_index().iloc[:,1].to_numpy(),len(a))                               

# [array([3, 0]), array([1, 3])]

edited Dec 28, 2019 at 10:48

answered Dec 28, 2019 at 9:09

oppressionslayer

7,2242 gold badges11 silver badges26 bronze badges

Collectives™ on Stack Overflow

Search elements of one array in another, row-wise - Python / NumPy

3 Answers 3

1 Comment

1 Comment

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

1 Comment

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related