How to delete a row based on a condition from a numpy array?

Question

From the following array :

test = np.array([[1,2,'a'],[4,5,6],[7,'a',9],[10,11,12]])

How can I delete the rows that contain 'a' ? Expected result :

array([[ 4,  5,  6],
   [10, 11, 12]])

juanpa.arrivillaga · Accepted Answer · 2017-12-14 17:45:03Z

11

Note, numpy supports vectorized comparisons:

>>> test
array([[1, 2, 'a'],
       [4, 5, 6],
       [7, 'a', 9],
       [10, 11, 12]], dtype=object)
>>> test == 'a'
array([[False, False,  True],
       [False, False, False],
       [False,  True, False],
       [False, False, False]], dtype=bool)

Now, you want the rows where all are not equalt to 'a':

>>> (test != 'a').all(axis=1)
array([False,  True, False,  True], dtype=bool)

So, simply select the rows with the mask:

>>> row_mask = (test != 'a').all(axis=1)
>>> test[row_mask,:]
array([[4, 5, 6],
       [10, 11, 12]], dtype=object)

answered Dec 14, 2017 at 17:45

juanpa.arrivillaga

97.6k14 gold badges141 silver badges190 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

G F Over a year ago

Thanks for your great answer. I was trying to find a solution with the np.where function but I did not. Thanks again.

juanpa.arrivillaga Over a year ago

@GF you could use np.where, but I think the solution would be less clean, unless you were looking specifically for the values where something was whatever. The problem when you want only the rows or columns is that you have to do an additional step to find unique indices...

kmario23 · Accepted Answer · 2017-12-14 18:04:55Z

3

Also, like this maybe? (Inspired from one of my another answers )

In [100]: mask = ~(test == 'a')

In [101]: mask
Out[101]: 
array([[ True,  True, False],
       [ True,  True,  True],
       [ True, False,  True],
       [ True,  True,  True]], dtype=bool)

In [102]: test[np.all(mask, axis=1), :]
Out[102]: 
array([['4', '5', '6'],
       ['10', '11', '12']],
      dtype='<U21')

But, please note that here we're not deleting any rows from the original array. We're just slicing out the rows which doesn't have the alphabet a.

edited Dec 14, 2017 at 18:04

answered Dec 14, 2017 at 17:54

kmario23

62.1k17 gold badges174 silver badges159 bronze badges

1 Comment

G F Over a year ago

Nice too. Thanks ;)

G F · Accepted Answer · 2017-12-14 18:04:19Z

2

To sum up, there are a few possible ways such as :

test[np.all(test != 'a', axis=1), :]

Or

test[(test != 'a').all(axis=1)]

answered Dec 14, 2017 at 18:04

G F

3411 gold badge6 silver badges15 bronze badges

Collectives™ on Stack Overflow

How to delete a row based on a condition from a numpy array?

3 Answers 3

2 Comments

1 Comment

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

2 Comments

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related