remove zero rows from numpy ndarray

Question

Given m x n nd array of floats, what is the best way to get an m' x n nd array of floats that does not contain all-zero rows?

for example: Given

[ 
  [1.0, 0.0, 2.0], 
  [0.0, 0.0, 0.0], 
  [2.0, 1.0, 0.0] 
]

I want to get

[ 
  [1.0, 0.0, 2.0], 
  [2.0, 1.0, 0.0] 
]

alani · Accepted Answer · 2020-08-15 06:14:13Z

1

You can index using a boolean array:

a = np.array([[1.0, 0.0, 2.0], [0.0, 0.0, 0.0], [2.0, 1.0, 0.0]])

print(a[a.any(axis=1)])

Here a.any(axis=1) will be True where any elements in the row are non-zero. These are the rows that we want to keep.

answered Aug 15, 2020 at 6:14

alani

13.2k3 gold badges18 silver badges34 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Damiox · Accepted Answer · 2020-08-15 06:02:18Z

1

You can exclude those elements as follows:

>>> import numpy as np
>>> x = np.array([ [1.0, 0.0, 2.0], [0.0, 0.0, 0.0], [2.0, 1.0, 0.0] ])
>>> x
array([[1., 0., 2.],
       [0., 0., 0.],
       [2., 1., 0.]])
>>> sumrow = np.abs(x).sum(-1)
>>> x[sumrow>0]
array([[1., 0., 2.],
       [2., 1., 0.]])

Note: @Akavall pointed out correctly that np.abs() would prevent issues with negative values.

Additionally, another more complex approach:

>>> x = np.array([ [1.0, 0.0, 2.0], [0.0, 0.0, 0.0], [2.0, 1.0, 0.0] ])
>>> x[~np.all(x == 0, axis=1)]
array([[1., 0., 2.],
       [2., 1., 0.]])

See: https://www.geeksforgeeks.org/numpy-indexing/

edited Aug 15, 2020 at 6:02

answered Aug 15, 2020 at 5:46

Damiox

6724 silver badges13 bronze badges

4 Comments

Akavall Over a year ago

What if a row contains non zero elements that sum to 0, for example: [-1., 0., 1.]?

Akavall Over a year ago

I suppose np.abs(x).sum(-1) will solve the issue that I pointed out.

Damiox Over a year ago

You're correct @Akavall - I just edited the answer to be correct. Also added another approach.

alani Over a year ago

Instead of ~np.all(x == 0, axis=1) you can just do x.any(axis=1). This saves some computation steps and temporary arrays, although by De Morgan's Laws we can see that it arrives at the same answer.

Bart Barnard · Accepted Answer · 2020-08-15 05:47:22Z

0

A possible solution would be to use the fact that the sum of all zeros is zero. Create a mask using that fact:

>>> bar = np.array ([ [1.0, 0.0, 2.0], [0.0, 0.0, 0.0], [2.0, 1.0, 0.0] ] )
>>> mask = bar.sum(axis=1)==0
>>> bar[mask]
array([[1., 0., 2.],
       [2., 1., 0.]])

answered Aug 15, 2020 at 5:47

Bart Barnard

1,1689 silver badges17 bronze badges

1 Comment

Bart Barnard Over a year ago

O, that's actually the same answer as @Damiox gave :)

Girish Hegde · Accepted Answer · 2020-08-15 06:20:11Z

0

Here's one way to do it:

import numpy as np
x    = np.array([ [1.0, 0.0, 2.0], [0.0, 0.0, 0.0], [2.0, 1.0, 0.0] ])
m, n = x.shape
rows = [row for row in range(m) if not all(x[row] == 0)]
x    = x[rows]
print(x)

This works for arrays containing negative data also. If we use sum suppose a row contains [-1, 0, 1] it will be deleted we don't want that.

answered Aug 15, 2020 at 6:20

Girish Hegde

1,5238 silver badges16 bronze badges

Comments

Kuldip Chaudhari · Accepted Answer · 2020-08-15 07:07:14Z

0

a=np.array([r for r in a if any(r)])

answered Aug 15, 2020 at 7:07

Kuldip Chaudhari

1,1146 silver badges8 bronze badges

Collectives™ on Stack Overflow

remove zero rows from numpy ndarray

5 Answers 5

Comments

4 Comments

1 Comment

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

Comments

4 Comments

1 Comment

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related