5

I have a pandas dataframe, df , that contains columns where each row contains a numpy array of varying size e.g.

   column A 
0  np.array([1,2,3])
1  np.array([1,2,3,4])
2  np.array([1,2])

I there a built in pandas function that will return the mean value of each array, i.e. row, for the entire column? Something like :

df.A.mean()

But which operates on each row. Thanks for any help.

1 Answer 1

8

You can use df.<column>.map to apply a function to each element in a column:

df = pd.DataFrame({'a': 
    [np.array([1, 2, 3]), 
     np.array([4, 5, 6, 7]), 
     np.array([7, 8])]
})

df
Out[8]: 
              a
0     [1, 2, 3]
1  [4, 5, 6, 7]
2        [7, 8]

df['a'].map(lambda x: x.mean())
Out[9]: 
0    2.0
1    5.5
2    7.5
Name: a, dtype: float64
Sign up to request clarification or add additional context in comments.

Comments

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.