how to get the index of numpy.random.choice? - python

Question

Is it possible to modify the numpy.random.choice function in order to make it return the index of the chosen element? Basically, I want to create a list and select elements randomly without replacement

import numpy as np
>>> a = [1,4,1,3,3,2,1,4]
>>> np.random.choice(a)
>>> 4
>>> a
>>> [1,4,1,3,3,2,1,4]

a.remove(np.random.choice(a)) will remove the first element of the list with that value it encounters (a[1] in the example above), which may not be the chosen element (eg, a[7]).

It may not be the chosen element, but it seems like two cases are indistinguishable. — Robᵩ
– Robᵩ, Commented Sep 13, 2013 at 20:07
@Rob: Not really. After I create the list it's important that it remains in the same order, whichever element I remove. — HappyPy
– HappyPy, Commented Sep 13, 2013 at 20:08

CT Zhu · Accepted Answer · 2013-09-13 20:46:07Z

19

Regarding your first question, you can work the other way around, randomly choose from the index of the array a and then fetch the value.

>>> a = [1,4,1,3,3,2,1,4]
>>> a = np.array(a)
>>> random.choice(arange(a.size))
6
>>> a[6]

But if you just need random sample without replacement, replace=False will do. Can't remember when it was firstly added to random.choice, might be 1.7.0. So if you are running very old numpy it may not work. Keep in mind the default is replace=True

edited Sep 13, 2013 at 20:46

answered Sep 13, 2013 at 20:24

CT Zhu

54.6k18 gold badges125 silver badges136 bronze badges

Sign up to request clarification or add additional context in comments.

6 Comments

askewchan Over a year ago

No need to make a list and choose from it in this case, just do np.random.randint(0,a.size), unless I suppose many mutually exclusive choices are needed.

CT Zhu Over a year ago

@askwchan, right! What was I thinking. np.random.randint(0,a.size, size=size_you_want) will be enough.

HappyPy Over a year ago

@CT Zhu: I get a AttributeError: 'list' object has no attribute 'size'

CT Zhu Over a year ago

Oh, a is a list, not a array. Put convert it to array first. I forgot to copy 1 line.

CT Zhu Over a year ago

@askwchan, oh, no. Your method will always become sampling with replacement. HappyPy really needs that replace=False, so a once a element is sampled it will not sampled again.

|

Óscar López · Accepted Answer · 2013-09-13 20:27:35Z

15

Here's one way to find out the index of a randomly selected element:

import random # plain random module, not numpy's
random.choice(list(enumerate(a)))[0]
=> 4      # just an example, index is 4

Or you could retrieve the element and the index in a single step:

random.choice(list(enumerate(a)))
=> (1, 4) # just an example, index is 1 and element is 4

edited Sep 13, 2013 at 20:27

answered Sep 13, 2013 at 20:08

Óscar López

237k38 gold badges321 silver badges391 bronze badges

9 Comments

HappyPy Over a year ago

This is not working for me. It gives me a "ValueError: a must be 1-dimensional"

HappyPy Over a year ago

I copy/pasted the your code and the list above, and I still get the same error. Is it working with you?

user2357112 Over a year ago

list(enumerate(a)) produces a list of tuples, which is considered a 2D array-like object. This won't work.

Óscar López Over a year ago

@HappyPy you're right, I tested it with random.choice, not np.random.choice. If you must absolutely use np.random.choice, then my answer won't work and I'll delete it. But if you use plain old random.choice (from the random module), it'll work.

Russell Myers Over a year ago

Strong warning, this is going to have terrible performance, which is one of the primary reasons people use numpy in the first place. You're iterating over an entire array. It would be cheaper to just generate a random integer between 0 and the length of the list rather than this.

|

user2357112 · Accepted Answer · 2019-10-05 01:23:21Z

10

numpy.random.choice(a, size=however_many, replace=False)

If you want a sample without replacement, just ask numpy to make you one. Don't loop and draw items repeatedly. That'll produce bloated code and horrible performance.

Example:

>>> a = numpy.arange(10)
>>> a
array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
>>> numpy.random.choice(a, size=5, replace=False)
array([7, 5, 8, 6, 2])

On a sufficiently recent NumPy (at least 1.17), you should use the new randomness API, which fixes a longstanding performance issue where the old API's replace=False code path unnecessarily generated a complete permutation of the input under the hood:

rng = numpy.random.default_rng()
result = rng.choice(a, size=however_many, replace=False)

edited Oct 5, 2019 at 1:23

answered Sep 13, 2013 at 20:08

user2357112

286k32 gold badges490 silver badges570 bronze badges

4 Comments

HappyPy Over a year ago

I don't understand how would this work. What's "a" in this case? Could you provide an example please?

user2357112 Over a year ago

@HappyPy: a is exactly the same thing it is in your code; it's the array-like object we want a sample from. size is the number of elements we want in the sample, and replace=False asks for a sample without replacement. The result will be a 1D array of shape (however_many,) containing the sample you wanted.

HappyPy Over a year ago

The sample is already "a". I want to work directly with "a" so that I can control how many elements are still left and perform other operations with "a".

user2357112 Over a year ago

@HappyPy: That sounds like you're using numpy all wrong. If a is already a random sample, but you want to draw elements from a without replacement, you're essentially drawing another random sample from a. If you really, really want to successively remove elements from a, numpy is unlikely to help you.

lmjohns3 · Accepted Answer · 2016-06-09 04:00:01Z

4

This is a bit in left field compared with the other answers, but I thought it might help what it sounds like you're trying to do in a slightly larger sense. You can generate a random sample without replacement by shuffling the indices of the elements in the source array :

source = np.random.randint(0, 100, size=100) # generate a set to sample from
idx = np.arange(len(source))
np.random.shuffle(idx)
subsample = source[idx[:10]]

This will create a sample (here, of size 10) by drawing elements from the source set (here, of size 100) without replacement.

You can interact with the non-selected elements by using the remaining index values, i.e.:

notsampled = source[idx[10:]]

edited Jun 9, 2016 at 4:00

answered Sep 13, 2013 at 20:40

lmjohns3

7,6325 gold badges39 silver badges57 bronze badges

Comments

Mehdi Saman Booy · Accepted Answer · 2020-06-23 09:11:30Z

2

Maybe late but it worth to mention this solution because I think the simplest way to do so is:

a = [1, 4, 1, 3, 3, 2, 1, 4]
n = len(a)
idx = np.random.choice(list(range(n)), p=np.ones(n)/n)

It means you are choosing from the indices uniformly. In a more general case, you can do a weighted sampling (and return the index) in this way:

probs = [.3, .4, .2, 0, .1]
n = len(a)
idx = np.random.choice(list(range(n)), p=probs)

If you try to do so for so many times (e.g. 1e5), the histogram of the chosen indices would be like [0.30126 0.39817 0.19986 0. 0.10071] in this case which is correct.

Anyway, you should choose from the indices and use the values (if you need) as their probabilities.

edited Jun 23, 2020 at 9:11

answered Nov 27, 2019 at 15:00

Mehdi Saman Booy

2,9685 gold badges28 silver badges32 bronze badges

Comments

Tobias Kienzler · Accepted Answer · 2016-12-02 13:40:21Z

1

Instead of using choice, you can also simply random.shuffle your array, i.e.

random.shuffle(a)  # will shuffle a in-place

answered Dec 2, 2016 at 13:40

Tobias Kienzler

27.8k23 gold badges138 silver badges232 bronze badges

Comments

Kernel · Accepted Answer · 2019-08-14 19:53:26Z

1

Here is a simple solution, just choose from the range function.

import numpy as np
a = [100,400,100,300,300,200,100,400]
I=np.random.choice(np.arange(len(a)))
print('index is '+str(I)+' number is '+str(a[I]))

answered Aug 14, 2019 at 19:53

Kernel

7251 gold badge14 silver badges27 bronze badges

Comments

askewchan · Accepted Answer · 2013-09-13 20:39:55Z

Based on your comment:

The sample is already a. I want to work directly with a so that I can control how many elements are still left and perform other operations with a. – HappyPy

it sounds to me like you're interested in working with a after n randomly selected elements are removed. Instead, why not work with N = len(a) - n randomly selected elements from a? Since you want them to still be in the original order, you can select from indices like in @CTZhu's answer, but then sort them and grab from the original list:

import numpy as np
n = 3 #number to 'remove'
a = np.array([1,4,1,3,3,2,1,4])
i = np.random.choice(np.arange(a.size), a.size-n, replace=False)
i.sort()
a[i]
#array([1, 4, 1, 3, 1])

So now you can save that as a again:

a = a[i]

and work with a with n elements removed.

Neil C. Obremski · Accepted Answer · 2022-05-10 13:09:18Z

The question title versus its description are a bit different. I just wanted the answer to the title question which was getting only an (integer) index from numpy.random.choice(). Rather than any of the above, I settled on index = numpy.random.choice(len(array_or_whatever)) (tested in numpy 1.21.6).

Ex:

import numpy
a = [1, 2, 3, 4]
i = numpy.random.choice(len(a))

The problem I had in the other solutions were the unnecessary conversions to list which would recreate the entire collection in a new object (slow!).

Reference: https://numpy.org/doc/stable/reference/random/generated/numpy.random.choice.html?highlight=choice#numpy.random.choice

Key point from the docs about the first parameter a:

a: 1-D array-like or int If an ndarray, a random sample is generated from its elements. If an int, the random sample is generated as if it were np.arange(a)

Since the question is very old then it's possible I'm coming at this from the convenience of newer versions supporting exactly what myself and the OP wanted.

Collectives™ on Stack Overflow

how to get the index of numpy.random.choice? - python

9 Answers 9

6 Comments

9 Comments

4 Comments

Comments

Comments

Comments

Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

9 Answers 9

6 Comments

9 Comments

4 Comments

Comments

Comments

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related