Equivalent of Numpy.argsort() in basic python? [duplicate]

Question

Is there a built-in Python function that does for a Python array what argsort() does for a numpy.array?

unutbu · Accepted Answer · 2015-06-04 22:36:04Z

113

There is no built-in function, but it's easy to assemble one out of the terrific tools Python makes available:

def argsort(seq):
    # http://stackoverflow.com/questions/3071415/efficient-method-to-calculate-the-rank-vector-of-a-list-in-python
    return sorted(range(len(seq)), key=seq.__getitem__)

x = [5,2,1,10]

print(argsort(x))
# [2, 1, 0, 3]

It works on Python array.arrays the same way:

import array
x = array.array('d', [5, 2, 1, 10])
print(argsort(x))
# [2, 1, 0, 3]
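
As a quick sanity check (a sketch, not part of the original answer): indexing the sequence with the result of argsort should reproduce sorted(seq). Because sorted() is stable, equal values keep their original relative order. The sample data here is made up for illustration.

```python
def argsort(seq):
    # Indices that would sort seq, as defined in the answer above.
    return sorted(range(len(seq)), key=seq.__getitem__)

data = [5, 2, 1, 10, 2]  # illustrative data with a repeated value
order = argsort(data)

# Applying the index order reproduces the sorted sequence.
assert [data[i] for i in order] == sorted(data)

# sorted() is stable, so the two 2s (indices 1 and 4) stay in that order.
print(order)
# [2, 1, 4, 0, 3]
```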

edited Jun 4, 2015 at 22:36

answered Aug 1, 2010 at 14:30

unutbu

5 Comments

Ender Over a year ago

Instead of using the (theoretically private) __getitem__, you can also use operator.itemgetter / operator.attrgetter: docs.python.org/library/operator.html

unutbu Over a year ago

If operator.itemgetter could be used as a drop-in replacement for __getitem__, I think I'd agree with you, Ender, but as far as I can see, operator.itemgetter would also require wrapping it in a lambda expression. I'd rather avoid the extra lambda if I can.

Ferdinand Beyer Over a year ago

@Ender: itemgetter is no use here: x.__getitem__(i) returns x[i], whereas itemgetter(x)(i) will return i[x].

johannesack Over a year ago

In my opinion, key=lambda i: seq[i] might be easier to understand.

neonwatty Over a year ago

Agreed with the comment above: key=lambda i: seq[i] might be easier to read. Still a great answer!
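
To make the comments above concrete, here is a small sketch comparing the bound method, the lambda, and operator.itemgetter. The variable names are illustrative only.

```python
from operator import itemgetter

x = [5, 2, 1, 10]
i = 2

# The bound method looks up index i in x.
assert x.__getitem__(i) == x[i] == 1

# itemgetter(i) builds a callable that looks up index i in its argument.
assert itemgetter(i)(x) == x[i]

# itemgetter(x)(i) evaluates i[x] instead, which fails for an int.
raised = False
try:
    itemgetter(x)(i)
except TypeError:
    raised = True

# The bound method and the lambda are interchangeable as sort keys.
by_method = sorted(range(len(x)), key=x.__getitem__)
by_lambda = sorted(range(len(x)), key=lambda j: x[j])
assert by_method == by_lambda == [2, 1, 0, 3]
```

So itemgetter fixes the index and varies the container, which is the opposite of what the sort key needs here.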

Markus Dutschke · Accepted Answer · 2021-03-29 10:05:03Z

86

I timed the suggestions above and here are my results.

import timeit
import random
import numpy as np

def f(seq):
    # http://stackoverflow.com/questions/3382352/equivalent-of-numpy-argsort-in-basic-python/3383106#3383106
    #non-lambda version by Tony Veijalainen
    return [i for (v, i) in sorted((v, i) for (i, v) in enumerate(seq))]

def g(seq):
    # http://stackoverflow.com/questions/3382352/equivalent-of-numpy-argsort-in-basic-python/3383106#3383106
    #lambda version by Tony Veijalainen
    return [x for x,y in sorted(enumerate(seq), key = lambda x: x[1])]


def h(seq):
    #http://stackoverflow.com/questions/3382352/equivalent-of-numpy-argsort-in-basic-python/3382369#3382369
    #by unutbu
    return sorted(range(len(seq)), key=seq.__getitem__)


seq = list(range(10000))
random.shuffle(seq)

n_trials = 100
for cmd in [
        'f(seq)', 'g(seq)', 'h(seq)', 'np.argsort(seq)',
        'np.argsort(seq).tolist()'
        ]:
    t = timeit.Timer(cmd, globals={**globals(), **locals()})
    print('time for {:d}x {:}: {:.6f}'.format(n_trials, cmd, t.timeit(n_trials)))

output

time for 100x f(seq): 0.323915
time for 100x g(seq): 0.235183
time for 100x h(seq): 0.132787
time for 100x np.argsort(seq): 0.091086
time for 100x np.argsort(seq).tolist(): 0.104226

A problem size dependent analysis is given here.

edited Mar 29, 2021 at 10:05

Markus Dutschke

answered Aug 8, 2011 at 7:49

Boris Gorelik

3 Comments

JPH Over a year ago

Interesting - probably the average is more important than the 'best' of 3(?)

Ricardo Cruz Over a year ago

The average is affected by outliers. You do not want the results to be polluted by other running programs or by incidental hardware cache misses.

reve_etrange Over a year ago

For future readers, %timeit is reporting the best average from 3 averages of 100 loops each.
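
The "best versus average" point in these comments can be explored with timeit.repeat from the standard library. This is only a sketch: the problem size, the repeat count of 3 and the loop count of 100 are assumptions chosen to mirror the comment, and the timings vary by machine.

```python
import random
import timeit

def h(seq):
    # unutbu's version from the answer above.
    return sorted(range(len(seq)), key=seq.__getitem__)

seq = list(range(1000))
random.shuffle(seq)

# 3 repeats of 100 calls each; each entry is the total time for 100 calls.
totals = timeit.repeat(lambda: h(seq), repeat=3, number=100)

best = min(totals)                 # least disturbed by other processes
mean = sum(totals) / len(totals)   # pulled upwards by slow outliers
print('best of 3: {:.6f} s, mean: {:.6f} s'.format(best, mean))
```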

Community · Accepted Answer · 2017-05-23 12:02:05Z

9

My alternative with enumerate:

def argsort(seq):
    return [x for x,y in sorted(enumerate(seq), key = lambda x: x[1])]

seq=[5,2,1,10]
print(argsort(seq))
# Output:
# [2, 1, 0, 3]

It is better, though, to use the approach from Marcelo Cantos's (https://stackoverflow.com/users/9990/marcelo-cantos) answer to the thread "python sort without lambda expressions":

[i for (v, i) in sorted((v, i) for (i, v) in enumerate(seq))]

edited May 23, 2017 at 12:02

CommunityBot

answered Aug 1, 2010 at 17:53

Tony Veijalainen

Jeff M. · Accepted Answer · 2015-04-13 15:51:13Z

5

Found this question, but needed argsort for a list of objects based on an object property.

Extending unutbu's answer, this would be:

sorted(range(len(seq)), key = lambda x: seq[x].sort_property)
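
A minimal runnable sketch of that one-liner follows. The Item class and its data are hypothetical; only the sort_property attribute name comes from the answer.

```python
from dataclasses import dataclass

@dataclass
class Item:  # hypothetical class, for illustration only
    name: str
    sort_property: float

seq = [Item('a', 5), Item('b', 2), Item('c', 1), Item('d', 10)]

# Sort the indices by the attribute of the object each index refers to.
order = sorted(range(len(seq)), key=lambda i: seq[i].sort_property)

print(order)
# [2, 1, 0, 3]
print([seq[i].name for i in order])
# ['c', 'b', 'a', 'd']
```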

answered Apr 13, 2015 at 15:51

Jeff M.
