Python - Overcome import in timeit

Question

I recently noticed the following about the timeit python module:

On my machine the lines:

from timeit import Timer
t = Timer(stmt='a = 2**3**4')
print("This took {:.3f}s to execute.".format(t.timeit()))

will produce:

This took 0.017s to execute.

On the other hand writing a file test.py:

#!/usr/bin/env python3

a = 2**3**4

and calling:

from timeit import Timer
t = Timer(stmt='import test')
print("This took {:.3f}s to execute.".format(t.timeit()))

will produce:

This took 0.126s to execute.

And I'm wondering how I can test the execution time of test.py without changing the file itself. How can I work around importing the file (and therefore loosing time).

If it's the first time you call import test, python will be busy creating "test.pyc" and take a bit longer. Subsequent imports should be somewhat faster. — Aaron
– Aaron, Commented Dec 18, 2017 at 20:54
@Aaron: Except subsequent imports won't import anything at all (they go through a bunch of import rigmarole that eventually ends in pulling the cached module from sys.modules, which doesn't rerun the module at all). — ShadowRanger
– ShadowRanger, Commented Dec 18, 2017 at 20:57

godaygo · Accepted Answer · 2017-12-18 21:34:15Z

2

There is one problem with your measurements you are not measuring what you think you are measuring, when you write:

t = Timer(stmt= 'a = 2**3**4')

You are measuring binding time! Look:

>>> Timer(stmt='a = 10**5**4').timeit(100000)
    0.0064544077574026915
>>> Timer(stmt='a = 2**3**4').timeit(100000)
    0.006511381058487586

The timings are pretty the same but it is somewhat longer to compute 10**5**4 than 2**3**4. The 2**3**4 is computed only once, when the code is compiled and this is called "constant folding", some optimizations which Python perform during compilation of your source.

Compare this two results:

>>> Timer(stmt= 'a = 2**3**4').timeit(100000) 
    0.00628656749199763
>>> Timer(stmt= 'a = x**y**z', setup='(x,y,z)=(2,3,4)').timeit(100000) 
    0.18055968312580717

But this acceleration is not given for free. There are two points:

Compilation time increases
.pyc file size increases (because this value is stored inside .pyc file)

Suppose I have two files:

#foo1.py
a = 10**7**7

#foo2.py
x,y,z =(10,7,7)
a = x**y**z

If I compile them with python -m py_compile foo1.py foo2.py the sizes of .pyc files on my machine will be:

foo1.cpython-36.pyc - 364 882 bytes
foo2.cpython-36.pyc - 150 bytes

edited Dec 18, 2017 at 21:34

answered Dec 18, 2017 at 21:17

godaygo

2,3082 gold badges20 silver badges35 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

user9115052 Over a year ago

That's really interesting, but I don't quite get how this exactly relates to my import statement? I was thinking, when I import test.py there already exists an test.pyc containing the constant, therefore the code should be faster than generating it on the fly (as for example in the command line), when in reality using import is slower.

user9115052 Over a year ago

Maybe you'd like to check out my next question as well

Aaron · Accepted Answer · 2017-12-18 22:15:40Z

1

The closest you're going to get is to use compile with exec If you plan on running as a .pyc file, don't include the compile statement in what you're timing.

# time as if executing:" >python test.pyc " from terminal
#   (importing test.py will typically automatically generate the .pyc file automatically)
t = Timer(stmt='exec(code_object)', 
          setup='code_object = compile(open("test.py").read(), "test.py", "exec")')

# time as if executing:" >python test.py " from terminal
t = Timer(stmt='exec(compile(open("test.py").read(), "test.py", "exec"))')

This should get you close to the realistic timing for calling a script from the terminal. This does not eliminate overhead, because the overhead of calling a script is real and will be observed in the real world.

If you're on a linux based system, you can also just call >time test.py from a terminal.

edited Dec 18, 2017 at 22:15

answered Dec 18, 2017 at 20:53

Aaron

11.2k1 gold badge27 silver badges43 bronze badges

5 Comments

Aaron Over a year ago

@StefanPochmann sorry, stupid typos. not thinking about what I'm doing. See edit

Stefan Pochmann Over a year ago

Ok now it works, but... it takes about 0.35 seconds while the OP's way only takes 0.15 seconds. Was it actually faster for you when you tested it? You did test this time, right? :-P

user9115052 Over a year ago

I want to time a script just if I put it all in quotes and put it as stmt in timeit. >time test.py always prints much more time, since python startup is included. How would I solve this problem? I want to test a some code ...code... as if I used print(Timer.timeit(stmt='code')).

user9115052 Over a year ago

Maybe you'd like to check out my next question as well

Aaron Over a year ago

@StefanPochmann it's not faster, it's representative of the real world

Collectives™ on Stack Overflow

Python - Overcome import in timeit

2 Answers 2

2 Comments

5 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

2 Comments

5 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related