Nicholas P. Rougier, From Python To Numpy¹, 2017

Introduction

def random_walkfaster(n=1000): from itertools import accumulate

steps = random.choices([-1,+1], k=n) return [0]+list(accumulate(steps))

walk = random_walkfaster(1000)

accumulate([1,2,3,4,5]) --> 1 3 6 10 15²

Without using loops and instead vectorizing the problem we get a 85% increase in performance.

>>> from tools import timeit >>> timeit(“random_walkfaster(n=10000)”, globals()) 10 loops, best of 3: 2.21 msec per loop

Translating in numpy we get:

def random_walkfastest(n=1000):

steps = np.random.choice([-1,+1], n) return np.cumsum(steps)

walk = random_walkfastest(1000)

>>> from tools import timeit >>> timeit(“random_walkfastest(n=10000)”, globals()) 1000 loops, best of 3: 14 usec per loop

Readability vs Speed

The tradeoff for the massive speedups using numpy is often the readabily of the code: comment your code!