3
votes

In order to test some security software, I need to be able to create a large (configurable) number of new processes (not threads!) in Windows, very quickly, have them exist for a (configurable) period of time, then terminate cleanly. The processes shouldn't do anything at all - just exist for the specified duration.

Ultimately, I want to be able to run something like:

C:\> python process_generate.py --processes=150 --duration=2500

which would create 150 new processes very quickly, keep them all alive for 2500ms, then have them all terminate as quickly as possible.
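
Roughly, I picture process_generate.py looking something like the sketch below (just a sketch - the flag handling and the idle worker are placeholders, not code I've settled on):

from multiprocessing import Process
import argparse
import time

def idle(duration_ms):
    # The child does nothing except exist for the requested time
    time.sleep(duration_ms / 1000.0)

if __name__ == '__main__':
    parser = argparse.ArgumentParser()
    parser.add_argument('--processes', type=int, default=150)
    parser.add_argument('--duration', type=int, default=2500)   # milliseconds
    args = parser.parse_args()

    children = [Process(target=idle, args=(args.duration,)) for _ in xrange(args.processes)]
    for p in children:
        p.start()
    for p in children:
        p.join()    # each child exits on its own once its sleep finishes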

As a starting point, I ran

from multiprocessing import Process
import os

def f():
    pass

if __name__ == '__main__':
    import datetime
    count = 0
    starttime = datetime.datetime.now()
    while True:
        p = Process(target=f)
        p.start()
        p.terminate()
        count += 1
        if count % 1000 == 0:
            now = datetime.datetime.now()
            print "Started & stopped d processes in %s seconds" % (count, str(now-starttime))

and found I could create and terminate about 70 processes/second serially on my laptop, with each created process terminating straight away. That rate of roughly 70 processes/second was sustained for about an hour.

When I changed the code to

from multiprocessing import Process
import os
import time

def f_sleep():
    time.sleep(1)

if __name__ == '__main__':
    import datetime
    starttime = datetime.datetime.now()

    processes = []
    PROCESS_COUNT = 100
    for i in xrange(PROCESS_COUNT):
        p = Process(target=f_sleep)
        processes.append(p)
        p.start()
    for i in xrange(PROCESS_COUNT):
        processes[i].terminate()
    now = datetime.datetime.now()
    print "Started/stopped %d processes in %s seconds" % (len(processes), str(now-starttime))

and tried different values for PROCESS_COUNT, I expected it to scale a lot better than it did. I got the following results for different values of PROCESS_COUNT:

  • 20 processes completed in 0.72 seconds
  • 30 processes completed in 1.45 seconds
  • 50 processes completed in 3.68 seconds
  • 100 processes completed in 14 seconds
  • 200 processes completed in 43 seconds
  • 300 processes completed in 77 seconds
  • 400 processes completed in 111 seconds

This is not what I expected: I expected to be able to scale the parallel process count up reasonably linearly until I hit a bottleneck, but I seem to be hitting a process-creation bottleneck almost straight away. Based on the first piece of code I ran, I definitely expected to get close to 70 processes/second before hitting a process-creation bottleneck.

Without going into the full specs, the laptop runs fully patched Windows XP, has 4 GB of RAM, is otherwise idle, and is reasonably new; I don't think it should be hitting a bottleneck this quickly.

Am I doing anything obviously wrong here with my code, or is XP/Python parallel process creation really that inefficient on a 12-month-old laptop?

5
"new processes" "very quickly" "Windows" ... I think I see the problem... - Ignacio Vazquez-Abrams
Doesn't explain how I was able to create 70 processes/second when they were being created serially though - monch1962

5 Answers

7
votes

Well, Windows process management doesn't really scale well. The more processes there are, the longer it takes to insert a new one into scheduling.

Now compare this with other OS kernels, for example Linux, where process creation is practically O(1) (constant time) since kernel 2.6.8 (when the scheduler capable of this was introduced).

Note that I'm not trying to sell you Linux here. I just suggest you try out your program on a different OS to see for yourself.

3
votes

After profiling and testing a bunch of different scenarios, I found that it's simply far faster to generate and kill single processes under Windows than to generate N processes at once, kill all N, and then restart N again.

My conclusion is that Windows keeps enough resource available to start one process at a time quite quickly, but not enough to start more than one new concurrent process without considerable delay.

As others have said, Windows is slow at starting new processes, but apparently the speed degrades semi-geometrically with the number of concurrent processes already running on the system: starting a single process is quite fast, but when you're kicking off multiple processes you hit problems. This applies regardless of how many CPUs the machine has, how busy it is (typically <5% CPU in my testing), whether Windows is running on a physical server or a virtual one, or how much RAM is available (I tested with up to 32 GB of RAM, with ~24 GB free) - it simply seems to be a limitation of the Windows OS.

When I installed Linux on the same hardware, the limitation went away (as per Xavi's response) and we were able to start many processes concurrently, very quickly.
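
To illustrate the pattern I mean (start and retire one process at a time while keeping a rolling pool alive), here's a rough sketch; the pool size, sleep time, and run length are illustrative only, not the exact code I used:

from collections import deque
from multiprocessing import Process
import time

def idle():
    time.sleep(2.5)

if __name__ == '__main__':
    TARGET = 150          # rough number of processes to keep alive at once
    live = deque()
    start = time.time()
    while time.time() - start < 60:   # churn processes for a minute
        p = Process(target=idle)
        p.start()                     # only ever one new process being created
        live.append(p)
        if len(live) > TARGET:
            oldest = live.popleft()
            oldest.terminate()        # retire one process per new start
            oldest.join()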

2
votes

If I remember correctly, Windows, as opposed to Linux, was never designed to start many processes quickly. It's just not what the designers thought you would be doing, whereas on Linux, with things like inetd, it's a common enough operating model to warrant optimization, so process creation was optimized like hell.

2
votes

I've tested your code on a Dell Precision running Ubuntu 11.04 with 4 GB of RAM, with these results:

Started/stopped 100 processes in 0:00:00.051061 seconds
Started/stopped 150 processes in 0:00:00.094802 seconds
Started/stopped 200 processes in 0:00:00.153671 seconds
Started/stopped 300 processes in 0:00:00.351072 seconds
Started/stopped 400 processes in 0:00:00.651631 seconds
Started/stopped 470 processes in 0:00:01.009148 seconds
Started/stopped 1000 processes in 0:00:02.532036 seconds
Started/stopped 10000 processes in 0:00:29.677061 seconds

There was at least 10% variability between runs with the same number of processes. Hope this is useful; within one second my computer started and stopped nearly 500 processes with your code.

0
votes

I'd argue that on Linux there are difficulties with creating many Python processes too. After about 500 calls to p.start(), it becomes really slow.

Sometimes I need to create thousands of processes that then run for a long time.

In the examples above there are never PROCESS_COUNT live processes at any one moment, because they start finishing after 1 second. So in the case above of creating 1000 processes in about 2 seconds, more than half of the processes had already finished before the creation loop completed.

from multiprocessing import Process
from time import sleep, time

def sample():
    sleep(13)

start = time()
for i in range(1500):
    p = Process(target=sample)
    p.daemon = True
    p.start()
end = time()
print end - start

I tried this on a 140-core server running SUSE Enterprise and on my laptop running Ubuntu; the dynamics are the same (server results below):

500 processes start  - 1.36 s
1000 processes start - 9.7 s
1500 processes start - 18.4 s
2000 processes start - 24.3 s
3000 processes start - 43.3 s

It's because of this call before the fork; it takes longer for every new child process:

def _cleanup():
    # check for processes which have finished
    for p in list(_current_process._children):
        if p._popen.poll() is not None:
            _current_process._children.discard(p)
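
A quick way to see this effect (just a sketch, and timings will obviously vary) is to time each individual p.start() call and watch the per-start latency climb as more children stay alive:

from multiprocessing import Process
from time import sleep, time

def sample():
    sleep(13)

if __name__ == '__main__':
    for i in range(1500):
        t0 = time()
        p = Process(target=sample)
        p.daemon = True
        p.start()                  # _cleanup() scans the whole _children set first
        if i % 100 == 0:
            print i, time() - t0   # per-start latency keeps growing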

As I recall, if the processes hold a manager.Value and are a little heavier, it takes tens of gigabytes of RAM and they start a little more slowly.