Can (Should?) a native shared library which creates its own threads support the using process exiting 'without warning'?

Question

I work on a product that's usually built as a shared library.

The using application will load it, create some handles, use them, and eventually free all the handles and unload the library.

The library creates some background threads which are usually stopped at the point the handles are freed.

Now, the issue is that some consuming applications aren't super well-behaved, and will fail to free the handles in some cases (cancellation, errors, etc). Eventually, static destructors in our library run, and crash when they try to interact with the (now dead) background thread(s).

One possibility is to not have any global objects with destructors, and so to avoid running any code in the library during static destruction. This would probably solve the crash on process exit, but it would introduce leaks and crashes in the scenario where the application simply unloads the library without freeing the handles (as opposed to exiting), as we wouldn't ensure that the background threads are actually stopped before the code they were running was unloaded.

More importantly, to my knowledge, when main() exits, all other threads will be killed, wherever they happened to be at the time, which could leave locks locked, and invariants broken (for example, within the heap manager).

Given that, does it even make sense to try and support these buggy applications?

Comment (Ripi2): You are not responsible for others' failures. At most, if possible, give a hint: a short message somebody can read.

Harry Johnston · Accepted Answer · 2016-12-16T02:53:04

Yes, your library should allow the process to exit without warning. Perhaps in an ideal world every program using your library would carefully track the handles and free them all when it exits for any reason, but in practice this isn't a realistic requirement. The code path that is triggering the program exit might be a shared component that isn't even aware that your library is in use!

In any case, it is likely that your current architecture has a more general problem, because it is inherently unsafe for static destructors to interact with other threads.

From DllMain entry point in MSDN:

Because DLL notifications are serialized, entry-point functions should not attempt to communicate with other threads or processes. Deadlocks may occur as a result.

and

If your DLL is linked with the C run-time library (CRT), the entry point provided by the CRT calls the constructors and destructors for global and static C++ objects. Therefore, these restrictions for DllMain also apply to constructors and destructors and any code that is called from them.

In particular, if your destructors attempt to wait for your threads to exit, that is almost certain to deadlock in the case where the library is explicitly unloaded while the threads are still running. If the destructors don't wait, the process will crash when the code the threads are running disappears. I'm not sure why you aren't seeing that problem already; perhaps you are terminating the threads? (That's not safe either, although for different reasons.)

There are a number of ways to resolve this problem. Probably the simplest is the one you already mentioned:

One possibility is to not have any global objects with destructors, and so to avoid running any code in the library during static destruction.

You go on to say:

[...] but it would introduce leaks and crashes in the scenario where the application simply unloads the library without freeing the handles [...]

That's not your problem! The library will only be unloaded if the application explicitly chooses to do so; obviously, and unlike the earlier scenario, the code in question knows your library is present, so it is perfectly reasonable for you to require that it close all your handles before doing so.
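As a rough illustration of the "no global objects with destructors" option, here is a minimal sketch of one common way to get there: allocate the global on first use and deliberately never delete it. The names registry(), register_name() and lookup() are hypothetical and stand in for whatever global state your library actually keeps.

```cpp
#include <cassert>
#include <map>
#include <string>

// Hypothetical global table, used only for illustration.
std::map<std::string, int>& registry() {
    // Allocated on first use and intentionally never deleted. The static is a
    // raw pointer, which has no destructor, so none of this library's code
    // runs during static destruction. Initialization of a function-local
    // static is thread-safe since C++11.
    static auto* instance = new std::map<std::string, int>();
    return *instance;
}

void register_name(const std::string& name, int value) {
    registry()[name] = value;
}

// Returns the stored value, or -1 if the name was never registered.
int lookup(const std::string& name) {
    auto it = registry().find(name);
    return it == registry().end() ? -1 : it->second;
}
```

The trade-off is that the memory is only reclaimed when the process exits, and leak checkers may report it. It also does nothing about threads that are still running when the library is unloaded, which is why an explicit shutdown call is still worth having.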

Ideally, however, you would provide an uninitialization function that closes all the handles automatically, rather than requiring the application to close each handle individually. Explicit initialization and uninitialization functions also allow you to safely set up and free global resources, which is usually more efficient than doing all of your setup and teardown on a per-handle basis, and is certainly safer than using global objects.

(See the link above for a full description of all the restrictions applicable to static constructors and destructors; they are quite extensive. Constructing all your globals in an explicit initialization routine, and destroying them in an explicit uninitialization routine, avoids the whole messy business.)
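To make the explicit initialization/uninitialization idea concrete, here is a minimal sketch. Everything in it is an assumption for illustration, not your actual API: the names lib_init(), lib_open(), lib_close() and lib_uninit(), the Handle type, and the single polling worker thread. It also assumes the application calls lib_init() and lib_uninit() from ordinary code, and not concurrently with any other library call.

```cpp
#include <atomic>
#include <cassert>
#include <chrono>
#include <mutex>
#include <thread>
#include <unordered_set>

struct Handle { int id; };  // hypothetical handle type

namespace {

struct LibState {
    std::mutex mutex;                     // guards `handles` and `next_id`
    std::unordered_set<Handle*> handles;  // every handle that is still open
    std::atomic<bool> stop{false};        // tells the worker to finish
    std::thread worker;                   // the library's background thread
    int next_id = 1;
};

// A plain pointer has no destructor, so nothing runs at static destruction.
LibState* g_state = nullptr;

void worker_loop(LibState* s) {
    while (!s->stop.load()) {
        std::this_thread::sleep_for(std::chrono::milliseconds(1));
    }
}

}  // namespace

// Call once before any other library function. Returns false if already initialized.
bool lib_init() {
    if (g_state) return false;
    g_state = new LibState;
    g_state->worker = std::thread(worker_loop, g_state);
    return true;
}

// Returns a new handle, or nullptr if the library is not initialized.
Handle* lib_open() {
    if (!g_state) return nullptr;
    std::lock_guard<std::mutex> lock(g_state->mutex);
    Handle* h = new Handle{g_state->next_id++};
    g_state->handles.insert(h);
    return h;
}

void lib_close(Handle* h) {
    if (!g_state || !h) return;
    std::lock_guard<std::mutex> lock(g_state->mutex);
    if (g_state->handles.erase(h) == 1) delete h;
}

// Stops the worker and frees every handle the application forgot to close.
// Returns how many handles were still open.
int lib_uninit() {
    if (!g_state) return 0;
    g_state->stop.store(true);
    assert(g_state->worker.joinable());
    g_state->worker.join();  // an ordinary call, not a static destructor, so waiting is fine
    int leaked = 0;
    for (Handle* h : g_state->handles) { delete h; ++leaked; }
    delete g_state;
    g_state = nullptr;
    return leaked;
}
```

If the application exits without ever calling lib_uninit(), nothing in the library runs at exit: the state is simply abandoned and the operating system reclaims it. If the application wants to unload the library, it calls lib_uninit() first, and the thread is joined while its code is still loaded.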
