Under what circumstances does control pass from userspace to the Linux kernel space?

Question

I'm trying to understand which events can cause a transition from userspace to the linux kernel. If it's relevant, the scope of this question can be limited to the x86/x86_64 architecture.

Here are some sources of transitions that I'm aware of:

System calls (which includes accessing devices) causes a context switch from userspace to kernel space.
Interrupts will cause a context switch. As far as I know, this also includes scheduler preemptions, since a scheduler usually relies on a timer interrupt to do its work.
Signals. It seems like at least some signals are implemented using interrupts but I don't know if some are implemented differently so I'm listing them separately.

I'm asking two things here:

Am I missing any userspace->kernel path?
What are the various code paths that are involved in these context switches?

I'm pretty sure the only way you can change from user mode to kernel mode on x86 is a SYSENTER or an interrupt, both of which pass through arch/x86/entry/entry_{32,64}.S. System calls can be done as a SYSENTER or INT 80h. Some signal's are caused by interrupts from the processor (e.g. SIGSEGV), but entry from user space to kernel space is done using an interrupt. — Mikel Rychliski
How high would you say your confidence level is? :) I will wait a while and see if someone comes up with something else that could happen, but if not, why not put your comment as an answer? I might just accept it. :) — nitzanms
I think you've conflated two separate concepts - whether or not control passes to kernel code and kernel resources, and whether not a context switch occurs. Most modern operating systems, Linux included, map the kernel memory into every user process, but with restricted permissions. This is done specifically so that interrupts can run without causing a context switch, and instead just a processor state change that allows instructions to access kernel memory. IIRC, the only time a full context switch occurs is when a kernel thread is scheduled and run, eg to perform some deferred processing. — antiduh
To futher illustrate what I mean about one of the other mechanisms being used, consider a few example signals. In the case of a null pointer dereference causing a SIGSEGV, the kernel transition here is actually caused by a page fault, which is a type of exception. In the case of a process raising a signal itself, the kernel transition is caused by the kill() system call entry. In the case of a signal being sent from a process running on another CPU while the target is running in userspace, the kernel transition is caused by an Inter-Processor Interrupt. — caf
My point is just that signals themselves do not effect a switch to kernel mode, they always use one of the underlying mechanisms (system call, asychronous interrupt, exception). — caf

missimer missimer · Accepted Answer · 2015-07-23T18:44:48

One you are missing: Exceptions

(which can be further broken down in faults, traps and aborts)

For example a page fault, breakpoint, division by zero or floating-point exception. Technically, one can view exceptions as interrupts but not really the way you have defined an interrupt in your question.

You can find a list of x86 exceptions at this osdev webpage.

With regard to your second question:

What are the various code paths that are involved in these context switches?

That really depends on the architecture and OS, you will need to be more specific. For x86, when an interrupt occurs you go to the IDT entry and for SYSENTER you get to to address specified in the MSR. What happens after that is completely up to the OS.

Under what circumstances does control pass from userspace to the Linux kernel space?

2 Answers