Why does compiler align N byte data types on N byte boundaries?

Question

I don't understand why the compiler aligns int on 4 byte boundaries, short on 2 byte boundaries and char on 1 byte boundaries. I understand that the if the data bus width of the processor is 4 bytes, it takes 2 memory read cycles for reading an int from an address not a multiple of 4.
So, why doesn't the compiler align all data on 4 byte boundaries? For eg.:

struct s {
 char c;
 short s;
};

Here, 1) why does the compiler align short on a 2 byte boundary? Assuming that the processor can fetch 4 bytes on a single memory read cycle, wouldn't it take only 1 memory read cycle to read short in the above case even if there is no padding between char and short?

2) Why doesn't the compiler align short on a 4 byte boundary?

the purpose of structure padding for alignment is to fetch the data in one machine read. In your case, the struct will be 4 and not 8. You can still fetch the char OR short in one cycle bu using masking. So while fetching the char the processor will fetch 4 bytes and mask out 24 bits.<br> However, if you had something like this:<br> struct s { char c; int i}; then the size would get 8 byte coz you need full 4 bytes for the integer to be fetched in read cycle. — Nikhil Vidhani
@NikhilVidhani: My question is not regarding the purpose of padding. My question is about why the byte is padded between char and short and not after short. Assuming the processor can fetch 4 bytes in a single cycle, no matter where the padding happens, the short can be fetched in 1 cycle, right? So, what's the savings that we get in the above case? I guess there is some hardware level explanation for this. — linuxfreak
@linuxfreak going by my instincts... i think it is easier to fetch (mask) last 16 bits than the bits 9-24 if short were to occupy byte 2 and 3. — Nikhil Vidhani
@NikhilVidhani - Yeah.. I think so. To fetch the bits 9-24, the processor has to do bit shifting in addition to masking. — linuxfreak

MSalters MSalters · Accepted Answer · 2014-09-12T10:58:14

These objects have to fit in arrays. An array is contiguous. Thus, if the first element is N byte aligned, and all objects are N bytes big, then necessarily all objects in the array are N byte aligned too.

So, if short would be 2 bytes big, but 4 bytes aligned, there would be 2 byte holes between all shorts in an array which is forbidden.

You do see that your assumption is slightly flawed. I could make a struct with 26 chars, and it wouldn't be 26 byte aligned. It could start anywhere. An N byte type with have an alignment equal to N or dividing N.

Why does compiler align N byte data types on N byte boundaries?

4 Answers