I have a function written in 64 bit x86 assembly (AT&T syntax for gcc and GAS) which performs some SSE2 operations. I've checked the result by using gdb with disassembly and looking at the register values, so I know it's producing the correct result. After the retq instruction, I get a segementation fault. Since I'm new to assembly (and never took any classes on it), I'm guessing I'm not handling the function/main program interface correctly. The function takes in 2 pointers and an int and is expected to return a float. This is how I handle the inputs/output in my assembly function:
float foo(float *x,float *y,unsigned int s)
{
__asm__ __volatile__(
"movl -0x14(%%rbp),%%ecx \n\t" //ecx = s
"movq -0x8(%%rbp),%%rax \n\t" //rax -> x
"movq -0x10(%%rbp),%%rdx \n\t" //rdx -> y
"subq $4,%%rsp \n\t" //function result
#sse2 operations that end up with the answer in xmm4...
"movss %%xmm4,(%%rsp) \n\t" //store result
"flds (%%rsp) \n\t" //load function result
"addq $4,%%rsp \n\t" //adjust stack
"ret \n\t"
:
:"g"(s)
:"%ecx","%rax","%rdx"
);
}
And here is the line that seems to cause the segfault (which is the instruction right after ret in the disassembly):
0x00007fffffffe0d0 in ?? ()
=> 0x00007fffffffe0d0: 00 00 add %al,(%rax)
I have no idea why it's adding the value in rax's low bit back to rax after executing my function, but it seems to be crashing things. Am I not allowed to use rax in my assembly function even though it's general purpose and I'm declaring it clobbered?
I'm not sure if you need to see this part, but this is how gcc expects to handle the function; I've included the disassembly of the line that calls my function :
#asm dealing with function inputs
callq 0x400520 <foo>
movss %xmm0,-0x48(%rbp)
mov -0x48(%rbp),%eax
mov %eax,-0x34(%rbp)
Which brings me to my second question, why is it arbitrarily moving the value in xmm0 to two places? Should I have had my function end up with a result in xmm0 or does this instead mean I should avoid using xmm0? I'm very confused and would appreciate any help. Thanks in advance for anyone who took time to read my noob post :)