Intel Pin Modifying Instruction Addresses

Question

I am trying to do some instruction analysis of an executable-binary using Intel Pin-tool instrumentation.

Upon analyzing the executed instructions in my Pin-tool, I am observing that the instruction address (program counter) value-range received is very different from what I otherwise observe when analyze the disassembly of the compiled code using objdump -d -S <binary>. I am testing it on the standard Linux /bin/ls executable-binary.

As per my understanding, Pin modifies the original binary to put its own "hooks" to gather execution related information which invoke call-backs in our desired Pin-tool for analysis. So, this naturally should lead to the actually executed binary as being different from the original. Unfortunately, I do not know other under-the-hood stuff about Pin.

I was wondering if there was any way to preserve the original code layout, or obtain some correspondence between the old binary and new binary instruction addresses?

Peter Cordes Peter Cordes · Accepted Answer · 2020-03-18T14:12:53

Modern distros use PIE executables that are ELF shared objects that get relocated at runtime. objdump only shows you addresses relative to the image base. What is the -fPIE option for position-independent executables in gcc and ld? and 32-bit absolute addresses no longer allowed in x86-64 Linux?

You can disable ASLR like GDB does so it's always relocated to the same place, like 0x55555..., but it still won't match the objdump address.

You could I think use objdump --adjust-vma=offset to relocate your disassembly.

Or you could build non-PIE executables with gcc -no-pie -fno-pie -O3 so objdump will know the real run-time address of every instruction.

Intel Pin Modifying Instruction Addresses

2 Answers