Pick random bit from 32bit value in O(1) if possible

Question

I have a 32bit random value (say 631).

0...0000001001110111

Each of these bits is a flag. I would like to return a random flag from these bits in, if at all possible, O(1) operation. How would I select the bit position 0, 1, 2, 4, 5, 6 or 9 (or it's corresponding value 1, 2, 4, 16, 32, 64, 512) from the given value 631? Preferably with as least possible bias towards some bits.

Things I came up with:

- Shift value right random number of bits (max. 10 in this case)
- See if LSB is set
  - If it is: got a bit position (last shifted number of bits); done
  - if not:
    - If resulting value == 0; start over
    - If resulting value != 0, go back to shifting random bits again

Above is not O(1) and possibly need multiple iterations if we happen to only 'hit' the 0 bits.

- Mask (and) with random value
- Repeat until power of 2 is left or start over when value is 0.

Again, above is not O(1) unfortunately.

I'm pretty sure this must be possible with some bithisfting/masking/magic somehow...

Edit:

As CodeCaster suggested; this will give me all set bit values:

int myvalue = 631;
var matchingmasks = Enumerable.Range(0, 32)
                              .Select(i => 1 << i)
                              .Where(i => (myvalue & i) == i)
                              .ToArray();

From the resulting array I can pick a random element and I'll have found my 'random' bit (flag) from the given value. However, this still requires a (hidden, because Linq) for-loop, 'brute-forcing' each possible bit, memory allocations for the resulting array etc.

Any random generation algorithm (including the simplest one) will generate a random value in O(1) — user586399
@:deleted: I don't want the binary representation; please read the question. I want to pick a (one) bit from the set bits and either have it's position (e.g. bit 3) OR it's "value" (e.g. 4 for bit 3) @Kilanny: I don't want a random number; I want a random bit from a given number. — J. Doe
If you have the binary representation then you could manipulate that to get the bits, or the position of the bits. IF I understand correctly you have 631 want as the input and you want the output to be 1, 2, 4 , 16, 32 — Donald Jansen
@DonaldJansen: you could manipulate that to get the bits: yes; the question is how. (also: 64 and 512 are a possible output you left out for the given value of 631). — J. Doe
Also: avoid the temptation to say "that's not the fastest possible". The fastest possible is not your goal. You are probably unwilling to spend even a small amount like ten million dollars to develop custom hardware to solve this problem as fast as possible. Your goal is "fast enough given my budget and other constraints". Since we know neither your performance requirements nor your budget, we don't know what meets that goal. — Eric Lippert

Eric Lippert Eric Lippert · Accepted Answer · 2016-02-10T14:43:37

First off, I would suggest doing this the easy, straightforward, obvious way that you suggest in the question: make an array of values, choose an element at random. Yes this allocates memory and whatnot. Optimize for the code being readable and correct first; only when you have a demonstrated performance problem should you optimize it.

If you do want to optimize it down to bit twiddling, this page is my go-to resource: http://graphics.stanford.edu/~seander/bithacks.html

The algorithms you'll want here are:

first, pick your favourite algorithm for determining the Hamming weight -- that is, "how many bits are on?" Call that number n.
Now pick a random number r from 1 to n
Now read the algorithm called "select the bit position with the given count". This takes your number r and gives you the bit position of the rth true bit starting from the high end. The code given on the page is for longs; it should be straightforward to modify it for ints.

I note that a key feature of many of these algorithms is that they are branchless. When you are trying to wring the last ounce of performance out of an algorithm, remember that every "if" kills performance. An "if" means that there is code in the cache that is NOT running, because you branched away from it, and therefore you are making a cache miss more likely. An "if" means there is an opportunity for the branch predictor to make the wrong choice. At the CLR level, every "if" means more basic blocks, which means more work for the jitter to do its flow analysis. And so on.

Pick random bit from 32bit value in O(1) if possible

6 Answers