Least distance between two values in a large binary tree with duplicate values

Question

Given a binary tree that may contain duplicate values, find the minimum distance between two given values. Note that the binary tree can be large.

For example:

        5
      /   \
    1       7
   / \     / \
  4   3   8   2
 / \
1   2

The function should return 2 for the inputs 1 and 2.
(If there were no duplicates, we could find the LCA and then calculate the distance.)

I've written the following code, but it fails when the two values are in different subtrees, and also in the cases below:

root = 1, root.left = 4, root.left.left = 3, root.left.right = 2, root.left.left.left = 1
root = 1, root.left = 4, root.left.left = 3, root.left.left.left = 1, root.left.left.right = 2

void dist(struct node* root,int& min,int n1,int n2,int pos1,int pos2,int level) {
    if(!root)
        return;
    if(root->data==n1){
        pos1 = level;
        if(pos2>=0)
            if(pos1-pos2 < min)
                min = pos1-pos2;
    }
    else if(root->data==n2){
        pos2 = level;
        if(pos1>=0)
            if(pos2-pos1 < min)
                min = pos2-pos1;
    }
    dist(root->left,min,n1,n2,pos1,pos2,level+1);
    dist(root->right,min,n1,n2,pos1,pos2,level+1);
}

I think at each node we could check whether that node is the LCA of the two values. If it is, we compute the distance and update min accordingly, but this would take O(n²).

What have you tried so far? This is an interesting question, but unless you've demonstrated that you've actually tried to solve it yourself it's not appropriate to post the question here. — templatetypedef
@templatetypedef The main problem is when the two values are in different subtrees and when root= 1, root.left= 4, root.left.left=3, root.left.right=2, root.left.left.left=1 and root= 1, root.left= 4, root.left.left=3, root.left.left.left=1, root.left.left.right=2. In both the cases answer should be 2. — da3m0n
I think identifying an approach (e.g. LCA) and its weaknesses counts as trying to solve it. — mcdowella

Vikram Bhat · Accepted Answer · 2013-11-24T06:46:11

The following algorithm solves the problem:

Traverse the whole tree, compute the path from the root to each node as a binary string (0 = go left, 1 = go right), and store the paths in a hash map keyed by node value.

E.g. for your tree the hash map will be

1 => 0,000
2 => 001,11
3 => 01
...
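
A minimal sketch of this first step, assuming a node type with `data`, `left` and `right` members as in the question. The names `PathMap`, `collectPaths` and `buildPathMap` are made up for illustration, and the recursion assumes the tree is not so deep that it overflows the call stack:

```cpp
#include <string>
#include <unordered_map>
#include <vector>

// Assumed node layout; the question only shows data, left and right.
struct node {
    int data;
    node* left;
    node* right;
};

// Maps each value to the root-to-node paths of all nodes holding it.
using PathMap = std::unordered_map<int, std::vector<std::string>>;

// Pre-order walk that records the current path for every node.
// '0' means "went left", '1' means "went right"; the root has path "".
void collectPaths(const node* root, std::string& path, PathMap& paths) {
    if (!root) return;
    paths[root->data].push_back(path);
    path.push_back('0');                 // descend into the left child
    collectPaths(root->left, path, paths);
    path.back() = '1';                   // switch to the right child
    collectPaths(root->right, path, paths);
    path.pop_back();                     // restore the caller's path
}

PathMap buildPathMap(const node* root) {
    PathMap paths;
    std::string path;
    collectPaths(root, path, paths);
    return paths;
}
```

Each stored string is as long as the node's depth, so for a large, deep tree the map can take noticeably more memory than the tree itself.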

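To answer a query for the distance between (u, v), take every pair of paths (one for u, one for v) and compute the distance for each pair: remove the common prefix from the two strings, then sum the remaining lengths. The answer is the minimum over all pairs.

E.g. u = 1 and v = 2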

distance(0,001) = 2
distance(0,11) = 3
distance(000,001) = 2
distance(000,11) = 5 

min = 2
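
The query step could be sketched as follows, assuming the path strings for both values have already been collected. The function names `pathDistance` and `minDistance` are hypothetical, and returning -1 when a value is missing is a convention chosen here, not part of the answer:

```cpp
#include <algorithm>
#include <climits>
#include <string>
#include <vector>

// Distance between two nodes given their root-to-node paths:
// skip the common prefix, then add the lengths of what remains.
int pathDistance(const std::string& a, const std::string& b) {
    std::size_t common = 0;
    while (common < a.size() && common < b.size() && a[common] == b[common])
        ++common;
    return static_cast<int>((a.size() - common) + (b.size() - common));
}

// Minimum distance over every (path of u, path of v) pair.
// Returns -1 if either value has no paths (assumed convention).
int minDistance(const std::vector<std::string>& pathsU,
                const std::vector<std::string>& pathsV) {
    int best = INT_MAX;
    for (const auto& pu : pathsU)
        for (const auto& pv : pathsV)
            best = std::min(best, pathDistance(pu, pv));
    return best == INT_MAX ? -1 : best;
}
```

With the paths above, `minDistance({"0", "000"}, {"001", "11"})` evaluates the same four pairs and yields 2. This checks every pair, so a query costs time proportional to the number of occurrences of u times the number of occurrences of v, times the path length.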

Note: I think the second step can be made more efficient, but I need to do more research.
