Algorithm for optimal pairing strategy of items in two ordered lists while maintaining order

Question

I have the the following two ordered lists of items:

A = ["apples","oranges","bananas","blueberries"]
B = ["apples","blueberries","oranges","bananas"]

Each item has a score which is equal to the length of the string, so...

apples = 6 points
oranges = 7 points
bananas = 7 points
blueberries = 11 points

I want to create a list of pairs (A,B) that contain either the index from A or the index from B or both for a pair, without changing the sequential order that they appear in each list.

Each pair then has the score of its item, so by pairing items we halve the total score of both items. I want to get the combination of pairs that have the smallest score.

For example, in the two lists above, each item can be paired but some pairs prevent us from pairing others because we can't pair both without changing the order of one of the lists. For example we can't pair "blueberries" and also pair "oranges" as "oranges" comes before "blueberries" in one list but after it in the other. We could only pair one or the other. Each list could also have the same item multiple times.

The optimal result for the above problem is...

    +---+---+----------------+-------+
    | A | B | Value          | Score |
+---+---+---+----------------+-------+
| 0 | 0 | 0 | "apples"       | 6     |
+---+---+---+----------------+-------+
| 1 | - | 1 | "blueberries"  | 11    |
+---+---+---+----------------+-------+
| 2 | 1 | 2 | "oranges"      | 7     |
+---+---+---+----------------+-------+
| 3 | 2 | 3 | "bananas"      | 7     |
+---+---+---+----------------+-------+
| 4 | 3 | - | "blueberries"  | 11    |
+---+---+---+--------+-------+-------+
                     | Total | 42    |
                     +-------+-------+

I think the answer is along the lines of:

Find the pairs
Split those pairs into possible groups of pairs
Calculate the group with the greatest weight

I can determine which pairs exclude which other pairs visually by joining the pairs from one list to another, if that join intersects another join it excludes it. I'm not sure how this would be done programmatically though.

How can I solve this problem?

I think the question is a bit confusing, since you put only the indices in the example, then we have to go through several level of translation to understand why the number in the bracket. — nhahtdh
Apologies, I will try edit the question to help out. I've been trying to simplify the problem as much as possible... — Stuart Wakefield

VoidStar VoidStar · Accepted Answer · 2012-09-01T22:51:55

Note, my answer assumes that there are only 2 lists and that items can be at most one time in each list.

The first thing you do is to build a map/dictionary with the item as the key and a pair of integers as the value. This map will contain the indices of one item in both arrays. Run through the first list and put the index in the first value of the pair and -1 in the second. Do the same for the second list but obviously put the index in the second value. Something like this :

pairs = map<string, pair<int, int>>

i = 0
while i < A.Length
    pairs[A[i]].first = i
    pairs[A[i]].second = -1
    i++
i = 0
while i < B.Length
    pairs[B[i]].second = i
    i++

Now, you have to establish what are the possible pair combinations you can do. This pseudo code creates a list of all the possible combinations :

i = 0
while i < A.Length
    j = i
    index = -1
    combination = list<pair>
    while j < A.Length
        pair = pairs[A[j]]
        if pair.second > index
            combination.add(pair)
            index = pair.second
        j++
    combinations.add(combination)
    i++

Now, all that's left is to weigh the possible combinations but don't forget to include the items that weren't paired.

EDIT

What I'm thinking now is to build a map of all the possible pairs for each item. Something that would yield the following result.

oranges: [0,2][0,5][5,2][5,5][0,-1][-1,2][5,-1][-1,5]
apples: [1,1][1,-1][-1,1]
bananas: [2,3][2,-1][-1,3]
...

Using the exclude logic, we can group these pairs and produce a map of list of list of pairs.

oranges: [0,2][5,-1][-1,5], [0,5][5,-1][-1,2], ..., [0,-1][5,-1][-1,2][-1,5]
apples: [1,1], [1,-1][-1,1]
...

Now, only one list of pairs per item can be used in the result output and some lists exclude each other between different items. What's left is to come up with an algorithm to weigh each possibilities.

Algorithm for optimal pairing strategy of items in two ordered lists while maintaining order

2 Answers

1. Create a list of all of the pairs

2. Get the scores for each pair

3. Create a list of pairs that each pair excludes

4. Calculate the total cumulative score for each pair minus the pairs it excludes

5. Pick only the best pairs

6. Build the result