How to detect symmetries in 4 integer variables efficiently?

Question

I want to find symmetries in 4 integer variables i, j, k and l. The symmetries are:

1. all four numbers are equal: XXXX
2. three numbers are equal: XXXY, XXYX, XYXX, YXXX
3. two pairs of equal numbers: XXYY, XYXY, XYYX, ...
4. one pair of equal numbers and two different numbers: XXYZ, XYXZ, XYZX, ...
5. all numbers are different

All variables run over a certain non-contiguous range. I use nested if-else statements. The first if checks whether all four variables are equal; if they are, I have case 1. The next if checks whether any pair is equal; if none is, it is case 5. The next if checks for three equal numbers; if true, it is case 2. Otherwise, the last if checks for two pairs of equal numbers; if true, it is case 3, otherwise case 4.

if (!(i == j && j == k && k == l)) {
    if (i == j || i == k || i == l || j == k || j == l || k == l) {
        if ((i == j && j == k) || (i == j && j == l) || (i == k && k == l) || (j == k && k == l)) {
            ... // case 2: do something
        } else {
            if ((i == j && k == l) || (i == k && j == l) || (i == l && j == k)) {
                ... // case 3: do something
            } else {
                ... // case 4: do something
            }
        }
    } else {
        ... // case 5: do something
    }
} else {
    ... // case 1: do something
}

Is there a better way to do this? I mean better in terms of performance, because I have to run this test millions of times.

I'd start by sorting the 4 values. Then it's almost trivial.
– Jabberwocky, Commented Mar 13, 2017 at 9:58
It will depend on the range and distribution of numbers. For example, to take an extreme case, suppose the 4 numbers are random 32-bit integers. In that case, they will almost always be all different, so you would optimize to test for that case first and fall through to the less common cases. At the opposite end of the spectrum, all 4 numbers being equal might be the most common case. In that case your current approach would be fastest.
– samgak, Commented Mar 13, 2017 at 10:00
Copy the values into an array and sort them. Rather than focusing on i, j, k, l, focus on the values in this array, where index [0] is the lowest. Anyway, this question is too broad to be answered even as algorithm/pseudo code, because it is not 1 question but 5 different ones. Also, optimization depends heavily on whether it is always 4 items or whether the number of items should be variable.
– Lundin, Commented Mar 13, 2017 at 10:18
If you do these millions of tests inside a (small) loop body, try to optimize the code size, for example using samgak's or Ari's solution below, so that it fits well into the instruction cache. Otherwise, if it is an external function anyway, you might use a cascade of if-else branches if (i==j) { if (j==k) { if (k==l) { ... } else { ... } } else { if (k == l) {... } else { ... } .... to minimize the number of comparisons.
– Ctx, Commented Mar 13, 2017 at 10:52
If you do go with sorting, use a sorting network. But I think you'll quickly see Ari's answer is better.
– R.. GitHub STOP HELPING ICE, Commented Mar 13, 2017 at 11:52

Ari Hietanen · Accepted Answer · 2017-03-13 10:33:52Z

9

A similar idea to samgak's, but without the need for an external table. Just count the number of matching pairs:

int count = (i==j) + (i==k) + (i==l) + (j==k) + (j==l) + (k==l);

and then switch on the result with the following cases:

switch (count) {
case 0: /* all different */       break;
case 1: /* one pair */            break;
case 2: /* two different pairs */ break;
case 3: /* three the same */      break;
case 6: /* all the same */        break;
}

Again, as already mentioned, your current code might be faster in some cases. Especially if the most common case is the one where all the elements are equal.
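For reference, here is a minimal sketch of the counting idea wrapped in a function. The enum and the name classify_by_count are hypothetical, chosen only for illustration; they are not part of the original answer. Sums of 4 or 5 cannot occur because equality is transitive, so the default branch is treated as unreachable.

```c
#include <assert.h>

/* Hypothetical names for the five cases from the question. */
enum symmetry {
    ALL_DIFFERENT,  /* e.g. 1 2 3 4 */
    ONE_PAIR,       /* e.g. 1 1 2 3 */
    TWO_PAIRS,      /* e.g. 1 1 2 2 */
    THREE_EQUAL,    /* e.g. 1 1 1 2 */
    ALL_EQUAL       /* e.g. 1 1 1 1 */
};

enum symmetry classify_by_count(int i, int j, int k, int l)
{
    /* Each comparison yields 0 or 1 in C, so the sum is the
       number of equal pairs among the six possible pairs. */
    int count = (i == j) + (i == k) + (i == l)
              + (j == k) + (j == l) + (k == l);

    switch (count) {
    case 0: return ALL_DIFFERENT;
    case 1: return ONE_PAIR;
    case 2: return TWO_PAIRS;
    case 3: return THREE_EQUAL;
    case 6: return ALL_EQUAL;
    default:
        /* 4 or 5 equal pairs would contradict transitivity of ==. */
        assert(0 && "unreachable");
        return ALL_DIFFERENT;
    }
}
```

Calling classify_by_count(1, 2, 2, 1) gives TWO_PAIRS, since only i == l and j == k match. Whether this beats the nested ifs still depends on the input distribution, so it is worth profiling both.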

answered Mar 13, 2017 at 10:33

Ari Hietanen


11 Comments

user694733 Over a year ago

Not sure if this makes any practical difference, but in theory you could replace the least likely case with default and reduce number of comparisons by one in the switch statement.

2501 Over a year ago

Nice solution. This works great for a small number of variables.

2501 Over a year ago

This is clearly quadratic. Sorting would give us O(n log n). A hash table would be O(n), with space O(n). I wonder if there is an O(n) solution that uses constant space.

Ari Hietanen Over a year ago

@Kulibo You need to write break at the end of each case; otherwise execution falls through into all the following cases after the first match.

Danny_ds Over a year ago

@Kulibo I don't think you can get much faster than this, because in this example, to calculate count, multiple instructions will be executed in a single clock cycle, and internally switch is highly optimized, using different algorithms depending on the number of cases. Also, to sort the values, an O(n*n) bubble sort would probably be the fastest for 4 items, so I wouldn't care much about big-O in this case.


samgak · Accepted Answer · 2017-03-13 10:31:19Z

5

If you can afford a small (64-byte) lookup table, you can test each pair of values and set one bit per comparison in a number that you then use as an index into your table, e.g.:

int classifySymmetries(int i, int j, int k, int l)
{
     return table[(i == j) |
                  ((i == k) << 1) |
                  ((i == l) << 2) |
                  ((j == k) << 3) |
                  ((j == l) << 4) |
                  ((k == l) << 5)];
}

Then do a switch on the return value. You can use your existing code to generate the table, by substituting a bit test for each comparison, or generating dummy i j k l values that satisfy each bit pattern from 0 to 63.
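As a rough sketch of the second option (generating dummy i j k l values), the table can be filled by enumerating all tuples over four distinct values, which is enough to realize every possible equality pattern. The names pair_mask, classify_reference, init_table and the enum constants are hypothetical; only classifySymmetries comes from the snippet above. Index patterns that contradict transitivity never occur and are left marked UNREACHABLE.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical class labels; UNREACHABLE marks impossible bit patterns. */
enum { ALL_DIFFERENT, ONE_PAIR, TWO_PAIRS, THREE_EQUAL, ALL_EQUAL, UNREACHABLE };

static unsigned char table[64];

/* One bit per pairwise comparison, in the same order as the answer. */
static unsigned pair_mask(int i, int j, int k, int l)
{
    return (unsigned)(i == j)
         | ((unsigned)(i == k) << 1)
         | ((unsigned)(i == l) << 2)
         | ((unsigned)(j == k) << 3)
         | ((unsigned)(j == l) << 4)
         | ((unsigned)(k == l) << 5);
}

/* Same decision logic as the nested ifs in the question. */
static int classify_reference(int i, int j, int k, int l)
{
    if (i == j && j == k && k == l)
        return ALL_EQUAL;
    if (!(i == j || i == k || i == l || j == k || j == l || k == l))
        return ALL_DIFFERENT;
    if ((i == j && j == k) || (i == j && j == l) ||
        (i == k && k == l) || (j == k && k == l))
        return THREE_EQUAL;
    if ((i == j && k == l) || (i == k && j == l) || (i == l && j == k))
        return TWO_PAIRS;
    return ONE_PAIR;
}

/* Call once before using classifySymmetries. */
void init_table(void)
{
    memset(table, UNREACHABLE, sizeof table);
    for (int i = 0; i < 4; ++i)
        for (int j = 0; j < 4; ++j)
            for (int k = 0; k < 4; ++k)
                for (int l = 0; l < 4; ++l)
                    table[pair_mask(i, j, k, l)] =
                        (unsigned char)classify_reference(i, j, k, l);
    /* Sanity check: no bits set means all different, all bits set means all equal. */
    assert(table[0] == ALL_DIFFERENT && table[63] == ALL_EQUAL);
}

int classifySymmetries(int i, int j, int k, int l)
{
    return table[pair_mask(i, j, k, l)];
}
```

Only 15 of the 64 entries are ever reachable (one per way of partitioning four items into groups), so the table could also be written out by hand as a constant array if run-time initialization is undesirable.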

This approach always performs exactly 6 comparisons. Bear in mind that sorting 4 values takes 5 comparisons in the worst case (there are 4! = 24 possible orderings and each comparison yields at most 1 bit of information, so at least log2(24) ≈ 4.58 are needed). You then still have to run tests on the sorted values on top of that.
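For comparison, here is a sketch of the sorting route suggested in the comments, assuming a 5-comparator sorting network followed by three adjacent equality tests (8 comparisons in total). The names order, classify_sorted and the enum constants are hypothetical and only serve the illustration.

```c
#include <assert.h>

/* Hypothetical class labels for the five cases. */
enum { ALL_DIFFERENT, ONE_PAIR, TWO_PAIRS, THREE_EQUAL, ALL_EQUAL };

/* Compare-exchange: afterwards *a <= *b. */
static void order(int *a, int *b)
{
    if (*a > *b) {
        int t = *a;
        *a = *b;
        *b = t;
    }
}

int classify_sorted(int i, int j, int k, int l)
{
    /* 5-comparator sorting network for 4 inputs:
       (0,1) (2,3) (0,2) (1,3) (1,2). */
    order(&i, &j);
    order(&k, &l);
    order(&i, &k);
    order(&j, &l);
    order(&j, &k);
    assert(i <= j && j <= k && k <= l);

    /* After sorting, equal values are adjacent. */
    int e01 = (i == j), e12 = (j == k), e23 = (k == l);

    switch (e01 + e12 + e23) {
    case 0:  return ALL_DIFFERENT;
    case 1:  return ONE_PAIR;
    case 3:  return ALL_EQUAL;
    default: /* 2 adjacent matches: a run of three if the middle pair
                matches, otherwise two separate pairs. */
        return e12 ? THREE_EQUAL : TWO_PAIRS;
    }
}
```

This does more comparisons than the 6 of the table lookup, which illustrates the point above; sorting mainly pays off if the sorted values are needed afterwards anyway.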

Whether a lookup table beats your current approach will depend on the distribution of values and on other factors such as memory access times. You should profile to confirm.

edited Mar 13, 2017 at 10:31

answered Mar 13, 2017 at 10:19

samgak


2 Comments

Ctx Over a year ago

Do you mean (i == j) >> 0, (i == k) >> 1, (i == l) >> 2...? I cannot make sense of (i == l) & 4, which is always 0, non?

samgak Over a year ago

Yes, but shifting the other way

Jason · Accepted Answer · 2017-03-13 10:29:18Z

0

A better way is to use a map:

#include <iostream>
#include <map>
using namespace std;


int main()
{
    int i, j, k, l;
    cin >> i >> j >> k >> l;

    std::map<int, int> count;

    int outcomes[5] = { 0, 0, 0, 0, 0 };

    // Store the values in the map
    count[i]++;
    count[j]++;
    count[k]++;
    count[l]++;

    // tally types of outcome according to the map
    for (std::map<int, int>::iterator iter = count.begin(); iter != count.end(); ++iter)
    {
        outcomes[iter->second] ++;
    }

    // print out "1 of a kind" count, up to "4 of a kind"
    // this is just for visualization
    for (int i = 1; i <= 4; ++i)
    {
        cout << i << " of a kind = " << outcomes[i] << endl;
    }

    // your bit here, it checks on just the "outcomes" array
    if(outcomes[4] > 0) // 4 of a kind
    {
    }
    else if(outcomes[3] > 0) // 3 of a kind
    {
    }
    else if(outcomes[2] > 1) // two pair
    {
    }
    else if(outcomes[2] > 0) // one pair
    {
    }
    else // singles only
    {
    }

    cin.ignore();
    cin.get();

    return 0;
}

This approach would be far more extensible too, if you wanted to extend it beyond 4 choices.

edited Mar 13, 2017 at 10:29

answered Mar 13, 2017 at 10:23

Jason


1 Comment

Jabberwocky Over a year ago

This solution probably has too much overhead and therefore might not be very efficient for as few as 4 values.
