4
votes

I think this is a simple thing for a lot of you, but I have a very limited knowlegde of regex at the moment. I want to match everything except a double digit number in a string.

For example:

TEST22KLO4567

QE45C2C

LOP10G7G400

Now I found out the regex to match the double digit numbers: \d{2}

Which matches the following:

TEST22KLO4567

QE45C2C

LOP10G7G400

Now it seems to me that it would be fairly easy to turn that regex around to match everything BUT "\d{2}". I searched a lot but I can't seem to get it done. I hope someone here can help.

3
Am I right that that you want to match everything but the first double digit number?Dyragor
To be honest I'm not 100% sure. I use it in Channable, which is a web-based feed management tool. So my guess is Javascript.user3457337
Do all strings have one 2-digit occurrence or could it be more?revo

3 Answers

2
votes

This only works if your regex engine supports look behinds:

^.+?(?=\d{2})|(?<=\d{2}).+$

Explanation:

The | separates two cases where this would match:

  • ^.+?(?=\d{2})

This matches everything from the start of the string (^) until \d{2} is encountered.

  • (?<=\d{2}).+$

This matches the end of the string, from the place just after two digits.

If your regex engine doesn't support look behinds (JavaScript for example), I don't think it is possible using a pure regex solution.

You can match the first part:

^.+?(?=\d{2})

Then get where the match ends, add 2 to that number, and get the substring from that index.

1
votes

You are right rejecting a search in regex is usually rather tricky.

In your case I think you want to have [^\d{2}], however, this is tricky as your other strings also contain two digits so your regex using it won't select them.

I would go with this regex (using PCRE 8.36 but should work also in others):

\*{2}\w*\*{2}

Explanation:

\*{2} .... matches "*" literally exactly two times
\w* .... matches "word character" zero or unlimited times

0
votes

Found one regex pretty straightforward :

^(.*?[^\d])\d{2}([^\d].*?)$

Explanations :

  • ^ : matches the beginnning of a line
  • (.*?[^\d]) : matches and catches the first part before the two numbers. It can contain anything (.*?) but needs to end with something different to a number ([^\d]) so we ensure that there is only 2 numbers in the middle
  • \d{2} : is the part you found yourself
  • ([^\d].*?) : is the symetric of (.*?[^\d]) : begins with something different from a number ([^\d]) and matches anything next.
  • $ : up to the end of the line.

To test this reges you can use this link

It will match the first occurence of double digit, but because OP said there was only one it does the job correctly. I expect it to work with every regex engine as nothing too complex is used.