I have a project for searching for bytecode instructions and I would like to expand it to allow the use of Regular Expressions for matching patterns.
The gist of what I want to do is have custom character classes/sets so I can have something such as ISTORE match any of the following instructions:
ISTORE ISTORE_0 ISTORE_1 ISTORE_2 ISTORE_3
And then something similar for ILOAD ... ILOAD_n etc.
ISTORE and ILOAD would be similar to metacharacters like \s where they truly stand for multiple characters.
Basically, I am just looking for a jumping-off point so I can find a way to implement my own metacharacters.
Either ISTORE(_\d)? or (ISTORE_0|ISTORE_1|ISTORE_2|ISTORE_3|ISTORE) would match those instructions. So you are asking how to extend the actual regex engine to handle this, rather than adding a preprocessing search-and-replace step?
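For comparison, here is a rough sketch of what the preprocessing route could look like: expand each custom name into an ordinary regex before handing the pattern to the engine. It is written in Python only for brevity, and it assumes the instructions are searched as whitespace-separated mnemonic text. The names MACROS, expand, and compile_bytecode_pattern are made up for this illustration and are not part of any library.

```python
import re

# Hypothetical table of custom "metacharacters" and the regex each stands for.
# Assumption: each name covers the base mnemonic plus its _0.._3 short forms.
MACROS = {
    "ISTORE": r"ISTORE(?:_[0-3])?\b",
    "ILOAD": r"ILOAD(?:_[0-3])?\b",
}

# Finds macro names that appear as whole words in the user's pattern.
# Because "_" counts as a word character, a literal ISTORE_0 is left untouched.
_MACRO_RE = re.compile(r"\b(" + "|".join(map(re.escape, MACROS)) + r")\b")


def expand(pattern):
    """Replace every macro name in the pattern with its regex expansion."""
    # A function is used as the replacement so backslashes are inserted verbatim.
    return _MACRO_RE.sub(lambda m: "(?:" + MACROS[m.group(1)] + ")", pattern)


def compile_bytecode_pattern(pattern):
    """Expand the macros, then compile with the standard regex engine."""
    return re.compile(expand(pattern))


# Usage: find a load immediately followed by a store.
matcher = compile_bytecode_pattern(r"ILOAD ISTORE")
found = matcher.findall("ILOAD_1 ISTORE_2 ILOAD ISTORE_3")
```

The trailing \b in each expansion keeps ISTORE from matching as a prefix of something like ISTORE_4. This approach leaves the engine itself unchanged, which is usually less work than adding real metacharacters, but it does mean that error messages and match offsets refer to the expanded pattern rather than the one the user typed.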