I have a pattern:
patternDel = "( \\((MoM|QoQ)\\))";
And I want to delete all rows in pandas dataframe where column df['Event Name'] matches this pattern. Which is the best way to do it? There are more than 100k rows in dataframe.
str.contains() returns a Series of booleans that we can use to index our frame
patternDel = "( \\((MoM|QoQ)\\))"
filter = df['Event Name'].str.contains(patternDel)
I tend to keep the things we want as opposed to delete rows. Since filter represents things we want to delete we use ~ to get all the rows that don't match and keep them
df = df[~filter]
"( \((MoM|QoQ)\))"unless they are raw strings. It would be helpful to include a sample of the data in order to get the best answer.