It's a very interesting question and I am seeking help of experts to understand more about it and how to do it. I have a DataFrame (which I made while parsing data from Big Iron... still exists). Now I want to remove some rows by using regular expression but don't know how does it work in Pandas.
24 | DRFT.146.856 | Dollar- | (60.00) | DEBITS- | 0.00 | CREDITSDRA- | 0.00
25 | 0616-21.01 | 2407 | WAYZAT | TMCD | JUNE | 16,DRA |2013
26 | AND | CORRECTION |JOURNAL00 | <DB> |KLRETY | CATEGORYDRA- | *
27 | DRFT.146.867 | Dollar- | (200.00) | DEBITS- | 0.00 | CREDITSDRA- | 0.00
28 | DRFT.146.922 | Dollar- | (25.00) |DEBITS- | 0.00 | CREDITSDRA- |0.00
29 | DRFT.146.963 | Dollar- | (100.00) | DEBITS- | 0.00 | CREDITSDRA- | 0.00
30 | DRFT.146.964 | Dollar- | (100.00) | DEBITS- | 0.00 | CREDITSDRA- | 0.00
The row of concern is 25 & 26 where the data is not following any pattern. Any clue.
DataFrame. It looks like certain columns should have easy to check for patterns or a limited set of valid values. As far as you understand this data - what field do you think you can filter by most effectively?