I have a string from one of the log files as below.
pf_string = "2018-02-01 00:54:49,285 [210.67.123.00] [ABC,CDE,sfv4_ABC.,dbPool5,11689563,fp2871,en_US] UNKNOWN-UNKNOWN EVENT-UNKNOWN-UNKNOWN-pc4bcf46t-20180201005446-663570 2994 770 3199 168 26 [Mozilla/5.0 (Windows NT 6.1; WOW64; Trident/7.0; ABC-IE11; rv:11.0) like Gecko] 3677610951-0 PERFORMANCE PM_REVIEW FORM_DETAIL [[95211KB 480ms 460ms 20ms 212KB 0KB 118KB 57KB 0 0 ]] 74 139 - - - - "
Now I want to extract a pattern like below:
Module_id -> PERFORMANCE
Page Name -> PM_REVIEW
Page Qualifier -> FORM_DETAIL
Here is a regular expression like below:
perfLogPatternPage = re.compile('(?P<module_id>\w+)\s(?P<page_name>\w+)\s(?P<page_qualifier>\w+)\s\[\[')
print perfLogPatternPage.match(pf_string).group('module_id')
print perfLogPatternPage.match(pf_string).group('page_name')
print perfLogPatternPage.match(pf_string).group('page_qualifier')
But this doesn't seem to work and give the right result.
Can someone suggest what's wrong?