python - regex with quotes inside of the string

Question

--

Hi everyone,

I need a hand for the following regex. The string is something like:

str = 'value=\"20\" />\r\n\t\r\n<\/div>","whatiwant":"<div id=\"whatiwant\">\r\n\t\r\n\t\t<\/div>","idontwanthat":"<div id=\"idontwanthat\">\r\n\t\r\n\t blablalblalblalbla \t\r\n\t\t\t<\/div>"'

I would like the entire div of "whatiwant". I tried the following:

matches=re.findall(r'\"whatiwant\":\"(.+?)\":\"',mstr)

ps: i can have other div in the div.

Any help with me appreciated

An html parser would be more suitable for this. Is this really your string or a part of a web page? — Jerry
– Jerry, Commented Sep 19, 2014 at 9:43
Hi jerry, i know but the string is not suitable for an html parser. i will use one for the div that i want — John Smith
– John Smith, Commented Sep 19, 2014 at 9:45

Aran-Fey · Accepted Answer · 2014-09-19 10:34:14Z

1

"whatiwant":"(.*?[^\\])??"

This will match the literal "whatiwant": and then anything (even an empty string) inside double quotes "".

If you want to extract the div's html code, you can retrieve the first group's value:

matches=re.findall(r'"whatiwant":"(.*?[^\\])??"', mstr)
for match in matches:
    html= match.group(1)

edited Sep 19, 2014 at 10:34

answered Sep 19, 2014 at 9:59

Aran-Fey

44k13 gold badges113 silver badges161 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Kamehameha · Accepted Answer · 2014-09-19 09:46:30Z

1

Try using a positive lookahead -

\"whatiwant\":.*(?=,\".*?\"\:)

DEMO

answered Sep 19, 2014 at 9:46

Kamehameha

5,4781 gold badge25 silver badges31 bronze badges

Collectives™ on Stack Overflow

python - regex with quotes inside of the string

2 Answers 2

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related