Python string extraction

Question

In the following string how to get the id after the directory media and after getting the id ignore the rest of the string to read only the id numbers

id_arr= ["/opt/media/12/htmls","/opt/media/24/htmls","/opt/media/26/htmls","/opt/media/56/htmls"]

The output should be 12 24 26 56

Sven Marnach · Accepted Answer · 2010-12-09 09:08:35Z

3

If the strings always look the way you said, try

ids = [int(s.split("/")[3]) for s in id_arr]

answered Dec 9, 2010 at 9:08

Sven Marnach

608k123 gold badges966 silver badges865 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Frédéric Hamidi Over a year ago

Maybe use os.path.sep instead of a plain /, just in case?

Sven Marnach Over a year ago

Not sure if these are paths or parts of URLs.

Tim Pietzcker · Accepted Answer · 2010-12-09 09:12:06Z

3

>>> import re
>>> myre = re.compile("^.*/media/(\d+)")
>>> for item in id_arr:
...     print (myre.search(item).group(1))
...
12
24
26
56

answered Dec 9, 2010 at 9:12

Tim Pietzcker

337k59 gold badges520 silver badges572 bronze badges

1 Comment

Wang Over a year ago

Why the downvote? Yeah, regexes are bit overkill, but they satisfy the requirement that we're extracting the numerical ID after the media directory. No other solution considers the possibility that it might not be the third field (not even Puller's, since it assumes the prefix is /opt/media).

Jan B. Kjeldsen · Accepted Answer · 2010-12-09 09:11:16Z

0

[x.split('/')[3] for x in id_arr]

answered Dec 9, 2010 at 9:11

Jan B. Kjeldsen

18.1k5 gold badges36 silver badges51 bronze badges

Comments

Wang · Accepted Answer · 2010-12-09 09:12:57Z

0

The correct way probably involves some clever use of the os.path module, but for the input given, just use a regex for media\/([0-9]+) and extract the first group.

answered Dec 9, 2010 at 9:12

Wang

3,3531 gold badge27 silver badges33 bronze badges

Comments

Vincenzo Pii · Accepted Answer · 2010-12-09 09:20:15Z

0

parts = "/opt/media/12/htmls","/opt/media/24/htmls","/opt/media/26/htmls","/opt/media/56/htmls"
for str in parts:
    print str.split("/")[3]

EDIT: unuseful rpartition() removed

edited Dec 9, 2010 at 9:20

answered Dec 9, 2010 at 9:11

Vincenzo Pii

20.1k9 gold badges42 silver badges50 bronze badges

Collectives™ on Stack Overflow

Python string extraction

5 Answers 5

2 Comments

1 Comment

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

2 Comments

1 Comment

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related