Get floating number match on regex using python

Question

String:

"Roaming Calls, 1.5 GB/Day 100 SMS/Day"
"Unlimited Loc/STD/Roaming Calls, 1GB/Day"

I want to get the "1.5" and the "1" with a regex.

I used r'.*([0-9.]+)(gb|GB| gb| GB)', but for the first string it captures only "5".

.* is greedy: it grabs as much as it can and backtracks only as far as it has to. From what I see in your question, it is not needed at all. You should drop it and use only (\d+(?:\.\d+)?)\s?(gb|GB).
– PJProudhon, Commented Feb 7, 2018 at 5:56
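
A quick way to try the pattern from this comment on both strings, as a minimal sketch. The helper name extract_gb is made up for illustration, and re.search is used because the pattern no longer starts with .*:

```python
import re

# Pattern from the comment: an integer or decimal, an optional space, then the unit.
GB_PATTERN = re.compile(r'(\d+(?:\.\d+)?)\s?(gb|GB)')

def extract_gb(text):
    """Return the number in front of gb/GB as a string, or None (hypothetical helper)."""
    m = GB_PATTERN.search(text)
    return m.group(1) if m else None

samples = [
    "Roaming Calls, 1.5 GB/Day 100 SMS/Day",
    "Unlimited Loc/STD/Roaming Calls, 1GB/Day",
]
results = [extract_gb(s) for s in samples]
print(results)  # ['1.5', '1']
```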

Yung · Accepted Answer · 2018-02-07 07:20:47Z

2

Use a lookahead after the match to locate the number that comes right before the string GB/Day (upper or lower case, with or without a leading space): (?= GB/Day)

[\d.]+(?= GB/Day|GB/Day| gb/day|gb/day)
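
In Python the same lookahead idea could look like the sketch below. Folding the four alternatives into an optional space plus re.IGNORECASE is my own shortening, not part of the pattern above. Note that [\d.]+ would also accept something like "1.2.3", so it assumes the input is well formed.

```python
import re

# Lookahead: match the digits/dots only when GB/Day follows, without consuming it.
# The optional space and IGNORECASE stand in for the spelled-out alternatives (assumption).
pattern = re.compile(r'[\d.]+(?= ?GB/Day)', re.IGNORECASE)

samples = [
    "Roaming Calls, 1.5 GB/Day 100 SMS/Day",
    "Unlimited Loc/STD/Roaming Calls, 1GB/Day",
]
found = [pattern.search(s).group(0) for s in samples]
print(found)  # ['1.5', '1']
```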



Vaibhav Patil · Accepted Answer · 2018-02-07 05:56:44Z

0

Please try this regex. It matches the non-space characters before GB:

r'\S+(?=\s*(GB|gb|Gb|gB))'
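
One thing to watch when using this pattern from Python: it contains a capturing group inside the lookahead, so re.findall returns the unit rather than the number. A small sketch of the difference follows; the non-capturing variant with re.IGNORECASE is my own assumption, not part of the answer.

```python
import re

samples = [
    "Roaming Calls, 1.5 GB/Day 100 SMS/Day",
    "Unlimited Loc/STD/Roaming Calls, 1GB/Day",
]

original = r'\S+(?=\s*(GB|gb|Gb|gB))'
# Variant without a capturing group (assumption): IGNORECASE covers all four spellings.
variant = re.compile(r'\S+(?=\s*GB)', re.IGNORECASE)

# search().group(0) is the whole match, i.e. the text in front of the unit.
via_search = [re.search(original, s).group(0) for s in samples]

# findall() returns the captured group when the pattern has one, so we get the unit.
via_findall = re.findall(original, samples[0])

# With no capturing group, findall() returns the numbers themselves.
via_variant = [variant.findall(s) for s in samples]

print(via_search)   # ['1.5', '1']
print(via_findall)  # ['GB']
print(via_variant)  # [['1.5'], ['1']]
```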


Tim Biegeleisen · Accepted Answer · 2018-02-07 05:52:37Z

0

Here is a fix to the immediate problem with your pattern:

import re

text = "Roaming Calls, 1.5 GB/Day 100 SMS/Day"
# The lazy .*? stops at the first position where the number and unit can match.
m0 = re.match(r'.*?([0-9.]+)?(gb|GB| gb| GB)', text)
if m0:
    print("match: ", m0.group(1))

Just make the .* that appears right before the capture group for the number lazy, i.e. write .*? instead.
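
To see the effect side by side, here is a small comparison of the greedy pattern from the question with a lazy version of it (a sketch; the variable names are mine):

```python
import re

text = "Roaming Calls, 1.5 GB/Day 100 SMS/Day"

# Greedy: .* swallows as much as possible, leaving only the "5" for the group.
greedy = re.match(r'.*([0-9.]+)(gb|GB| gb| GB)', text)

# Lazy: .*? consumes as little as possible, so the group gets the whole number.
lazy = re.match(r'.*?([0-9.]+)(gb|GB| gb| GB)', text)

print(greedy.group(1))  # 5
print(lazy.group(1))    # 1.5
```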



Vikas Periyadath · Accepted Answer · 2018-02-07 05:56:07Z

0

For both floats and integers, you can try this:

import re
k = "Roaming Calls, 1.5 GB/Day 100 SMS/Day"
print(re.findall(r"[-+]?\d*\.\d+|\d+",k))

If you want to find only float values, use this:

import re
k = "Roaming Calls, 1.5 GB/Day 100 SMS/Day"
print(re.findall(r"[-+]?\d*\.\d+",k))

The second snippet returns a list of the float numbers in that string, like this:

['1.5']
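
Keep in mind that these patterns look at every number in the string, not just the one before GB. A short sketch of what each returns for both strings from the question, plus a conversion to float (the constant names are mine):

```python
import re

samples = [
    "Roaming Calls, 1.5 GB/Day 100 SMS/Day",
    "Unlimited Loc/STD/Roaming Calls, 1GB/Day",
]

ANY_NUMBER = r"[-+]?\d*\.\d+|\d+"   # decimals first, then plain integers
FLOAT_ONLY = r"[-+]?\d*\.\d+"       # requires a decimal point

any_hits = [re.findall(ANY_NUMBER, s) for s in samples]
float_hits = [re.findall(FLOAT_ONLY, s) for s in samples]

print(any_hits)    # [['1.5', '100'], ['1']]
print(float_hits)  # [['1.5'], []]

# findall returns strings; convert them if numeric values are needed.
as_floats = [[float(n) for n in hits] for hits in any_hits]
print(as_floats)   # [[1.5, 100.0], [1.0]]
```

So the first pattern also picks up the SMS count, and the float-only pattern finds nothing in the "1GB/Day" string.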


deathangel908 · Accepted Answer · 2018-02-07 06:03:35Z

0

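The issue is that .* matches as much as it can and leaves only one character for [0-9.]+. You can replace it with .? so it won't be that greedy (since .? consumes at most one character, use this version with re.search rather than re.match):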

.?([0-9.]+)(gb|GB| gb| GB)
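
A small check of how this pattern behaves in Python, as a sketch. Because .? matches at most one character, re.match (anchored at the start of the string) finds nothing here, while re.search scans forward and picks up the number:

```python
import re

text = "Roaming Calls, 1.5 GB/Day 100 SMS/Day"
pattern = r'.?([0-9.]+)(gb|GB| gb| GB)'

# Anchored at position 0: .? cannot skip over "Roaming Calls, ".
anchored = re.match(pattern, text)

# Scans the string for the first place where the pattern fits.
scanned = re.search(pattern, text)

print(anchored)          # None
print(scanned.group(1))  # 1.5
```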

