Python multiple substring index in string

Question

Given the following list of sub-strings:

sub = ['ABC', 'VC', 'KI']

is there a way to get the index of these sub-string in the following string if they exist?

s = 'ABDDDABCTYYYYVCIIII'

so far I have tried:

for i in re.finditer('VC', s):
  print(i.start, i.end)

However, re.finditer does not take multiple arguments.

thanks

@Pingu find() returns only the first substring index, they seem to need all of matches. — bereal
– bereal, Commented Jan 17, 2023 at 10:37
@bereal So? If find returns >= 0 just try again at an appropriate offset — jackal
– jackal, Commented Jan 17, 2023 at 10:44

bereal · Accepted Answer · 2023-01-17 10:42:28Z

2

You can join those patterns together using |:

import re
sub = ['ABC', 'VC', 'KI']
s = 'ABDDDABCTYYYYVCIIII'

r = '|'.join(re.escape(s) for s in sub)
for i in re.finditer(r, s):
    print(i.start(), i.end())

answered Jan 17, 2023 at 10:42

bereal

34.7k8 gold badges65 silver badges111 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

jackal · Accepted Answer · 2023-01-17 11:19:06Z

1

A substring may occur more than once in the main string (although it doesn't in the sample data). One could use a generator based around a string's built-in find() function like this:

note the source string has been modified to demonstrate repetition

sub = ['ABC', 'VC', 'KI']
s = 'ABCDDABCTYYYYVCIIII'

def find(s, sub):
    for _sub in sub:
        offset = 0
        while (idx := s[offset:].find(_sub)) >= 0:
            yield _sub, idx + offset
            offset += idx + 1

for ss, start in find(s, sub):
    print(ss, start)

Output:

ABC 0
ABC 5
VC 13

answered Jan 17, 2023 at 11:19

jackal

29.1k3 gold badges9 silver badges27 bronze badges

Comments

bn_ln · Accepted Answer · 2023-01-17 10:45:13Z

0

You could map over the find string method.

s = 'ABDDDABCTYYYYVCIIII'
sub = ['ABC', 'VC', 'KI']

print(*map(s.find, sub))
# Output 5 13 -1

answered Jan 17, 2023 at 10:45

bn_ln

1,6931 gold badge9 silver badges13 bronze badges

1 Comment

Gábor Fekete Over a year ago

using dict(zip(sub,map(s.find, sub))) it creates a dict with the substrings as keys and the indices as values.

Triet Doan · Accepted Answer · 2023-01-17 10:53:12Z

0

How about using list comprehension with str.find?

s = 'ABDDDABCTYYYYVCIIII'
sub = ['ABC', 'VC', 'KI']
results = [s.find(pattern) for pattern in sub]

print(*results) # 5 13 -1

answered Jan 17, 2023 at 10:53

Triet Doan

12.2k9 gold badges42 silver badges76 bronze badges

Comments

Gábor Fekete · Accepted Answer · 2023-01-17 11:07:25Z

0

Another approach with re, if there can be multiple indices then this might be better as the list of indices is saved for each key, when there is no index found, the substring won't be in the dict.

import re
s = 'ABDDDABCTYYYYVCIIII'
sub = ['ABC', 'VC', 'KI']

# precompile regex pattern
subpat = '|'.join(sub)
pat = re.compile(rf'({subpat})')

matches = dict()
for m in pat.finditer(s):
    # append starting index of found substring to value of matched substring
    matches.setdefault(m.group(0),[]).append(m.start()) 

print(f"{matches=}")
print(f"{'KI' in matches=}")
print(f"{matches['ABC']=}")

Outputs:

matches={'ABC': [5], 'VC': [13]}
'KI' in matches=False
matches['ABC']=[5]

answered Jan 17, 2023 at 11:07

Gábor Fekete

1,3588 silver badges17 bronze badges

Comments

Hari · Accepted Answer · 2023-01-17 11:36:17Z

0

Just Use String index Method

list_ = ['ABC', 'VC', 'KI']

s = 'ABDDDABCTYYYYVCIIII'


for i in list_:
    if i in s:
        print(s.index(i))

answered Jan 17, 2023 at 11:36

Hari

477 bronze badges

Collectives™ on Stack Overflow

Python multiple substring index in string

6 Answers 6

Comments

Comments

1 Comment

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

Comments

Comments

1 Comment

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related