How to remove characters from a string in a file starting from a given substring until the next n number of characters in Linux?

Question

I have a file that contains a single line with usernames followed by their other information.

For instance the file contains:

Clara01{25 characters of info}Betty29{25 characters of info}Edith34{25 characters of info}Raji11{25 characters of info}

All in a single very long line with many usernames followed by 25 characters of their information.

Now I want to search for Betty29 and remove the substring Betty29{25 characters of info}, that is, delete Betty29 together with the 25 characters that follow it. How should I do that in a Linux shell script?

I have read about the sed command but still could not figure it out. I am new to shell scripting, so please be kind.

Go online, read about regular expressions and sed. Also look within SO.
– Xavjer, Commented Jul 24, 2023 at 6:31
Does this answer your question? "delete lines with sed match a special regex"
– Xavjer, Commented Jul 24, 2023 at 6:31
@Xavjer This question isn't about deleting lines, so I'm going to guess that no, it's not a good choice of a dupe.
– Shawn, Commented Jul 24, 2023 at 15:11

Jetchisel · Accepted Answer · 2023-07-25 07:30:16Z

1

As @Shawn suggested

sed 's/Betty29.\{25\}//' foo.txt

is getting the job done.
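To make the pattern easier to see in action, here is a minimal sketch that wraps the same sed substitution in a small function. The helper name `remove_record`, the variables `info_a`, `info_b`, `info_c`, and the sample data are made up for illustration and are not from the thread. It assumes the user name contains no regex metacharacters and no slashes.

```shell
# Hypothetical helper (not from the thread): read one line on stdin and
# remove the first occurrence of a user name plus the next N characters.
# Assumes the name holds no regex metacharacters and no '/'.
remove_record() {
  local user=$1 count=$2
  # .\{N\} is a POSIX basic-regex interval: exactly N of any character.
  sed "s/${user}.\{${count}\}//"
}

# Build three 25-character filler strings instead of typing them by hand.
info_a=$(printf '%25s' '' | tr ' ' 'A')
info_b=$(printf '%25s' '' | tr ' ' 'B')
info_c=$(printf '%25s' '' | tr ' ' 'C')

sample="Clara01${info_a}Betty29${info_b}Edith34${info_c}"
result=$(printf '%s\n' "$sample" | remove_record Betty29 25)

# Betty29 and her 25 characters are gone; the other records are untouched.
printf '%s\n' "$result"
```

Without a trailing `g` flag the `s` command removes only the first match on the line. The command in the answer prints the result to standard output and leaves foo.txt unchanged; with GNU sed, the `-i` option should write the change back to the file.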

Thanks all for the help!

answered Jul 25, 2023 at 7:08 by Toohina Barua; edited Jul 25, 2023 at 7:30 by Jetchisel


2 Comments

Shawn Over a year ago

Have you read up enough on (POSIX Basic) regular expressions to see how it works?

Toohina Barua Over a year ago

@Shawn I did read about it. I have a basic idea...

Andrej Podzimek · Accepted Answer · 2023-07-24 12:46:38Z

Use readarray -d'}' to access the file as an array.
Search for an element starting with Betty29 and unset that element.
Then printf '%s' the whole "${array[@]}" as an output.

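unset_element() {
  local -r prefix="$1"
  local -a array
  local -i idx
  # Split stdin on '}'; each element keeps its trailing '}'.
  readarray -d'}' array
  # Drop the last element: with a here-string it holds only the final newline.
  unset 'array[-1]'
  for idx in "${!array[@]}"; do
    # Remove every record whose text starts with "<prefix>{".
    [[ "${array[idx]}" = "${prefix}{"* ]] && unset 'array[idx]' || :
  done
  printf '%s' "${array[@]}"
}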

Now let’s test it:

input='Clara01{25 characters of info}'
input+='Betty29{25 characters of info}'
input+='Edith34{25 characters of info}'
input+='Raji11{25 characters of info}'

unset_element 'Betty29' <<< "$input"

Output:

Clara01{25 characters of info}Edith34{25 characters of info}Raji11{25 characters of info}

Presumably, this removes all occurrences of Betty29. If you want to remove only the first one and make it “more efficient”, just add a break into the for-loop once the match is found.
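The first-match-only variant mentioned above could look roughly like this. It is a sketch, not the answerer's code: the function name `unset_first_element` and the sample input are made up for illustration, and `readarray -d` needs bash 4.4 or newer.

```shell
# Hypothetical variant (not from the thread): remove only the first record
# whose text starts with "<prefix>{", then stop scanning.
unset_first_element() {
  local -r prefix="$1"
  local -a array
  local -i idx
  # Split stdin on '}'; each element keeps its trailing '}'.
  readarray -d'}' array
  for idx in "${!array[@]}"; do
    if [[ "${array[idx]}" = "${prefix}{"* ]]; then
      unset 'array[idx]'
      break   # leave any later records with the same prefix in place
    fi
  done
  # The trailing newline element from the here-string is kept on purpose,
  # so the output ends with a newline.
  printf '%s' "${array[@]}"
}

input='Clara01{info}Betty29{first}Edith34{info}Betty29{second}'
result=$(unset_first_element 'Betty29' <<< "$input")
printf '%s\n' "$result"
```

Only the `Betty29{first}` record is removed here; `Betty29{second}` stays.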

user1934428 · Accepted Answer · 2023-07-25 07:58:19Z

0

I wouldn't create a child process (sed or whatever), if I can do it easily within bash as well.

Say we have

line=abcBetty29abcdefghijklmnopqrstuvwxyz

Then we can do

new_line=${line/Betty29?????????????????????????/}

Since counting the 25 question marks is error-prone, an alternative would be to use a regex:

if [[ $line =~ ^(.*)Betty29.{25}(.*) ]]
then
  new_line=${BASH_REMATCH[1]}${BASH_REMATCH[2]}
else
  echo pattern not found 1>&2
fi
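If you prefer to stay with the parameter-expansion form, the question marks can also be generated instead of typed. This is a minimal sketch assuming bash; the variable name `wildcards` is made up for illustration.

```shell
line=abcBetty29abcdefghijklmnopqrstuvwxyz

# Build a string of 25 spaces, then turn each space into a '?' wildcard.
printf -v wildcards '%*s' 25 ''
wildcards=${wildcards// /?}

# Left unquoted, $wildcards is treated as a glob pattern, so each '?'
# matches exactly one character. The empty replacement deletes the match.
new_line=${line/Betty29$wildcards/}

printf '%s\n' "$new_line"   # prints: abcz
```

The 25 letters a through y after Betty29 are removed along with the name, which leaves the leading abc and the final z.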

answered Jul 25, 2023 at 7:58 by user1934428
