Skip to main content
Filter by
Sorted by
Tagged with
0 votes
1 answer
94 views

What are the details of your problem? I am a teacher and I want to use Python to create a worksheet for my students. I have a vocabulary PDF with content like this: do your best duː jɔː best 33, 81 do ...
Hưng Phạm Đăng's user avatar
0 votes
0 answers
100 views

ASP.NET Core 9 MVC / C# controller extracts texts from invoices using pdfpig based on code in answer How to group text to lines if there is small difference in Y position. Invoices can have multiple ...
Andrus's user avatar
  • 28.2k
0 votes
0 answers
42 views

I'm trying to use RegexBuilder/Swift to write a Swift method that extracts for example lists enclosed by <ul> and </ul> from an HTML-string. In this example let htmlText = ""&...
Patrick Bauers's user avatar
1 vote
1 answer
82 views

How do I get rid of non-printing (escaped) characters from the output of the nltk.word_tokenize method? I am working through the book 'Natural Language Processing with Python' and am following the ...
green_ruby's user avatar
2 votes
2 answers
97 views

raw txt file contains these lines: cat raw.txt ID DESCRIPTION ----- -------------- 2 item2 4 item4 1 item1 3 item3 How can reorder it by ID as ...
showkey's user avatar
  • 375
3 votes
1 answer
149 views

Note that this is the follow up question of Parse text file, change some strings to camel case, add other strings . The parsing rules are similar but different: The input order in the output is ...
albertkao9's user avatar
2 votes
1 answer
96 views

Problem: I have a two-step bioinformatics pipeline where: Code 1 generates output files (.marked.bam) and places them into a directory structure. Code 2 processes annotated files (annotated....
Melissa's user avatar
  • 59
-5 votes
1 answer
120 views

I've encountered an issue in python. I have a string that contains both a message and code, and I need to separate them and pass each to different functions. An example: text = """ Can ...
Callme-Milad's user avatar
0 votes
0 answers
140 views

I'm using Microsoft Presidio for text analysis and anonymization, and I have a configuration file (all-config.yml) to specify recognizers, including some deny lists with accented characters. However, ...
Sergiy Polovynko's user avatar
-1 votes
1 answer
43 views

I have a single blog post with a title and description, and I want to compare its uniqueness against multiple blog entries in a CSV file. The CSV contains several blogs, each with a title and meta ...
Muhammad Hamd's user avatar
-2 votes
2 answers
133 views

I have a relatively complicated Bash problem. I have a two-column CSV file that contains duplicate values in the first column, as well as duplicates within those duplicate values (in the second column)...
Hashim Aziz's user avatar
  • 6,576
0 votes
2 answers
703 views

I have in github actions a list of words with spaces in the shell Bash. e.g. hello1 hello2 hello3 The goal is to convert this list to a JSON array format, write it to the output variable and use it ...
user avatar
1 vote
1 answer
74 views

I am parsing some game config files using Python and putting it all in dictionaries. It seemed to all work well until I encountered the following edge-case: random_owned_controlled_state = { ...
lrdewaal's user avatar
  • 360
0 votes
0 answers
554 views

I want to mask all types of sensitive data (usernames, passwords, api keys, DB connection strings, endpoints, secrets, and even any custom variables containing secrets) present in a flat log file. ...
Navdeep Singh's user avatar
0 votes
1 answer
124 views

I got a list of sentences (roughly 20000) stored in excel file named list.xlsx and sheet named Sentence under column name named Sentence. My intention is to get words from user and return those ...
Programmer_nltk's user avatar
1 vote
1 answer
802 views

I would like to compress the space for a json file by printing in compact mode (-c) but I want to add a new line after each root-level object. for example, for the below object { "a": { ...
FangQ's user avatar
  • 1,564
0 votes
1 answer
67 views

I use the following code to request data from etherscan API from etherscan import Etherscan eth = Etherscan("API_KEY_HERE") # key in quotation marks response = eth.get_normal_txs_by_address(...
pwrsheller's user avatar
-1 votes
1 answer
978 views

I have a CSV file with each row representing the different components of an address, such as City, Street, House No, etc., and then a column with a combined address in one line, with a predefined ...
Haris Munawar's user avatar
-4 votes
1 answer
82 views

Request: Extract blocks of text that contain 2 or more search terms, something akin to [ AND ] logical operator in [ awk ]. Preferably run as awk in bash/zsh function (but also ok with standalone awk ...
hbendi's user avatar
  • 11
1 vote
1 answer
58 views

As a complete preface, I am a beginner and learning. But, here's the sample schema of my products review table. Record_ID Product_ID Review Comment 1234 89847457 I love this product it was shipped ...
user14452102's user avatar
-2 votes
1 answer
44 views

This is a list, each elements consists of two strings and a "/t" between.We can call the string on the left "label" and the part on the right "text". continued In the ...
Haochuan Liu's user avatar
-3 votes
3 answers
495 views

I have a text file, where each line may start with a number of tabs, including no tabs. For example, the first line starts with no tab, the second line with 1 tab, and the third line with 2 tabs: ...
Tim's user avatar
  • 101k
1 vote
1 answer
294 views

I need to perform the local alignment of a given region of a DNA sequence for which a global alignment was made, and update the corresponding part of the global CIGAR string. The steps would be as ...
Fravadona's user avatar
  • 17.6k
1 vote
1 answer
132 views

I need to extract 1,500,000 out of 25,000,000 records and group them. The groups and the UUIDs of the records to extract are defined in a separate file (200MB) with the following format: >Cluster 0 ...
Fravadona's user avatar
  • 17.6k
1 vote
1 answer
90 views

I probably need to use other data structure, but I'm stuck with this solution for now. Will appreciate any advice on this. For now I have this data structure: const data = [ { id: 'node-1',...
Andrew Rusinas's user avatar

1
2 3 4 5
40