Python text file into directory list

Question

I needed help to add into directory below text file. Can anyone can help me to do it? I tried
I have data.txt like below:?

A1234 161
A1234 106
A456  185
A456  108
037   125

**Output:**
directory = {
"A1234": [161,106],
"A456": [185,108],
"037": [125],
}

Thank you in advance for your help.

Stijn B · Accepted Answer · 2022-03-04 14:39:57Z

1

data.txt:

From file to dictionary:

with open('data.txt', 'r') as file:
    data_lines = file.readlines()

directory = {}

for line in data_lines:
    a, *b = line.split()
    # convert all elements of b into integers:
    b = [int(item) for item in b]
    if directory.get(a, False):
        if isinstance(b, list):
            directory[a].extend(b)
        else:
            directory[a].append(b)
    else:
        directory[a] = list(b)

print(directory)
# {'A1234': [161, 106], 'A456': [185, 108], '037': [125]}
# prettified:
"""
{
    'A1234': [161, 106],
    'A456': [185, 108],
    '037': [125]
}
"""

edited Mar 4, 2022 at 14:39

answered Mar 2, 2022 at 22:14

Stijn B

3602 silver badges14 bronze badges

Sign up to request clarification or add additional context in comments.

12 Comments

Shailesh Patel Over a year ago

For above file your code works fine. Thank you. But when I use other file which has 9375 rows of data, Its throw error like "ValueError: too many values to unpack (expected 2)".

Shailesh Patel Over a year ago

I'm trying to do ICD mapping file dynamically added into directory.

Stijn B Over a year ago

Ok, I think ValueError: too many values to unpack (expected 2) has nothing to do with the amount of data. It has to do with the line a, b = line.split() : it's unpacking the line split (a list) in two variables. Does your data file contain lines with more than 2 'words' ? (I guess so. If so, show me an exemple I will adapt the code) See more info here

Stijn B Over a year ago

I tested the code with the example input data just by changing the first line to A1234 161 55 and that's throwing the same error. I edited my answer, now it should work fine with any amount of 'words' or 'numbers' per line :)

Shailesh Patel Over a year ago

Awesome, Stijn B Sir, It works. Thank you.

|

Hussain Bohra · Accepted Answer · 2022-03-02 22:14:16Z

0

Try this:

output_dict = {}
data = list(map(
    lambda a:a.strip().split(),
    open("data.csv").readlines()
))
for k,v in data:
    try:
        output_dict[k].append(v)
    except:
        output_dict[k] = [v]
output_dict

Output:

{'A1234': ['161', '106'], 'A456': ['185', '108'], '037': ['125']}

answered Mar 2, 2022 at 22:14

Hussain Bohra

1,00510 silver badges15 bronze badges

2 Comments

Matthew Borish Over a year ago

This works, but using a bare exception clause is generally considered to be a bad practice. See here for more. stackoverflow.com/questions/14797375/…

Shailesh Patel Over a year ago

For above file your code works fine. Thank you. But when I use other file which has 9375 rows of data, Its throw error like "ValueError: too many values to unpack (expected 2)".

Matthew Borish · Accepted Answer · 2022-03-02 23:12:48Z

0

Here's a pandas solution. First we employ read_csv() and use one or more spaces as our delimiter. We need to specify string (object) dtypes to get string values for the list items as in your output. If you want ints, you could skip the dtype argument and pandas will infer them. Next we groupby the fist column (0) and convert the values in column 1 to a list with apply. Finally, we use .to_dict to get a dictionary.

import pandas as pd 

df = pd.read_csv('dir.txt', header=None, sep=r"[ ]{1,}", dtype='object')

directory = df.groupby(0)[1].apply(list).to_dict()

output:

{'037': ['125'], 'A1234': ['161', '106'], 'A456': ['185', '108']}

edited Mar 2, 2022 at 23:12

answered Mar 2, 2022 at 22:16

Matthew Borish

3,1162 gold badges18 silver badges29 bronze badges

2 Comments

Shailesh Patel Over a year ago

For above file your code works fine. Thank you. But when I use other file which has 9375 rows of data, Its throw error like "ValueError: too many values to unpack (expected 2)".

Matthew Borish Over a year ago

You likely have some irregularities in your .txt file, but it's tricky to account for them without seeing the full data. Can you post a larger sample?

Collectives™ on Stack Overflow

Python text file into directory list

3 Answers 3

12 Comments

2 Comments

2 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

12 Comments

2 Comments

2 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related