How to open multiple csv files from a folder in python?

Question

I want to open multiple csv files in python, collate them and have python create a new file with the data from the multiple files reorganised...

Is there a way for me to read all the files from a single directory on my desktop and read them in python like this?

Thanks a lot

I tried specifying a path and writing for item in path: open etc etc but then it said it couldn't find one of the files and threw Errno 2 — I. Evans
– I. Evans, Commented Nov 7, 2018 at 12:26
If you show us the code along with the full error, we can help you to fix it. But I cannot fix an error in a code I cannot see... Anyway, this site has rules or usages for what is expected for a good question: you really should read How to Ask. — Serge Ballesta
– Serge Ballesta, Commented Nov 7, 2018 at 12:30

Matt Pitkin · Accepted Answer · 2018-11-07 13:10:44Z

If you a have a directory containing your csv files, and they all have the extension .csv, then you could use, for example, glob and pandas to read them all in and concatenate them into one csv file. For example, say you have a directory, like this:

csvfiles/one.csv
csvfiles/two.csv

where one.csv contains:

name,age

Keith,23
Jane,25

and two.csv contains:

name,age

Kylie,35
Jake,42

Then you could do the following in Python (you will need to install pandas with, e.g., pip install pandas):

import glob
import os
import pandas as pd

# the path to your csv file directory
mycsvdir = 'csvdir'

# get all the csv files in that directory (assuming they have the extension .csv)
csvfiles = glob.glob(os.path.join(mycsvdir, '*.csv'))

# loop through the files and read them in with pandas
dataframes = []  # a list to hold all the individual pandas DataFrames
for csvfile in csvfiles:
    df = pd.read_csv(csvfile)
    dataframes.append(df)

# concatenate them all together
result = pd.concat(dataframes, ignore_index=True)

# print out to a new csv file
result.to_csv('all.csv')

Note that the output csv file will have an additional column at the front containing the index of the row. To avoid this you could instead use:

result.to_csv('all.csv', index=False)

You can see the documentation for the to_csv() method here.

Hope that helps.

ASH · Accepted Answer · 2018-11-07 15:25:52Z

1

Here is a very simple way to do what you want to do.

import pandas as pd
import glob, os

os.chdir("C:\\your_path\\")
results = pd.DataFrame([])

for counter, file in enumerate(glob.glob("1*")):
    namedf = pd.read_csv(file, skiprows=0, usecols=[1,2,3])
    results = results.append(namedf)

results.to_csv('C:\\your_path\\combinedfile.csv')

Notice this part: glob("1*")

This will look only for files that start with '1' in the name (1, 10, 100, etc). If you want everything, change it to this: glob("*")

Sometimes it's necessary to merge all CSV files into a single CSV file, and sometimes you just want to merge some files that match a certain naming convention. It's nice to have this feature!

answered Nov 7, 2018 at 15:25

ASH

20.5k28 gold badges115 silver badges247 bronze badges

Comments

Arthur Lopes · Accepted Answer · 2022-08-01 16:42:21Z

I know that the post is a little bit old, but using Glob can be quite expensive in terms of memory if you are trying to read large csv files, because you will store all that data into a list in then you'll still have to have enough memory to concatenate the dataframes inside that list into a dataframe with all the data. Sometimes this is not possible.

dir = 'directory path'
df= pd.DataFrame()

for i in range(0,24):
  csvfile = pd.read_csv(dir+'/file name{i}.csv'.format(i), encoding = 'utf8')
  df = df.append(csvfile)
  del csvfile

So, in case your csv files have the same name and have some kind of number or string that differentiates them, you could just do a for loop trough the files and delete them after they are stored in a dataframe variable using pd.append! In this case all my csv files have the same name except they are numbered in a range that goes from 0 to 23.

Collectives™ on Stack Overflow

How to open multiple csv files from a folder in python?

3 Answers 3

Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related