How to count the number of files in a directory using Python

Question

How do I count only the files in a directory? This counts the directory itself as a file:

len(glob.glob('*'))

To leave out directories, you can use '*.fileextension' for whatever file extension you are looking for.
– user2891129, Commented Mar 24, 2018 at 2:28

Bruno Bronosky · Accepted Answer · 2014-09-15 22:06:40Z

429

os.listdir() will be slightly more efficient than using glob.glob. To test if a filename is an ordinary file (and not a directory or other entity), use os.path.isfile():

import os

# Simple version for the current working directory
print(len([name for name in os.listdir('.') if os.path.isfile(name)]))

# Path-joining version for other paths
DIR = '/tmp'
print(len([name for name in os.listdir(DIR) if os.path.isfile(os.path.join(DIR, name))]))

edited Sep 15, 2014 at 22:06

Bruno Bronosky

answered Apr 13, 2010 at 18:43

Daniel Stutzbach

11 Comments

Rafael Oliveira Over a year ago

Remember to join the folder path to the name inside os.path.isfile() if you're not in the cwd. stackoverflow.com/questions/17893542/…

Joel B Over a year ago

For recursively counting files nested inside directories, you might be better off with the os.walk() solution.

HelloGoodbye Over a year ago

What is the benefit of using os.path.join(DIR, name) over DIR + '/' + name? The latter is shorter and, IMO, clearer than the former. Are there perhaps some OSes on which the latter would fail?

ellockie Over a year ago

@HelloGoodbye That's exactly the reason.

Brady Huang Over a year ago

For those who use Python 3: print(len(os.listdir('DIRECTORY_PATH')))
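To tie the comments above together, here is a minimal Python 3 sketch of the listdir-plus-isfile approach. The function name count_files is just an illustrative choice. It joins the directory onto each name so the check does not depend on the current working directory. Note that a bare len(os.listdir(...)) would count subdirectories as well.

```python
import os


def count_files(directory="."):
    """Count regular files directly inside `directory` (not recursive)."""
    return sum(
        1
        for name in os.listdir(directory)
        # Join with the directory so the check works outside the CWD too.
        if os.path.isfile(os.path.join(directory, name))
    )


if __name__ == "__main__":
    print(count_files("."))
```

os.path.join picks the right separator for the platform, which is the portability benefit asked about above.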

bryant1410 · Accepted Answer · 2022-04-27 00:51:23Z

186

import os

_, _, files = next(os.walk("/usr/lib"))
file_count = len(files)

edited Apr 27, 2022 at 0:51

bryant1410

answered Nov 29, 2011 at 13:16

Luke

4 Comments

Kyle Bridenstine Over a year ago

This isn't recursive

Fandango68 Over a year ago

The OP didn't ask for it to be recursive

Charlie Parker Over a year ago

Does os.walk not return the names in sorted order?

John Glen Over a year ago

Fails if folder is empty with StopIteration
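A note on the StopIteration case, as a hedged sketch: as far as I can tell, os.walk yields one tuple with empty lists for a directory that exists but is empty, so next() succeeds there. It yields nothing at all when the path does not exist or cannot be listed, because errors are ignored by default, and that is when next() raises StopIteration. Passing a default to next() covers that case. The function name below is only illustrative.

```python
import os


def count_top_level_files(path):
    """Count files directly in `path`; return 0 if the path cannot be walked."""
    # The second argument to next() is returned when os.walk yields nothing,
    # e.g. for a missing directory, instead of raising StopIteration.
    _, _, files = next(os.walk(path), (path, [], []))
    return len(files)
```

Returning 0 for a missing directory is a design choice; if a missing path should be an error, check os.path.isdir(path) first.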

MattDMo · Accepted Answer · 2022-08-12 16:13:02Z

89

For all kinds of entries, subdirectories included:

import os

lst = os.listdir(directory)  # directory is your directory path
number_files = len(lst)
print(number_files)

Only files (excluding subdirectories):

import os

onlyfiles = next(os.walk(directory))[2]  # directory is your directory path as a string
print(len(onlyfiles))

edited Aug 12, 2022 at 16:13

MattDMo

answered Jul 8, 2015 at 15:33

Guillermo Pereira

4 Comments

Kyle Bridenstine Over a year ago

This isn't recursive

Nick Vee Over a year ago

The editing queue is full so... Please, do not use the builtins (list, dir) as a variable name or a placeholder!

MattDMo Over a year ago

@NickVeld FTFY...

John Glen Over a year ago

Fails if folder is empty with StopIteration

ngeek · Accepted Answer · 2013-05-31 20:55:37Z

53

This is where fnmatch comes in very handy:

import fnmatch
import os

print(len(fnmatch.filter(os.listdir(dirpath), '*.txt')))

More details: http://docs.python.org/2/library/fnmatch.html

answered May 31, 2013 at 20:55

ngeek

1 Comment

CivFan Over a year ago

This is much faster (about half the time in my testing on a directory with 10,000 files) if you know the pattern you're looking for, rather than testing each file with os.path.isfile() as the accepted answer does. Also significantly faster than glob.glob().
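A small Python 3 sketch of the pattern-matching approach, wrapped in a function (count_matching is a made-up name). One caveat worth stating: fnmatch only looks at names, so a subdirectory whose name matches the pattern is counted too. That is part of why it is faster than calling os.path.isfile() on every entry.

```python
import fnmatch
import os


def count_matching(directory, pattern="*.txt"):
    """Count directory entries whose names match a shell-style pattern."""
    # fnmatch.filter compares names only; it never touches the filesystem,
    # so it cannot tell files from directories.
    return len(fnmatch.filter(os.listdir(directory), pattern))
```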

Paul · Accepted Answer · 2023-08-12 09:16:36Z

36

An answer with pathlib and without loading the whole list to memory:

from pathlib import Path

path = Path('.')

print(sum(1 for _ in path.glob('*')))  # Files and folders, not recursive
print(sum(1 for _ in path.rglob('*')))  # Files and folders, recursive

print(sum(1 for x in path.glob('*') if x.is_file()))  # Only files, not recursive
print(sum(1 for x in path.rglob('*') if x.is_file()))  # Only files, recursive

edited Aug 12, 2023 at 9:16

answered Aug 7, 2020 at 18:08

Paul

5 Comments

William Le Over a year ago

Best answer by far!

Maximilian Wolf Over a year ago

or just sum(1 for _ in path.iterdir()) or sum(1 for _, x in enumerate(path.iterdir()) if x.is_file()). Not recursive.

supermitch Over a year ago

Instead of path.glob('**/*') use path.rglob('*'), for recursive versions.

Paul Over a year ago

@MaximilianWolf Also an option, but it may have different behavior, so I used glob to have the same behavior for both recursive and non-recursive variants.

ShadowRanger Dec 5, 2024 at 2:56

Sadly, "without loading the whole list to memory" is not the case. The implementation of all the pathlib components that scrape directories are eager in listing the actual directory, using os.listdir (they clearly intended to use os.scandir at one point, but found it caused problems with running out of file handles while recursively traversing deep directory trees, and fixed it by just making all processing eagerly slurp the whole directory so the handle gets closed immediately). So you avoid building a list of 1s or the like here, but the list of pathlib.Paths is always built.
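For reference, the iterdir() variant mentioned in the comments could be wrapped up as below. This is only a sketch with illustrative function names. It sits next to an rglob-based recursive counterpart so the two can be compared.

```python
from pathlib import Path


def count_files(directory):
    """Count regular files directly inside `directory`."""
    return sum(1 for entry in Path(directory).iterdir() if entry.is_file())


def count_files_recursive(directory):
    """Count regular files in `directory` and all of its subdirectories."""
    return sum(1 for entry in Path(directory).rglob("*") if entry.is_file())
```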

Mr_and_Mrs_D · Accepted Answer · 2017-12-22 13:51:12Z

35

If you want to count all files in the directory - including files in subdirectories, the most pythonic way is:

import os

file_count = sum(len(files) for _, _, files in os.walk(r'C:\Dropbox'))
print(file_count)

We use sum, which should be faster than explicitly adding up the file counts (timings pending).

edited Dec 22, 2017 at 13:51

answered Dec 21, 2017 at 17:57

Mr_and_Mrs_D

4 Comments

Ejaz Over a year ago

Hi, I was trying to understand this code (the code works perfect), I know we can use _ in a for loop. os.walk also I know. But not sure what's going on with underscores inside the sum function, could you please elaborate. Thanks!

Mr_and_Mrs_D Over a year ago

Underscore is just a variable name @Ejaz, used by convention when we ignore the variable. That's what we do here: we call walk and only count the number of files in each directory, ignoring the root and dirs values that walk returns.

Pixel78 Over a year ago

This is completely recursive and probably the best answer here.

NoobCat Over a year ago

This should be the most appropriate answer, to also count the files in any subfolders..
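To make the underscore discussion concrete, here is the same recursive count written twice: once as the generator expression from the answer, and once as an explicit loop with all three values named. The function names are illustrative, and both should return the same number.

```python
import os


def count_files_recursive(root):
    """Generator-expression form: ignore dirpath and dirnames with `_`."""
    return sum(len(files) for _, _, files in os.walk(root))


def count_files_recursive_explicit(root):
    """Equivalent explicit loop, naming every value that os.walk yields."""
    total = 0
    for dirpath, dirnames, filenames in os.walk(root):
        # Only the list of file names in this directory matters here.
        total += len(filenames)
    return total
```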

Somyadeep Shrivastava · Accepted Answer · 2020-09-17 16:48:52Z

25

Short and simple

import os
directory_path = '/home/xyz/'
No_of_files = len(os.listdir(directory_path))

answered Sep 17, 2020 at 16:48

Somyadeep Shrivastava

3 Comments

Kshitij Agarwal Over a year ago

Also, no need of directory path if the python file is in the same directory.

David Maddox Over a year ago

Importantly, this works well in Python 3

maciejwww Over a year ago

This solution counts also directories, not only files as it was indicated in question.

bryant1410 · Accepted Answer · 2025-09-25 15:22:39Z

23

I am surprised that nobody mentioned os.scandir:

import os

def count_files(directory):
    return sum(1 for x in os.scandir(directory) if x.is_file())

edited Sep 25 at 15:22

bryant1410

answered May 18, 2017 at 9:24

qed

3 Comments

Aoki Ahishatsu Over a year ago

Works great with Python 3.6!

ShadowRanger Dec 5, 2024 at 2:45

Why make lists, twice, for no benefit? sum(1 for entry in os.scandir(dir) if entry.is_file()) would achieve the same effect without producing two thoroughly unnecessary lists, and thereby keep memory usage constant regardless of directory size.

ShadowRanger Dec 5, 2024 at 2:48

Note: I agree scandir is the way to go. It improves on listdir+os.path.isfile in two ways, 1) It's lazy and therefore works on huge directories efficiently (if you don't listify the result just to iterate it), and 2) The .is_file() call is satisfied from information the directory traversal API gives you for free, without a separate stat call, dramatically reducing the number of system calls and I/O required to complete the counting process.
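A slightly more defensive sketch of the scandir approach, assuming Python 3.6 or newer, where os.scandir() can be used as a context manager. The follow_symlinks parameter on the function is my own addition for illustration; it is simply passed through to DirEntry.is_file().

```python
import os


def count_files(directory, follow_symlinks=True):
    """Count regular files directly inside `directory` using os.scandir."""
    # The with-block closes the directory handle as soon as counting is done.
    with os.scandir(directory) as entries:
        return sum(
            1
            for entry in entries
            if entry.is_file(follow_symlinks=follow_symlinks)
        )
```

With follow_symlinks=False, a symbolic link that points to a file is not counted as a file.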

Marcus Riemer · Accepted Answer · 2013-02-20 12:25:12Z

13

import os

def directory(path, extension):
  """Count entries in path whose names end with extension, e.g. '.txt'."""
  count = 0
  for name in os.listdir(path):
    if name.endswith(extension):
      count += 1
  return count

edited Feb 20, 2013 at 12:25

Marcus Riemer

answered Feb 20, 2013 at 12:04

ninjrok

Comments

rash · Accepted Answer · 2014-07-01 10:18:18Z

12

import os
print(len(os.listdir(os.getcwd())))

answered Jul 1, 2014 at 10:18

rash

1 Comment

Brian Burns Over a year ago

This might be useful sometimes but it includes subdirectories in the count also

joaquin · Accepted Answer · 2012-06-07 19:14:29Z

10

This uses os.listdir and works for any directory:

import os
directory = 'mydirpath'

number_of_files = len([item for item in os.listdir(directory) if os.path.isfile(os.path.join(directory, item))])

This can be simplified with a generator expression and made slightly faster:

import os
isfile = os.path.isfile
join = os.path.join

directory = 'mydirpath'
number_of_files = sum(1 for item in os.listdir(directory) if isfile(join(directory, item)))

edited Jun 7, 2012 at 19:14

answered Apr 13, 2010 at 18:46

joaquin

Comments

LBes · Accepted Answer · 2018-10-18 09:17:13Z

10

I agree with the answer provided by @DanielStutzbach: os.listdir() will be slightly more efficient than using glob.glob.

However, to be more precise: if you want to count files matching a specific pattern in a folder, glob is the tool to use. For instance, to count all the PDFs in a folder:

pdfCounter = len(glob.glob1(myPath,"*.pdf"))
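One caution about the snippet above: glob.glob1 is an undocumented helper, and the Python 3.13 release notes list it as deprecated. A sketch that sticks to the documented glob.glob() could look like this; the function name count_by_pattern is hypothetical. On Python 3.10 and newer, passing root_dir=directory to glob.glob() should be an alternative to building the path by hand.

```python
import glob
import os


def count_by_pattern(directory, pattern="*.pdf"):
    """Count entries directly in `directory` whose names match `pattern`."""
    # glob.escape keeps special characters in the directory name itself
    # (such as [ or *) from being treated as wildcards.
    full_pattern = os.path.join(glob.escape(directory), pattern)
    return len(glob.glob(full_pattern))
```

Like the original, this matches on names, so a directory called something.pdf would be counted as well.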

answered Oct 18, 2018 at 9:17

LBes

Comments

MLDev · Accepted Answer · 2021-09-12 22:58:12Z

This is an easy solution that counts the number of files in a directory containing sub-folders. It may come in handy:

import os
from pathlib import Path

def count_files(rootdir):
    '''Prints the number of files in each subfolder of a directory.'''
    for path in Path(rootdir).iterdir():
        if path.is_dir():
            n_files = len([name for name in os.listdir(path)
                           if os.path.isfile(os.path.join(path, name))])
            print("There are " + str(n_files) + " files in " + str(path.name))


count_files(data_dir)  # data_dir is the directory you want files counted.

You should get an output similar to this (with the placeholders changed, of course):

There are {number of files} files in {name of sub-folder1}
There are {number of files} files in {name of sub-folder2}

Kristian Damian · Accepted Answer · 2010-04-13 18:48:44Z

6

import os

def count_em(valid_path):
    x = 0
    for root, dirs, files in os.walk(valid_path):
        for f in files:
            x = x + 1
    print("There are", x, "files in this directory.")
    return x

Taken from this post

answered Apr 13, 2010 at 18:48

Kristian Damian

1 Comment

SilentGhost Over a year ago

1. files is a list. 2. OP is not looking for recursive count

juan Isaza · Accepted Answer · 2020-07-11 17:14:38Z

6

A one-liner, and recursive:

import os

def count_files(path):
    return sum([len(files) for _, _, files in os.walk(path)])

count_files('path/to/dir')

answered Jul 11, 2020 at 17:14

juan Isaza

Comments

tzot · Accepted Answer · 2010-04-13 22:31:34Z

5

import os

def count_files(in_directory):
    joiner= (in_directory + os.path.sep).__add__
    return sum(
        os.path.isfile(filename)
        for filename
        in map(joiner, os.listdir(in_directory))
    )

>>> count_files("/usr/lib")
1797
>>> len(os.listdir("/usr/lib"))
2049

answered Apr 13, 2010 at 22:31

tzot

Comments

Bojan Tunguz · Accepted Answer · 2016-05-09 18:23:27Z

5

Here is a simple one-line command that I found useful:

import os
print(int(os.popen("ls | wc -l").read()))

answered May 9, 2016 at 18:23

Bojan Tunguz

1 Comment

Bloodgain Over a year ago

Parsing the output of ls is generally frowned upon (it can frequently cause issues), though this is not a bad "quick-and-dirty" method at the shell. You should use ls -1, though, so it guarantees one line per file.

okobaka · Accepted Answer · 2012-05-30 08:26:07Z

4

A reformatted version of Luke's code (Python 2 only):

import os

print len(os.walk('/usr/lib').next()[2])

answered May 30, 2012 at 8:26

okobaka

1 Comment

Milind R Over a year ago

Gives me AttributeError: 'generator' object has no attribute 'next' in Python 3.11

user799188 · Accepted Answer · 2016-11-24 06:45:14Z

2

I used glob.iglob for a directory structure similar to

data
└───train
│   └───subfolder1
│   |   │   file111.png
│   |   │   file112.png
│   |   │   ...
│   |
│   └───subfolder2
│       │   file121.png
│       │   file122.png
│       │   ...
└───test
    │   file221.png
    │   file222.png

Both of the following options return 4 (as expected; the subfolders themselves are not counted):

len(list(glob.iglob("data/train/*/*.png", recursive=True)))
sum(1 for i in glob.iglob("data/train/*/*.png"))
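Worth noting: the pattern "data/train/*/*.png" only reaches one level below train, and recursive=True only has an effect when the pattern contains "**". If the images can sit at any depth, a sketch like the following might be closer to what is wanted. The function name and the .png suffix are just taken from the example above.

```python
import glob
import os


def count_png_recursive(root):
    """Count *.png paths at any depth under `root` using a ** pattern."""
    # "**" matches zero or more directory levels when recursive=True.
    pattern = os.path.join(glob.escape(root), "**", "*.png")
    # iglob yields matches lazily, so no list of paths is built here.
    return sum(1 for _ in glob.iglob(pattern, recursive=True))
```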

answered Nov 24, 2016 at 6:45

user799188

Comments

Agha Saad · Accepted Answer · 2018-07-31 15:06:00Z

2

It is simple:

import os
print(len([iq for iq in os.scandir('PATH')]))

This counts the number of entries in the directory (files and subdirectories alike). The list comprehension iterates over the given directory and collects every entry, and len() of the resulting list gives the count.

edited Jul 31, 2018 at 15:06

answered Jul 29, 2018 at 10:01

Agha Saad

2 Comments

Elletlar Over a year ago

Welcome to Stack Overflow. The quality of this answer can be improved by adding an explanation: How to Answer

Agha Saad Over a year ago

Thankyou Elletlar , i have edited my answer , i will make sure to respond in more comprehensive manner :D

styler · Accepted Answer · 2015-04-08 13:38:55Z

1

If you use the operating system's standard shell, you can get the result much faster than with a pure Python approach.

Example for Windows:

import os
import subprocess

def get_num_files(path):
    cmd = 'DIR \"%s\" /A-D /B /S | FIND /C /V ""' % path
    return int(subprocess.check_output(cmd, shell=True))

answered Apr 8, 2015 at 13:38

styler

1 Comment

Politank-Z Over a year ago

But it won't be as portable.

Ismail · Accepted Answer · 2015-04-19 10:04:19Z

1

I found another answer that may be as correct as the accepted answer.

import os

datafiles = []
for root, dirs, files in os.walk(input_path):
    for name in files:
        if os.path.splitext(name)[1] in ('.TXT', '.txt'):
            datafiles.append(os.path.join(root, name))

# Count the collected .txt files, not the last directory's file list
print(len(datafiles))

answered Apr 19, 2015 at 10:04

Ismail

Comments

Kinyugo · Accepted Answer · 2020-09-27 11:27:15Z

1

A simple utility function I wrote that makes use of os.scandir() instead of os.listdir().

import os 

def count_files_in_dir(path: str) -> int:
    file_entries = [entry for entry in os.scandir(path) if entry.is_file()]

    return len(file_entries)

The main benefit is that the need for os.path.isfile() is eliminated and replaced with the os.DirEntry instance's is_file(), which also removes the need for os.path.join(DIR, file_name) as shown in other answers.

answered Sep 27, 2020 at 11:27

Kinyugo

Comments

Mayur Gupta · Accepted Answer · 2022-01-12 05:54:38Z

1

Simpler one:

import os
number_of_files = len(os.listdir(directory))
print(number_of_files)

answered Jan 12, 2022 at 5:54

Mayur Gupta

1 Comment

maciejwww Over a year ago

This solution counts also directories, not only files as it was indicated in question.

Mohit Dabas · Accepted Answer · 2014-09-29 06:30:32Z

0

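import os

directory = '<directory path>'
total_con = os.listdir(directory)

files = []

for f_n in total_con:
    # Join with the directory; a bare name is resolved against the CWD
    if os.path.isfile(os.path.join(directory, f_n)):
        files.append(f_n)


print(len(files))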

edited Sep 29, 2014 at 6:30

answered Sep 29, 2014 at 5:59

Mohit Dabas

1 Comment

tktk Over a year ago

The OP asked for the number of files, this lists directories as well.

jkalden · Accepted Answer · 2017-01-11 16:03:16Z

0

I did this, and it returned the number of files in the folder (Attack_Data). This works fine.

import os
def fcount(path):
    #Counts the number of files in a directory
    count = 0
    for f in os.listdir(path):
        if os.path.isfile(os.path.join(path, f)):
            count += 1

    return count
path = r"C:\Users\EE EKORO\Desktop\Attack_Data" #Read files in folder
print (fcount(path))

edited Jan 11, 2017 at 16:03

jkalden

answered Jan 11, 2017 at 15:05

Sam Ekoro

Comments

Charlie Parker · Accepted Answer · 2022-08-08 20:22:30Z

I find that sometimes I don't know whether I will receive filenames or full paths to the files, so I printed the output of the os.walk solution:

import os
from pathlib import Path
from typing import Union

def count_number_of_raw_data_point_files(path: Union[str, Path], with_file_prefix: str) -> int:
    # The original called an unshown helper, force_expanduser;
    # Path.expanduser() is assumed here to do the same job.
    path = Path(path).expanduser()

    _, _, files = next(os.walk(path))
    count: int = 0
    for filename in files:
        print(f'-->{filename=}')  # e.g. prints -->filename='data_point_99.json'
        if with_file_prefix in filename:
            count += 1
    return count

out:

-->filename='data_point_780.json'
-->filename='data_point_781.json'
-->filename='data_point_782.json'
-->filename='data_point_783.json'
-->filename='data_point_784.json'
-->filename='data_point_785.json'
-->filename='data_point_786.json'
-->filename='data_point_787.json'
-->filename='data_point_788.json'
-->filename='data_point_789.json'
-->filename='data_point_79.json'
-->filename='data_point_790.json'
-->filename='data_point_791.json'
-->filename='data_point_792.json'
-->filename='data_point_793.json'
-->filename='data_point_794.json'
-->filename='data_point_795.json'
-->filename='data_point_796.json'
-->filename='data_point_797.json'
-->filename='data_point_798.json'
-->filename='data_point_799.json'
-->filename='data_point_8.json'
-->filename='data_point_80.json'
-->filename='data_point_800.json'
-->filename='data_point_801.json'
-->filename='data_point_802.json'
-->filename='data_point_803.json'
-->filename='data_point_804.json'
-->filename='data_point_805.json'
-->filename='data_point_806.json'
-->filename='data_point_807.json'
-->filename='data_point_808.json'
-->filename='data_point_809.json'
-->filename='data_point_81.json'
-->filename='data_point_810.json'
-->filename='data_point_811.json'
-->filename='data_point_812.json'
-->filename='data_point_813.json'
-->filename='data_point_814.json'
-->filename='data_point_815.json'
-->filename='data_point_816.json'
-->filename='data_point_817.json'
-->filename='data_point_818.json'
-->filename='data_point_819.json'
-->filename='data_point_82.json'
-->filename='data_point_820.json'
-->filename='data_point_821.json'
-->filename='data_point_822.json'
-->filename='data_point_823.json'
-->filename='data_point_824.json'
-->filename='data_point_825.json'
-->filename='data_point_826.json'
-->filename='data_point_827.json'
-->filename='data_point_828.json'
-->filename='data_point_829.json'
-->filename='data_point_83.json'
-->filename='data_point_830.json'
-->filename='data_point_831.json'
-->filename='data_point_832.json'
-->filename='data_point_833.json'
-->filename='data_point_834.json'
-->filename='data_point_835.json'
-->filename='data_point_836.json'
-->filename='data_point_837.json'
-->filename='data_point_838.json'
-->filename='data_point_839.json'
-->filename='data_point_84.json'
-->filename='data_point_840.json'
-->filename='data_point_841.json'
-->filename='data_point_842.json'
-->filename='data_point_843.json'
-->filename='data_point_844.json'
-->filename='data_point_845.json'
-->filename='data_point_846.json'
-->filename='data_point_847.json'
-->filename='data_point_848.json'
-->filename='data_point_849.json'
-->filename='data_point_85.json'
-->filename='data_point_850.json'
-->filename='data_point_851.json'
-->filename='data_point_852.json'
-->filename='data_point_853.json'
-->filename='data_point_86.json'
-->filename='data_point_87.json'
-->filename='data_point_88.json'
-->filename='data_point_89.json'
-->filename='data_point_9.json'
-->filename='data_point_90.json'
-->filename='data_point_91.json'
-->filename='data_point_92.json'
-->filename='data_point_93.json'
-->filename='data_point_94.json'
-->filename='data_point_95.json'
-->filename='data_point_96.json'
-->filename='data_point_97.json'
-->filename='data_point_98.json'
-->filename='data_point_99.json'
854

Note that the names are not in numeric order (see the output above), so you might have to sort them yourself.

K.Mulier · Accepted Answer · 2022-12-24 12:13:34Z

0

I would like to extend the reply from @Mr_and_Mrs_D:

import os
folder = 'C:/Dropbox'
file_count = sum(len(files) for _, _, files in os.walk(folder))
print(file_count)

This counts all the files in the folder and its subfolders. However, if you want to do some filtering - like only counting the files ending in .svg, you can do:

import os
file_count = sum(len([f for f in files if f.endswith('.svg')]) for _, _, files in os.walk(folder))
print(file_count)

You basically replace:

len(files)

with:

len([f for f in files if f.endswith('.svg')])
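The same filter can be written without building a temporary list per directory, by flattening it into one generator expression. The sketch below also accepts several suffixes and compares case-insensitively; both of those are my own assumptions rather than part of the answer, and the function name is hypothetical.

```python
import os


def count_by_suffix(root, suffixes=(".svg",)):
    """Recursively count files under `root` whose names end with a suffix."""
    # str.endswith accepts a tuple, so several suffixes can be checked at once.
    suffixes = tuple(s.lower() for s in suffixes)
    return sum(
        1
        for _, _, files in os.walk(root)
        for name in files
        # Lower-casing the name is an assumption: it makes .SVG match too.
        if name.lower().endswith(suffixes)
    )
```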

answered Dec 24, 2022 at 12:13

K.Mulier

Comments

Skully · Accepted Answer · 2023-02-09 21:53:36Z

-2

glob.glob() already returns a list, so len() works on it directly; wrapping it in list() is harmless and only needed for iterators such as glob.iglob():

len(list(glob.glob('*')))

edited Feb 9, 2023 at 21:53

Skully

answered Dec 7, 2021 at 23:53

Eslamspot
