Python Pandas read_excel returns empty Dataframe

Question

Reading a simple xls returning empty dataframe, can't figure it out for the life of me:

path = ('c:/Users/Desktop/Stuff/Ready')
files = os.listdir(path)
print(files)

files_xlsx = [f for f in files if f[-3:] == 'xlsx']

readyorders = pd.DataFrame()
for filename in files_xlsx:
    with open(os.path.join(path, filename)) as f:
        data = pd.read_excel(f)
        readyorders = readyorders.append(data)

print(readyorders)

The excel is just two simple columns...is it just too early in the day?

In general, pd.read_excel returns a map sheetname -> dataframe. You may use sheetname=None as arg. This should read the dataframe in the first (and possibly only) sheet — pazqo
– pazqo, Commented Sep 12, 2017 at 16:02
By default its first sheet.. but even with sheetname arg defined still empty. — gbcuzi
– gbcuzi, Commented Sep 12, 2017 at 16:05

Nic Scozzaro · Accepted Answer · 2018-10-05 15:38:12Z

7

I had a similar issue and it turns out that there are TWO types of XLSX: "Excel Workbook" (at the top of the list in the image below) and "Strict Open XML Spreadsheet" (with the checkmark). The latter returns an empty spreadsheet in pandas, so use the Excel Workbook (.xlsx) and you won't have problems.

answered Oct 5, 2018 at 15:38

Nic Scozzaro

7,4733 gold badges47 silver badges49 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Minions · Accepted Answer · 2020-11-06 11:02:49Z

3

I had the same issue and I later i discovered that it's because I have many sheets in the excel file and I didn't specify the sheet name.

answered Nov 6, 2020 at 11:02

Minions

5,5376 gold badges56 silver badges105 bronze badges

Comments

Fabian Wörenkämper · Accepted Answer · 2022-06-29 15:51:30Z

3

Sometimes there is also a "hidden sheet" which results of bad exports.. You should use the sheet_name parameter for your sheet then or you could also use sheet_name=None. Then you get a dict with the empty df of the hidden sheet and the other data

answered Jun 29, 2022 at 15:51

Fabian Wörenkämper

735 bronze badges

1 Comment

El- Over a year ago

This was it for me, certainly worth understanding what sheets are present in the spreadsheet you're trying to open and specify the correct one with sheet_name.

Alexander · Accepted Answer · 2017-09-12 16:02:33Z

1

f[-3:] == 'xlsx' will never be true, as you are evaluating the last three characters and comparing it to a string of four characters.

Try f[-4:] == 'xlsx'

As an aside, appending dataframes is very slow. Try concatenating instead:

readyorders = pd.concat([pd.read_excel(f) for f in files if f[-5:] == '.xlsx']

answered Sep 12, 2017 at 16:02

Alexander

111k32 gold badges212 silver badges208 bronze badges

1 Comment

gbcuzi Over a year ago

This appears to be the right issue. But now im getting unicode decode errors. This exact code works fine elsewhere....am i losing my mind?

Capybara · Accepted Answer · 2024-07-08 06:12:50Z

0

Mine returned an empty DataFrame and I checked:

xl = pd.ExcelFile(path)
print(xl.sheet_names)  # see all sheet names

and I found a hidden sheet.

edited Jul 8, 2024 at 6:12

Capybara

8571 gold badge11 silver badges22 bronze badges

answered Jul 6, 2024 at 8:05

lazydeath

11 bronze badge

Collectives™ on Stack Overflow

Python Pandas read_excel returns empty Dataframe

5 Answers 5

Comments

Comments

1 Comment

1 Comment

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

Comments

Comments

1 Comment

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related