Convert row to column header for Pandas DataFrame

Question

The data I have to work with is a bit messy.. It has header names inside of its data. How can I choose a row from an existing pandas dataframe and make it (rename it to) a column header?

I want to do something like:

header = df[df['old_header_name1'] == 'new_header_name1']

df.columns = header

unutbu · Accepted Answer · 2019-08-14 15:52:10Z

313

In [21]: df = pd.DataFrame([(1,2,3), ('foo','bar','baz'), (4,5,6)])

In [22]: df
Out[22]: 
     0    1    2
0    1    2    3
1  foo  bar  baz
2    4    5    6

Set the column labels to equal the values in the 2nd row (index location 1):

In [23]: df.columns = df.iloc[1]

If the index has unique labels, you can drop the 2nd row using:

In [24]: df.drop(df.index[1])
Out[24]: 
1 foo bar baz
0   1   2   3
2   4   5   6

If the index is not unique, you could use:

In [133]: df.iloc[pd.RangeIndex(len(df)).drop(1)]
Out[133]: 
1 foo bar baz
0   1   2   3
2   4   5   6

Using df.drop(df.index[1]) removes all rows with the same label as the second row. Because non-unique indexes can lead to stumbling blocks (or potential bugs) like this, it's often better to take care that the index is unique (even though Pandas does not require it).

edited Aug 14, 2019 at 15:52

answered Oct 1, 2014 at 17:42

unutbu

886k197 gold badges1.9k silver badges1.7k bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

E.K. Over a year ago

Thank you so much for your quick response! How can I choose a row by value in stead of index location to make it header? So for your example something like.. df.columns = df[df[0] == 'foo']

unutbu Over a year ago

The problem with that is there could be more than one row which has the value "foo". One way around that problem is to explicitly choose the first such row: df.columns = df.iloc[np.where(df[0] == 'foo')[0][0]].

E.K. Over a year ago

Ah I see why you did that way. For my case, I know there is only one row that has the value "foo". So it is ok. I just did this way I guess it is the same as the one you gave me above. idx_loc = df[df[0] == 'foo'].index.tolist()[0] df.columns = df.iloc[idx_loc]

Rob · Accepted Answer · 2017-03-15 22:56:58Z

125

This works (pandas v'0.19.2'):

df.rename(columns=df.iloc[0])

edited Mar 15, 2017 at 22:56

Rob♦

27.5k16 gold badges89 silver badges103 bronze badges

answered Mar 15, 2017 at 22:22

Zachary Wilson

1,3821 gold badge8 silver badges4 bronze badges

3 Comments

ostrokach Over a year ago

You can remove the "header" row by adding .drop(df.index[0])

Javier Over a year ago

I like this better than the actual accepted answer. I love the short oneline solutions.

onestep.ua Over a year ago

Please keep in mind that after dropping the first row, index would start from 1, so you probably would like to add .reset_index(drop=True).

Dawn · Accepted Answer · 2018-12-14 08:04:36Z

41

It would be easier to recreate the data frame. This would also interpret the columns types from scratch.

headers = df.iloc[0]
new_df  = pd.DataFrame(df.values[1:], columns=headers)

answered Dec 14, 2018 at 8:04

Dawn

3,6486 gold badges46 silver badges62 bronze badges

1 Comment

Draco D Over a year ago

Simple and easy. Nice!

Govinda · Accepted Answer · 2020-09-04 17:52:07Z

20

To rename the header without reassign df:

df.rename(columns=df.iloc[0], inplace = True)

To drop the row without reassign df:

df.drop(df.index[0], inplace = True)

answered Sep 4, 2020 at 17:52

Govinda

98910 silver badges7 bronze badges

Comments

ccpizza · Accepted Answer · 2020-06-08 17:21:12Z

8

You can specify the row index in the read_csv or read_html constructors via the header parameter which represents Row number(s) to use as the column names, and the start of the data. This has the advantage of automatically dropping all the preceding rows which supposedly are junk.

import pandas as pd
from io import StringIO

In[1]
    csv = '''junk1, junk2, junk3, junk4, junk5
    junk1, junk2, junk3, junk4, junk5
    pears, apples, lemons, plums, other
    40, 50, 61, 72, 85
    '''

    df = pd.read_csv(StringIO(csv), header=2)
    print(df)

Out[1]
       pears   apples   lemons   plums   other
    0     40       50       61      72      85

edited Jun 8, 2020 at 17:21

answered Aug 13, 2018 at 12:45

ccpizza

32.4k24 gold badges186 silver badges195 bronze badges

2 Comments

pablete Over a year ago

This does not address the question itself, which is asking about an already existing DataFrame.

ccpizza Over a year ago

some of the users who found this question (possibly the majority) would have a more generic use case than the OP; this answer is for that group

G M · Accepted Answer · 2022-08-26 10:38:11Z

0

Keeping it Python simple

Padas DataFrames have columns attribute why not use it with standard Python, it is much clearer what you are doing:

table = [['name', 'Rf', 'Rg', 'Rf,skin', 'CRI'],
 ['testsala.cxf', '86', '95', '92', '87'],
 ['testsala.cxf: 727037 lm', '86', '95', '92', '87'],
 ['630.cxf', '18', '8', '11', '18'],
 ['Huawei stk-lx1.cxf', '86', '96', '88', '83'],
 ['dedo uv no filtro.cxf', '52', '93', '48', '58']]

import pandas as pd
data = pd.DataFrame(table[1:],columns=table[0])

or in the case is not the first row, but the 10th for instance:

columns = table.pop(10)
data = pd.DataFrame(table,columns=columns)

answered Aug 26, 2022 at 10:38

G M

22.8k11 gold badges89 silver badges94 bronze badges

2 Comments

gbox Over a year ago

Tested for performance, although we know that the creation of a new DataFrame is "time-consuming" Anyhow this approach took 40X more time

G M Over a year ago

@gbox thanks for you comment! If you want edit the answer

Collectives™ on Stack Overflow

Convert row to column header for Pandas DataFrame

6 Answers 6

3 Comments

3 Comments

1 Comment

Comments

2 Comments

Keeping it Python simple

2 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

3 Comments

3 Comments

1 Comment

Comments

2 Comments

Keeping it Python simple

2 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related