How can I convert Sqlalchemy table object to Pandas DataFrame?

Question

Is it possible to convert retrieved SqlAlchemy table object into Pandas DataFrame or do I need to write a particular function for that aim ?

Yes but SqlAlchemy has other use cases in my project as well. — erogol
– erogol, Commented Aug 12, 2014 at 13:17
For when you want to use another selectable than just the table (including working with the orm), take a look at: stackoverflow.com/a/29528804/1273938 — LeoRochael
– LeoRochael, Commented Jul 31, 2015 at 18:39

Halee · Accepted Answer · 2018-07-12 16:38:28Z

16

This might not be the most efficient way, but it has worked for me to reflect a database table using automap_base and then convert it to a Pandas DataFrame.

import pandas as pd
from sqlalchemy.ext.automap import automap_base
from sqlalchemy import create_engine
from sqlalchemy.orm import Session

connection_string = "your:db:connection:string:here"
engine = create_engine(connection_string, echo=False)
session = Session(engine)

# sqlalchemy: Reflect the tables
Base = automap_base()
Base.prepare(engine, reflect=True)

# Mapped classes are now created with names by default matching that of the table name.
Table_Name = Base.classes.table_name

# Example query with filtering
query = session.query(Table_Name).filter(Table_Name.language != 'english')

# Convert to DataFrame
df = pd.read_sql(query.statement, engine)
df.head()

answered Jul 12, 2018 at 16:38

Halee

5029 silver badges15 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Connor Dibble Over a year ago

I benchmarked this and the above (more upvoted) answer. This answer is roughly twice as fast, in addition to being simpler. Nice work. Also, thank you.

Corralien Over a year ago

From SQLAlchemy 1.4, reflect parameter is deprecated and reflection is enabled when autoload_with is passed (this is the case here with engine parameter)

jkmacc · Accepted Answer · 2014-08-12 18:18:05Z

6

I think I've tried this before. It's hacky, but for whole-table ORM query results, this should work:

import pandas as pd

cols = [c.name for c in SQLA_Table.__table__.columns]
pk = [c.name for c in SQLA_Table.__table__.primary_key]
tuplefied_list = [(getattr(item, col) for col in cols) for item in result_list]

df = pd.DataFrame.from_records(tuplefied_list, index=pk, columns=cols)

Partial query results (NamedTuples) will also work, but you have to construct the DataFrame columns and index to match your query.

edited Aug 12, 2014 at 18:18

answered Aug 12, 2014 at 14:04

jkmacc

6,5973 gold badges33 silver badges30 bronze badges

7 Comments

Paul H Over a year ago

just use pandas.read_sql with an SQLAlchemy engine. it's dead simple.

jkmacc Over a year ago

How do you use pandas.read_sql on an ORM query, like: session.query(MyORMTable).limit(100).all() ?

Paul H Over a year ago

pandas.read_sql_table('MyTable', MySQLEngine) see here pandas.pydata.org/pandas-docs/stable/…

jkmacc Over a year ago

Very cool. It looks like it doesn't convert existing query results, though (or work with the ORM), which is how I was interpreting the original question.

Vincent Over a year ago

What is result_list here? I get an error when trying to run this. I also have existing query results that I want to convert to a pandas data frame (as opposed to just loading up a straight table)

|

mirekphd · Accepted Answer · 2022-08-08 06:29:13Z

0

Pandas database functions such as read_sql_query accept SQLAlchemy connection objects (so-called SQLAlchemy connectables, see pandas docs and sqlalchemy docs). Here's one example of using such object called my_connection:

import pandas as pd
import sqlalchemy

# create SQLAlchemy Engine object instance 
my_engine = sqlalchemy.create_engine(f"{dialect}+{driver}://{login}:{password}@{host}/{db_name}")

# connect to the database using the newly created Engine instance
my_connection = my_engine.connect()

# run SQL query
my_df = pd.read_sql_query(sql=my_sql_query, con=my_connection)

answered Aug 8, 2022 at 6:29

mirekphd

7,2314 gold badges62 silver badges89 bronze badges

Comments

showteth · Accepted Answer · 2022-08-15 15:37:26Z

-1

I have a simpler way:

# Step1: import
import pandas as pd
from sqlalchemy import create_engine

# Step2: create_engine
connection_string = "sqlite:////absolute/path/to/database.db"
engine = create_engine(connection_string)

# Step3: select table
print (engine.table_names())

# Step4: read table
table_df = pd.read_sql_table('table_name', engine)
table_df.head()

For other types of connection_string, SQLAlchemy 1.4 Documentation.

answered Aug 15, 2022 at 15:37

showteth

4385 silver badges11 bronze badges

1 Comment

itscarlayall Over a year ago

AttributeError: 'Engine' object has no attribute 'table_names'

Collectives™ on Stack Overflow

How can I convert Sqlalchemy table object to Pandas DataFrame?

4 Answers 4

2 Comments

7 Comments

Comments

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

2 Comments

7 Comments

Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related