I am trying to load a CSV file from GCS into a Cloud SQL for SQL Server database using SQLAlchemy and DataFrame.to_sql.
The problem I am running into is that the column names in the database contain spaces. If I replace the spaces with the "_" character the generated INSERT works, but with the original names it fails:
Works
(@From_Date NVARCHAR(MAX)) INSERT INTO dbo.calendar ([From_Date]) VALUES (@From_Date)
Fails
(@From Date NVARCHAR(MAX)) INSERT INTO dbo.calendar ([From Date]) VALUES (@From Date)
Is there any way to make it behave like this
(@FromDate NVARCHAR(MAX)) INSERT INTO dbo.calendar ([From Date]) VALUES (@FromDate)
or this?
(@From_Date NVARCHAR(MAX)) INSERT INTO dbo.calendar ([From Date]) VALUES (@From_Date)
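Expressed directly in SQLAlchemy, this is the shape I am after (just a sketch against the example table above; the date value is only illustrative): an underscored bind parameter that still targets the bracketed column name containing the space.

import sqlalchemy

# bind parameter From_Date (no space) writing to column [From Date] (with space)
stmt = sqlalchemy.text("INSERT INTO dbo.calendar ([From Date]) VALUES (:From_Date)")

# engine comes from init_connection_engine() in the script below
with engine.begin() as conn:
    conn.execute(stmt, [{"From_Date": "2021-01-01"}])  # illustrative value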
Python Script
import os
import pandas as pd
import pytds
import sqlalchemy
from fast_to_sql import fast_to_sql as fts
from google.cloud import storage
from google.cloud.sql.connector import connector
from io import StringIO
column_names = [..., "From Date",...]
def init_connection_engine() -> sqlalchemy.engine.Engine:
    def getconn() -> pytds.Connection:
        conn = connector.connect(
            "instance_name",
            "pytds",
            user="sqlserver",
            password="",
            db="my_db"
        )
        return conn

    engine = sqlalchemy.create_engine(
        "mssql+pytds://localhost",
        creator=getconn,
    )
    engine.dialect.description_encoding = None
    return engine
# GCS client
storage_client = storage.Client(project=project_name)
# storage bucket connection
bucket = storage_client.get_bucket(bucket_name)
blob_file = bucket.get_blob(file_name)
gcs_file_byte = blob_file.download_as_string()
gcs_file_string = gcs_file_byte.decode()
file_data_string = StringIO(gcs_file_string)
calendar_dataframe = pd.read_csv(file_data_string, sep=',', header=None,
                                 usecols=[*range(0, 15)], names=column_names)
pool = init_connection_engine()
with pool.connect() as db_conn:
    calendar_dataframe.to_sql(table_id, db_conn, schema='dbo', if_exists='append',
                              chunksize=None, index=False)
The same script with mssql+pyodbc:// works fine.
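One direction I have been looking at (only a sketch, not verified against pytds) is the method= hook that DataFrame.to_sql accepts, which receives (table, conn, keys, data_iter) and lets the caller build the INSERT, so the bind-parameter names can be sanitized while the bracketed column names keep their spaces; the helper name insert_with_safe_params is mine.

import sqlalchemy

def insert_with_safe_params(table, conn, keys, data_iter):
    # keys holds the original column names; underscore them only for the bind parameters
    params = [k.replace(" ", "_") for k in keys]
    columns_sql = ", ".join(f"[{k}]" for k in keys)
    values_sql = ", ".join(f":{p}" for p in params)
    target = f"{table.schema}.{table.name}" if table.schema else table.name
    stmt = sqlalchemy.text(f"INSERT INTO {target} ({columns_sql}) VALUES ({values_sql})")
    rows = [dict(zip(params, row)) for row in data_iter]
    conn.execute(stmt, rows)

which would then be passed as:

calendar_dataframe.to_sql(table_id, db_conn, schema='dbo', if_exists='append',
                          index=False, method=insert_with_safe_params)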