Several fixes and enhancements #39

musashiXXX · 2018-11-06T12:37:33Z

This pull request makes the following fixes and enhancements:

newTable needed to be renamed to new_table in ETLAlchemySource.py
schema_transformer.py was not correctly retrieving the table name from the table_transformations dict, which effectively disabled all mappings in the table_schema_transformation_file.
MSSQL UNIQUEIDENTIFIER types will now be converted to PostgreSQL UUID types when migrating from MSSQL -> PostgreSQL
Added a per_table_buffers kwarg that allows for specifying the number of rows to fetch at once from the source database
We now check the varchar_length and try to adjust it for our target database, e.g., MSSQL varchar(max) -> PostgreSQL text
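The varchar adjustment in the last item can be pictured roughly as follows. This is a minimal sketch, not the code from this PR: the function name `adjust_varchar` and the `target_max` parameter are hypothetical, and `None` is used here to stand in for an unbounded source column such as MSSQL varchar(max).

```python
from typing import Optional, Tuple


def adjust_varchar(length: Optional[int], target_max: int) -> Tuple[str, Optional[int]]:
    """Pick a target column type for a source VARCHAR (hypothetical helper).

    length=None models an unbounded source column, e.g. MSSQL varchar(max).
    target_max is the largest VARCHAR length the target database accepts;
    the caller supplies it, since the limit differs per database.
    """
    if length is None or length > target_max:
        # Unbounded or too long for the target: fall back to TEXT.
        return ("TEXT", None)
    return ("VARCHAR", length)


# The limit below is an illustrative value passed in by the caller.
unbounded = adjust_varchar(None, 10485760)
bounded = adjust_varchar(255, 10485760)
```

Passing the limit as a parameter keeps the per-database maximum out of the helper itself.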

See line 12 in `schema_transformer.py` for details.

…(e.g., 1000.0) even though the value is actually a whole number. This causes trouble when migrating to other databases that expect a number without a decimal. For example, when migrating SQL Server -> PostgreSQL, the following error will be raised: `psycopg2.DataError: invalid input syntax for integer: "1000048.0"`
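The kind of coercion this fragment describes could look like the sketch below. It is an assumption about the approach, not the code in `schema_transformer.py`; the name `coerce_whole_float` is hypothetical.

```python
def coerce_whole_float(value):
    """Turn a float that holds a whole number (e.g. 1000048.0) into an int.

    Anything else (None, strings, fractional floats, NaN, infinity) is
    returned unchanged. Hypothetical helper for illustration only.
    """
    if isinstance(value, float) and value.is_integer():
        return int(value)
    return value


whole = coerce_whole_float(1000048.0)
fractional = coerce_whole_float(1000048.5)
```

Applied to a value bound for an integer column, this sends `1000048` to the target database rather than `"1000048.0"`.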

seanharr11

@musashiXXX thank you for the PR! One concern:

I am not in agreement with your generalization of coercing data to a uuid.UUID if isinstance(value, bytearray) is True. While this likely solved your use case, I am concerned that it doesn't generalize to other columns that hold bytearray values. There may be a BINARY TEXT blob column in a DB, and we don't necessarily want to turn that into a UUID.
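One way to address this concern is to key the conversion on the source column's declared type rather than on the Python type of the value. The sketch below is only an illustration under assumptions: `convert_value` and its `source_type_name` argument are hypothetical, and it assumes the driver hands back the 16 raw bytes in SQL Server's mixed-endian layout, which `uuid.UUID(bytes_le=...)` decodes.

```python
import uuid


def convert_value(value, source_type_name):
    """Convert to uuid.UUID only for columns declared UNIQUEIDENTIFIER.

    Other binary columns (blobs, VARBINARY, ...) pass through untouched.
    Hypothetical helper; assumes a 16-byte little-endian-style payload.
    """
    is_guid_column = source_type_name.upper() == "UNIQUEIDENTIFIER"
    if is_guid_column and isinstance(value, (bytes, bytearray)) and len(value) == 16:
        return uuid.UUID(bytes_le=bytes(value))
    return value


original = uuid.uuid4()
converted = convert_value(bytearray(original.bytes_le), "uniqueidentifier")
blob = convert_value(bytearray(b"0123456789abcdef"), "VARBINARY")
```

Here the 16-byte blob in the VARBINARY column stays a `bytearray`, even though its length would also fit a UUID.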

musashiXXX · 2018-12-19T13:44:27Z

@seanharr11

> I am not in agreement with your generalization of coercing data to a uuid.UUID if isinstance(value, bytearray) is True.

I agree. The 8th commit in this PR undoes that behavior. The code you mentioned was submitted in haste: one commit contained the ill-conceived "coerce all bytea to uuid" code, and the correction followed two commits later.

musashiXXX added 11 commits October 30, 2018 09:50

ETLAlchemySource.py uses newTable when it should use new_table.

7e20084

See line 12 in `schema_transformer.py` for details.

Fixed an issue caused by the previous commit where BigInteger columns would be coerced to `Integer`.

8c2eaa4

Added a per_table_buffers kwarg (defaults to empty dict) that allows for specifying, on a per-table basis, the buffer size when fetching rows from the source database (i.e., the number of rows to fetch at a time, per table)

9fe4403
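The idea behind a per-table buffer size can be sketched with the standard DB-API `fetchmany` call. This is not the implementation from this commit: `fetch_in_chunks`, `default_buffer`, and the `big_table` demo table are hypothetical, and an in-memory SQLite database stands in for the real source.

```python
import sqlite3


def fetch_in_chunks(cursor, table, per_table_buffers, default_buffer=10000):
    """Yield lists of rows, using the table's own buffer size if one is set.

    per_table_buffers maps table name -> rows to fetch per round trip.
    The table name is interpolated directly, so it must come from trusted
    schema metadata, never from user input.
    """
    size = per_table_buffers.get(table, default_buffer)
    cursor.execute(f'SELECT * FROM "{table}"')
    while True:
        rows = cursor.fetchmany(size)
        if not rows:
            break
        yield rows


# Demo: five rows fetched two at a time.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE big_table (id INTEGER)")
conn.executemany("INSERT INTO big_table VALUES (?)", [(i,) for i in range(5)])
chunks = list(fetch_in_chunks(conn.cursor(), "big_table", {"big_table": 2}))
```

A smaller buffer for wide tables keeps memory use bounded, while tables missing from the dict fall back to the default.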

Reversed the changes to ETLAlchemySource.py made by a previous commit (more on that later). Fixed an issue with schema_transformer.py that causes tables to not be renamed per the table_schema_transformation_file.

a26e184

Added support for PostgreSQL UUID types

1eafcb4

Removed debugging print statement

5a0437c

We can now handle UNIQUEIDENTIFIER types (MSSQL -> PostgreSQL) correctly.

ee7d45e

In the event of a varchar(max), we try to create a varchar column with the maximum allowable size, depending on our target database.

cfd0f57

If varchar_length exceeds the maximum size for our target database, then convert VARCHAR -> TEXT

63c0f52

Updated the required version of SQLAlchemy-Utils to the latest version.

5b0b408

The previous required version causes this error: ValueError: invalid literal for int() with base 10: '5 (Ubuntu 10' when used with PostgreSQL >= 10
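An error of this shape is what a naive "split on dots, then int()" version parser produces when the server reports a string with a distribution suffix. The sketch below reproduces that failure and shows a more tolerant parser; it does not claim to show SQLAlchemy-Utils' actual code, and both the sample version string and the function names are assumptions for illustration.

```python
import re

# Hypothetical server version string with a distribution suffix.
SAMPLE = "10.5 (Ubuntu 10.5-1.pgdg16.04+1)"


def naive_parse(version_string):
    """Split on '.' and convert every piece; breaks on suffixed strings."""
    return tuple(int(part) for part in version_string.split("."))


def tolerant_parse(version_string):
    """Read up to three leading numeric components and ignore the rest."""
    match = re.match(r"\s*(\d+)(?:\.(\d+))?(?:\.(\d+))?", version_string)
    if match is None:
        raise ValueError(f"no version number in {version_string!r}")
    return tuple(int(part) for part in match.groups() if part is not None)


try:
    naive_parse(SAMPLE)
    naive_error = None
except ValueError as exc:
    naive_error = str(exc)

parsed = tolerant_parse(SAMPLE)
```

With the sample string, the second dot-separated piece is `'5 (Ubuntu 10'`, which is the literal that appears in the error message above.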

seanharr11 requested changes Dec 19, 2018

Added some things to the TODO list.

161647c
