You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, modifying the existing external table YAML file definition in dbt does not alter the external table according to the new definition. The process should allow for changes in the external table YAML file definition to be reflected in the actual external table without requiring a full refresh of all tables.
Describe alternatives you've considered
Create an External Table:
• Create an external table using the external table YAML file definition.
Run dbt Operation:
• Use dbt run-operation stage_external_sources to create the external table.
Modify External Table Definition:
• Modify the external table YAML file to add a new column.
• Running dbt run-operation stage_external_sources only refreshes the external table without altering it to include the new column.
Selective Full Refresh:
• If there is any change in the external table definition, running dbt run-operation stage_external_sources --vars "ext_full_refresh: true" should refresh only the modified external table.
Performance Considerations:
• The current command dbt run-operation stage_external_sources --vars "ext_full_refresh: true" recreates or replaces all tables, which can degrade performance in Snowflake.
Additional context
This feature request is specific to Snowflake or other databases that support external tables. The aim is to improve performance and efficiency for those building data ingestion pipelines using dbt.
Who will this benefit?
Data Engineers: Those who build data ingestion pipelines will benefit by having a more efficient process that only updates the changed tables or alters the YAML file definition without affecting other tables.
The text was updated successfully, but these errors were encountered:
Describe the feature
Currently, modifying the existing external table YAML file definition in dbt does not alter the external table according to the new definition. The process should allow for changes in the external table YAML file definition to be reflected in the actual external table without requiring a full refresh of all tables.
Describe alternatives you've considered
• Create an external table using the external table YAML file definition.
• Use dbt run-operation stage_external_sources to create the external table.
• Modify the external table YAML file to add a new column.
• Running dbt run-operation stage_external_sources only refreshes the external table without altering it to include the new column.
• If there is any change in the external table definition, running dbt run-operation stage_external_sources --vars "ext_full_refresh: true" should refresh only the modified external table.
• The current command dbt run-operation stage_external_sources --vars "ext_full_refresh: true" recreates or replaces all tables, which can degrade performance in Snowflake.
Additional context
This feature request is specific to Snowflake or other databases that support external tables. The aim is to improve performance and efficiency for those building data ingestion pipelines using dbt.
Who will this benefit?
Data Engineers: Those who build data ingestion pipelines will benefit by having a more efficient process that only updates the changed tables or alters the YAML file definition without affecting other tables.
The text was updated successfully, but these errors were encountered: