Skip to content

Updating to UC and Streaming Tables #1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 10 commits into
base: main
Choose a base branch
from

Conversation

shabbirk-db
Copy link

Changes made:

  • Fixed _resources to create and write to Volumes instead of dbfs.
  • Commented out autoloader in the notebook
  • Created streaming_table ingestion layer directly in dbt for the three streams, reading from Volumes.
  • This will create full lineage in UC from Volumes all the way to Gold layer.

To do:

  • Update documentation to reflect the changes
  • Re-spec cluster to be UC native when created.
  • Potentially generate the data as part of the dbdemos installation so we can skip the data-ingestion notebook entirely and make a more streamlined experience (e.g. dbt + ML only)
  • Update the ML model to read from the new tables and make it UC friendly if needed

shabbirk-db and others added 10 commits April 29, 2024 07:28
Commented out autoloader (this is now a streaming table in dbt)
Fixed _resources to create and write to Volumes instead of dbfs.

To do:
* Update documentation to reflect the changes
* Re-spec cluster to be UC native when created.
* Potentially generate the data as part of the dbdemos installation so we can skip the data-ingestion notebook entirely and make a more streamlined experience (e.g. dbt + ML only)
Adding Volumes and UC compatibility
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant