How to bulk process a lot of SQL in Python #410
TatsuyaMori63 started this conversation in General
Replies: 2 comments
-
Threads may help.
-
This is more of a usage question, so I moved it to Discussions. To my knowledge, pandas `DataFrame.apply` runs a user-defined function sequentially, one row at a time, so it uses only a single CPU core. You can refer to #353 for high-level ideas on leveraging parallelism.
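As a rough illustration of the parallelism idea (a sketch under my own assumptions, not the approach described in #353), the statements can be spread across worker processes with the standard library's `concurrent.futures`. `LineageRunner`, `source_tables`, and `target_tables` come from sqllineage; the helper names `extract_tables` and `bulk_lineage`, and the parameters `workers`, `chunksize`, and `executor_cls`, are hypothetical and chosen only for this example.

```python
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor


def extract_tables(sql):
    """Parse one SQL text; return (source_tables, target_tables), or None on failure."""
    # Imported inside the function so this module still loads where
    # sqllineage is not installed; each worker imports it on first use.
    from sqllineage.runner import LineageRunner

    try:
        runner = LineageRunner(sql)
        sources = sorted(str(t) for t in runner.source_tables)
        targets = sorted(str(t) for t in runner.target_tables)
        return sources, targets
    except Exception:
        # Assumption: one unparsable statement should not abort the batch.
        return None


def bulk_lineage(statements, parse=extract_tables, workers=None,
                 chunksize=200, executor_cls=ProcessPoolExecutor):
    """Apply `parse` to every statement in parallel, preserving input order."""
    statements = list(statements)
    with executor_cls(max_workers=workers) as pool:
        # chunksize groups statements per task to reduce inter-process
        # overhead (ThreadPoolExecutor accepts the argument but ignores it).
        return list(pool.map(parse, statements, chunksize=chunksize))
```

With the statements in a DataFrame column (say `df["sql"]`, a hypothetical name), `df["lineage"] = bulk_lineage(df["sql"])` would take the place of the `apply` call. Processes rather than threads are the default here because SQL parsing is CPU-bound Python code; passing `executor_cls=ThreadPoolExecutor` is mainly useful for debugging. On platforms that start workers with `spawn` (Windows, macOS), the call should sit under an `if __name__ == "__main__":` guard. Any speedup depends on how many cores the machine actually provides, which on a free Colab instance may be few.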
-
Hi all,
I am using sqllineage from Python to get table-level lineage for 100,000 SQL statements from a job log.
It takes over 10 hours to complete in Google Colab. Does anyone have a good idea for reducing the bulk-processing time?
My source code is here. I put the SQL statements into a DataFrame and pass them to a function.