请教下当列级血缘和表级血缘在一张图中时是如何处理的? #590
Replies: 12 comments 1 reply
-
|
Beta Was this translation helpful? Give feedback.
-
非常感谢您的回复,我这边在建设大数据领域的字段血缘逻辑遇到些问题,看到你写的文章想了解下方案实现的细节
对于上图来说,Hive TB1 和 Hive TB2 之间的表血缘和字段血缘可以正常构建,但后续通过hive2kafka任务加工得到的kafka实体可能就没有字段级血缘,这种情况在分析一个字段的全部下游时可能就会断在这个没有字段血缘的实体上; |
Beta Was this translation helpful? Give feedback.
-
你截图 hive2kafka & kafka2clickhouse 具体的处理是什么,是SQL么。如果是为什么会没有字段血缘? |
Beta Was this translation helpful? Give feedback.
-
是一个配置化的数据同步任务,这里拿这两种任务类型举例,假设kafka实体不存在字段血缘 |
Beta Was this translation helpful? Give feedback.
-
kafka实体的上,下游实体之间有没有字段血缘。 |
Beta Was this translation helpful? Give feedback.
-
有,kafka上游的hive有字段血缘,下游的clickhouse也有字段血缘,只有kafka没有 |
Beta Was this translation helpful? Give feedback.
-
我说的是H2_C2, CK_C1之间有没有血缘 |
Beta Was this translation helpful? Give feedback.
-
那我理解你的血缘就应该在kafka实体上游就停止了,这是符合预期的吧 |
Beta Was this translation helpful? Give feedback.
-
CK_C1 应该是 H2_C1 的子代, CK_C2 是 H2_C2 的子代 |
Beta Was this translation helpful? Give feedback.
-
我理解在列级别血缘,你这个图没有你说的这个关系 |
Beta Was this translation helpful? Give feedback.
-
如果有些实体没有列级血缘,当存在一张大图中时,需要特殊处理这些没有列级血缘的实体,否则查询时会出现血缘断裂,这种特殊处理你这边时怎么做的呢
Beta Was this translation helpful? Give feedback.
All reactions