· null_row_num
· distinct_num
· co_occure
· inclusion_ratio
Example:
If:
· TabA.C1 distinct = 10,000
· TabB.C1 distinct = 100
· 90 values overlap
inclusion_ratio = 0.9
We treat that as structural inclusion.
This approach ignores naming.
It analyzes actual value behavior.
When scaled across datasets, this becomes automated relationship discovery.

Top comments (0)