
Hello Arisyn


Detecting Hidden Table Relationships with Inclusion-Based Field Analysis


Key metrics:

· null_row_num

· distinct_num

· co_occure

· inclusion_ratio

Example:

If:

· TabA.C1 distinct = 10,000

· TabB.C1 distinct = 100

· 90 values overlap

inclusion_ratio = 90 / 100 = 0.9 (overlapping values divided by the distinct count of the smaller column, TabB.C1)

We treat a ratio that high as structural inclusion: almost every distinct value in TabB.C1 also appears in TabA.C1.
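The arithmetic in this example can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Arisyn's implementation: the function name `inclusion_metrics` is hypothetical, nulls are assumed to be `None` and excluded from the value sets, and `inclusion_ratio` is assumed to be the overlap divided by the smaller distinct count, which matches the 90 / 100 figure above.

```python
def inclusion_metrics(col_a, col_b):
    """Compute the four metrics for two columns given as lists of values.

    Hypothetical helper: the metric names come from the post, the
    formulas are assumptions inferred from its worked example.
    """
    # Assumption: nulls are None and do not count as values.
    set_a = {v for v in col_a if v is not None}
    set_b = {v for v in col_b if v is not None}

    co_occure = len(set_a & set_b)            # distinct values present in both columns
    smaller = min(len(set_a), len(set_b))     # the candidate "contained" side

    return {
        "null_row_num": (sum(v is None for v in col_a),
                         sum(v is None for v in col_b)),
        "distinct_num": (len(set_a), len(set_b)),
        "co_occure": co_occure,
        "inclusion_ratio": co_occure / smaller if smaller else 0.0,
    }


# Synthetic data shaped like the example: 10,000 vs 100 distinct values, 90 shared.
tab_a_c1 = list(range(10_000))
tab_b_c1 = list(range(90)) + list(range(20_000, 20_010))

metrics = inclusion_metrics(tab_a_c1, tab_b_c1)
print(metrics["inclusion_ratio"])  # 0.9
```

Dividing by the smaller distinct count makes the ratio answer one question: what share of the smaller column's values is covered by the larger one.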

This approach ignores table and column names.
It analyzes how the actual values behave.

When scaled across datasets, this becomes automated relationship discovery.
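To make the scaling idea concrete, here is a rough sketch that checks every pair of columns across a set of tables and reports the pairs whose inclusion ratio clears a threshold. Everything in it is an assumption for illustration: the function name `discover_relationships`, the in-memory `tables` layout, the sample data, and the 0.9 cutoff (borrowed from the example above) are not taken from Arisyn.

```python
from itertools import combinations


def discover_relationships(tables, threshold=0.9):
    """Return (contained_column, containing_column, ratio) for likely inclusions.

    tables: {table_name: {column_name: [values]}}  (hypothetical layout)
    """
    # Build one set of non-null distinct values per column.
    columns = [
        (f"{table}.{column}", {v for v in values if v is not None})
        for table, cols in tables.items()
        for column, values in cols.items()
    ]

    found = []
    for left, right in combinations(columns, 2):
        # Orient the pair so the column with fewer distinct values comes first.
        (child, child_set), (parent, parent_set) = sorted(
            [left, right], key=lambda pair: len(pair[1])
        )
        if not child_set:
            continue  # an all-null column cannot be included in anything
        ratio = len(child_set & parent_set) / len(child_set)
        if ratio >= threshold:
            found.append((child, parent, ratio))
    return found


# The column names below deliberately share nothing; only the values line up.
tables = {
    "orders": {"cust_ref": [1, 2, 2, 3, None], "amount": [100, 250, 250, 975, 310]},
    "customers": {"id": [1, 2, 3, 4, 5], "name": ["a", "b", "c", "d", "e"]},
}

relationships = discover_relationships(tables)
print(relationships)  # [('orders.cust_ref', 'customers.id', 1.0)]
```

Comparing every pair is quadratic in the number of columns, so a real system would likely prune candidates first, for example by data type or distinct count, before computing overlaps.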
