<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Arisyn</title>
    <description>The latest articles on DEV Community by Arisyn (@arisyn).</description>
    <link>https://dev.to/arisyn</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3685401%2F8d044c1d-cb24-4d8a-8488-0ead8e9b0166.png</url>
      <title>DEV Community: Arisyn</title>
      <link>https://dev.to/arisyn</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/arisyn"/>
    <language>en</language>
    <item>
      <title>Building Trusted Cross-Database NL2SQL: How IntaLink Unlocks Hidden Data Relationships</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Fri, 05 Jun 2026 14:30:00 +0000</pubDate>
      <link>https://dev.to/arisyn/building-trusted-cross-database-nl2sql-how-intalink-unlocks-hidden-data-relationships-3mf</link>
      <guid>https://dev.to/arisyn/building-trusted-cross-database-nl2sql-how-intalink-unlocks-hidden-data-relationships-3mf</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F13mi0r1plx5dd6en84mz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F13mi0r1plx5dd6en84mz.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Last week, Alex, a data engineer at a mid-sized retail chain, got a frantic call from the marketing team. The AI-generated SQL query for their "national online vs. offline sales comparison" report was off by nearly 20%—a discrepancy large enough to derail their quarterly strategy meeting. After hours of debugging, Alex found the root cause: the NL2SQL tool had naively summed "transaction amount" from the e-commerce database and "actual collected amount" from the in-store POS system, ignoring that one included sales tax and the other didn’t. Worse, the tool failed to recognize the correct cross-database relationship between user IDs in the two systems, leading to misaligned transaction records. This scenario isn’t an anomaly; it’s a daily reality for data teams grappling with the promise and pitfalls of cross-database intelligent querying.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Trust Crisis in Cross-Database NL2SQL&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;As enterprises accelerate digital transformation, data silos have become the norm. Critical business data lives across MySQL, Hive, ClickHouse, and cloud data warehouses, with each system serving a specific operational or analytical purpose. Business teams no longer ask for simple single-database reports like "this month’s sales"; they demand complex cross-database analyses such as "how online user conversion rates correlate with in-store inventory levels." &lt;/p&gt;

&lt;p&gt;NL2SQL (natural language to SQL) was supposed to bridge the gap between business users and raw data, eliminating the need for technical teams to write custom queries for every request. But cross-database use cases have exposed a critical flaw: according to a recent industry survey, over 65% of enterprises report that cross-database NL2SQL queries produce logical errors that make results unfit for business decision-making. This trust deficit stems from two deep-seated challenges in multi-source data management.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Challenge 1: Manual Cross-Database Relationship Maintenance Is Unsustainable&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The relationships between tables, field mappings, and business calibers across databases are often scattered in outdated documentation or locked in data engineers’ institutional knowledge. When a new CRM system launches or a data warehouse is updated, engineers spend 3–5 manual days per source mapping relationships, identifying hidden links like matching user IDs (labeled as &lt;code&gt;uid&lt;/code&gt;, &lt;code&gt;user_id&lt;/code&gt;, or &lt;code&gt;customer_id&lt;/code&gt; across systems) and documenting caliber rules (e.g., whether "sales amount" includes tax). &lt;/p&gt;

&lt;p&gt;This process is not only time-consuming but also error-prone. Hidden relationships are often missed, and as business needs evolve, manually maintained relationship tables quickly become obsolete. Data teams are trapped in a vicious cycle: map relationships, watch them become outdated, then re-map—wasting valuable hours that could be spent on high-impact data modeling or analysis.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Challenge 2: NL2SQL Tools Lack a Trusted Data Foundation&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Most NL2SQL solutions rely solely on single-database schema and field names to generate queries, with no visibility into cross-database lineage or semantic relationships. When a user asks a cross-database question, the AI defaults to literal keyword matching, leading to flawed logic: summing incompatible amount fields, joining tables on incorrect keys, or ignoring data transformation rules that change field meanings along the data pipeline. These errors erode business users’ trust in intelligent querying, forcing them to revert to slow, manual requests from data teams.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Technical Truth: Cross-Database NL2SQL Depends on Trusted Data Relationships&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The core problem with cross-database NL2SQL isn’t a failure of AI semantics—it’s a lack of trusted, actionable data relationships. Without accurate table joins, field lineage, and semantic mappings, AI cannot understand the business logic behind multi-source data, leading to hallucinations and incorrect queries.&lt;/p&gt;

&lt;p&gt;Traditional metadata management tools passively collect schema information but cannot proactively discover hidden cross-database relationships. Meanwhile, AI-only NL2SQL tools attempt to compensate with large language model (LLM) semantic understanding, but without grounding in real data relationships, LLMs amplify hallucinations, making cross-database queries even more unreliable.&lt;/p&gt;

&lt;p&gt;This is where IntaLink steps in: it builds an automatic, trusted foundation of multi-source data relationships that addresses these gaps. Here’s how it works:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Unified Metadata Collection&lt;/strong&gt;: IntaLink’s built-in engine connects to all enterprise data sources, gathering schema details, field attributes, and basic metadata in a centralized repository.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Intelligent Relationship Discovery&lt;/strong&gt;: Using a multi-dimensional algorithm, IntaLink identifies cross-database &lt;br&gt;
relationships by analyzing field name similarity, data type matches, sample value distributions, and business rules (like unique user ID constraints). For example, it can automatically link an e-commerce order table to a logistics waybill table via &lt;code&gt;order_id&lt;/code&gt;, even if the fields are named differently across systems.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;End-to-End Data Lineage&lt;/strong&gt;: IntaLink tracks data from its source through every transformation, cleaning, and aggregation step. It records caliber changes (e.g., when a raw "transaction amount" is adjusted to exclude tax) and processing rules, forming a complete, traceable data relationship graph.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;When paired with Arisyn, this foundation transforms cross-database NL2SQL. IntaLink’s relationship graph acts as Arisyn’s "knowledge base": when a user asks, "What’s the distribution of delivery times for online orders?" Arisyn first uses IntaLink’s graph to confirm the correct join between the order and waybill tables. It then leverages lineage data to validate that "delivery time" is calculated as &lt;code&gt;sign-off_time - dispatch_time&lt;/code&gt;, not a mismatched field like &lt;code&gt;order_creation_time&lt;/code&gt;. The result is an accurate cross-database SQL query that aligns with business logic.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Delivering Real Value with IntaLink and Arisyn&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Freeing Data Engineers from Repetitive Work&lt;/strong&gt;&lt;br&gt;
IntaLink’s automated relationship discovery eliminates the need for manual cross-database mapping, identifying over 90% of valid relationships automatically. This cuts the time to onboard a new data source from 3–5 days to just a few hours. For one regional retail client, IntaLink reduced the time data engineers spent maintaining cross-database relationships by 70%, allowing them to shift focus to building predictive models for inventory optimization and customer segmentation.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Boosting Cross-Database NL2SQL Accuracy&lt;/strong&gt;&lt;br&gt;
By grounding Arisyn’s NL2SQL in IntaLink’s trusted relationship graph and lineage data, cross-database query accuracy jumps from an average of 60% to over 90%. Business users no longer need to second-guess results: every SQL query is traceable back to its source, with clear visibility into how fields are calculated and joined. This trust enables teams to make faster, data-driven decisions without waiting for data engineers to validate every request.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Unifying Semantics to Eliminate Cross-Team Disputes&lt;/strong&gt;&lt;br&gt;
IntaLink’s metadata management capabilities, paired with Arisyn’s dual semantic layer, align technical and business teams on data definitions. For example, the term "sales amount" is standardized across all databases, with clear labels indicating whether it includes tax, shipping fees, or discounts. This eliminates the common friction where marketing and finance teams argue over conflicting metrics, ensuring everyone works from the same trusted data source.&lt;/p&gt;

&lt;p&gt;Conclusion: Data Relationships Are the Invisible Foundation of Cross-Database Intelligence&lt;br&gt;
Cross-database intelligent analysis isn’t just about generating SQL from natural language—it’s about enabling AI to understand the business logic that connects data across systems. IntaLink fills the critical gap by building a trusted, automated network of cross-database relationships, giving Arisyn the context it needs to deliver accurate, reliable queries.&lt;/p&gt;

&lt;p&gt;When enterprises stop wasting hours on manual relationship maintenance and business users can confidently rely on cross-database NL2SQL results, multi-source data stops being a liability and becomes a strategic asset. The true value of enterprise data is unlocked when teams can seamlessly connect siloed information, uncover hidden insights, and drive decisions without being hindered by data relationship fog.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>agents</category>
      <category>data</category>
    </item>
    <item>
      <title>Why Enterprise Smart Analytics Can’t Succeed Without Data Relationships + Semantic Governance Infrastructure</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Thu, 04 Jun 2026 06:39:41 +0000</pubDate>
      <link>https://dev.to/arisyn/why-enterprise-smart-analytics-cant-succeed-without-data-relationships-semantic-governance-2772</link>
      <guid>https://dev.to/arisyn/why-enterprise-smart-analytics-cant-succeed-without-data-relationships-semantic-governance-2772</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyqx6xi0kugv438udlw7m.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyqx6xi0kugv438udlw7m.png" alt=" " width="800" height="533"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We’ve all seen the pitch: Plug an LLM into your data warehouse, and suddenly every stakeholder can ask natural language questions like “What’s our Q3 customer lifetime value?” and get instant, accurate answers. But when your team tries to deploy this, you hit a wall: the LLM returns numbers that don’t match the finance team’s report, or it confuses “active users” (sales defines it as 30-day engagement; marketing uses 7 days).&lt;/p&gt;

&lt;p&gt;The problem isn’t the LLM itself. It’s that your enterprise is missing a critical layer of infrastructure: trusted data relationships and semantic governance. Without this, even the most powerful AI tools are shooting in the dark.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Hidden Bottleneck: Not the Model, but Unstructured Data Context&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Enterprise data is messy. Legacy systems, siloed teams, merged datasets, and inconsistent naming conventions create a labyrinth of disconnected tables and ambiguous terms. LLMs excel at pattern recognition, but they don’t know your business’s unique rules: which orders count toward revenue (completed, not canceled), how to calculate churn (90-day inactivity vs. 30), or that “customer ID” in the sales table maps to “client number” in the finance system.&lt;/p&gt;

&lt;p&gt;When you skip building this context layer, your AI-powered analytics tool will:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Generate queries that join unrelated tables, leading to nonsensical insights.&lt;/li&gt;
&lt;li&gt;Use conflicting business definitions, causing cross-team disputes over metrics.&lt;/li&gt;
&lt;li&gt;Ignore critical filters (like excluding test accounts) that make data actionable.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The bottleneck isn’t model performance—it’s the lack of structured, trusted context that tells AI how to interpret your data.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Data Relationships: The Skeleton of Trusted Analytics&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Data relationships go beyond basic foreign keys in a database. They’re the business rules that define how data points connect and interact. For example:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A customer’s lifetime value (CLV) should only include completed orders, excluding returns and discounts.&lt;/li&gt;
&lt;li&gt;Churn rate is calculated from users who haven’t logged in for 90 days and have an active subscription.&lt;/li&gt;
&lt;li&gt;Monthly recurring revenue (MRR) excludes one-time setup fees and trial accounts.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Without documenting these relationships, your LLM has no way to know which joins and filters to apply. A common pain point: a sales team runs an LLM query for “Q3 CLV” and gets a number 20% higher than finance’s report, because the LLM included canceled orders.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Enterprise Challenges &amp;amp; Implementation Thinking&lt;/strong&gt;&lt;br&gt;
Legacy systems often don’t have built-in relationship documentation, and siloed teams maintain their own ad-hoc joins. To fix this:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Start with high-priority datasets (customer, order, revenue) and map both technical (database joins) and business (rule-based) relationships.&lt;/li&gt;
&lt;li&gt;Build a data relationship graph that visualizes these connections—this makes it easy for AI tools to traverse and understand dependencies.&lt;/li&gt;
&lt;li&gt;Store this graph in a centralized metadata catalog so all teams (and AI tools) can access the same trusted relationships.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;Semantic Governance: The Common Language for Data&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Semantic governance is about creating a single source of truth for business terms. It’s not just a glossary—it’s a machine-readable layer that defines exactly what each metric means, where it comes from, and how it’s calculated.&lt;/p&gt;

&lt;p&gt;For example, “active user” shouldn’t be left to interpretation. A semantic layer would specify:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Definition: A user who has logged in and completed at least one action (purchase, content view) in the past 7 days.&lt;/li&gt;
&lt;li&gt;Data source: Combined user activity logs from the app and website.&lt;/li&gt;
&lt;li&gt;Exclusions: Test accounts, users with expired subscriptions.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Without this, your LLM might pull data from the wrong source or use an outdated definition. This leads to inconsistent insights that erode stakeholder trust in your smart analytics tool.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Enterprise Challenges &amp;amp; Implementation Thinking&lt;/strong&gt;&lt;br&gt;
Cross-team alignment is the biggest hurdle—sales, finance, and marketing all have their own definitions for key metrics. To overcome this:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Host workshops with stakeholders to co-create definitions for high-impact metrics (CLV, MRR, churn).&lt;/li&gt;
&lt;li&gt;Store these definitions in a semantic catalog with version control, so you can track changes and roll back if needed.&lt;/li&gt;
&lt;li&gt;Integrate the catalog with your AI/BI tools, so LLMs automatically reference the latest definitions when generating queries.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;Practical Steps to Build This Infrastructure&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;You don’t need to overhaul your entire data stack to implement this layer. Start small with these actionable steps:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Audit Your Data Assets&lt;/strong&gt;: Map existing tables, identify key relationships, and document gaps (e.g., missing links between customer and subscription data).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Co-Create a Semantic Glossary&lt;/strong&gt;: Work with business teams to define 5-10 core metrics first—this builds momentum and demonstrates value quickly.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Build a Lightweight Semantic Layer&lt;/strong&gt;: Use open-source tools or internal frameworks to translate business terms into standardized SQL queries or data joins.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Integrate with AI Tools&lt;/strong&gt;: Connect your semantic layer and relationship graph to your LLM-powered analytics tool, so it can pull trusted context before generating insights.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Enforce Governance&lt;/strong&gt;: Set up automated checks to ensure new data assets adhere to your relationship and semantic rules (e.g., alerting teams if a new “MRR” field doesn’t match the standardized definition).&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;The Business Impact: Trusted Insights, Faster Decisions&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;When you invest in this infrastructure, you’re not just fixing AI accuracy—you’re solving long-standing enterprise data pain points:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Reduced disputes&lt;/strong&gt;: Teams no longer waste hours arguing over metric definitions.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Faster time to insight&lt;/strong&gt;: Stakeholders can trust AI-generated answers without manual validation.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Scalable AI&lt;/strong&gt;: As you add more datasets or AI tools, your context layer ensures consistency across the board.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Take a retail company that struggled with inconsistent CLV reports. After building a relationship graph linking customers to completed orders (excluding returns) and a semantic layer standardizing CLV calculations, their LLM tool started generating cross-team aligned insights. This reduced data dispute resolution time by 60% and helped the marketing team target high-value customers more effectively.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Wrap-Up&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Smart analytics isn’t about plugging in the latest LLM and calling it a day. It’s about building the foundation that makes AI useful. Data relationships and semantic governance are the unsung heroes that turn messy enterprise data into trusted, actionable insights.&lt;/p&gt;

&lt;p&gt;Before you invest in the next shiny AI tool, ask yourself: Do we have a clear map of how our data connects, and a common language for what our metrics mean? If not, that’s where your next project should start.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>data</category>
      <category>llm</category>
      <category>semantic</category>
    </item>
    <item>
      <title>From "Afraid to Use" to "Confident to Act": Transparent Query Reasoning Solves NL2SQL Trust Gaps</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Tue, 02 Jun 2026 15:45:00 +0000</pubDate>
      <link>https://dev.to/arisyn/from-afraid-to-use-to-confident-to-act-transparent-query-reasoning-solves-nl2sql-trust-gaps-4cmd</link>
      <guid>https://dev.to/arisyn/from-afraid-to-use-to-confident-to-act-transparent-query-reasoning-solves-nl2sql-trust-gaps-4cmd</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkhlj466k6334qhqsn8w4.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkhlj466k6334qhqsn8w4.png" alt=" " width="800" height="446"&gt;&lt;/a&gt;Last month, during a visit to a mid-sized retail enterprise, I sat down with Lisa Chen, the head of regional operations. She leaned back, frustrated, and shared a familiar pain point: “We rolled out an NL2SQL tool to let our team query data without bugging data analysts. But when I asked for ‘2025 Q2 in-store member sales in East China,’ the result was 15% lower than my manual spreadsheet count. The tech team said the AI-generated SQL was correct, but I can’t read SQL to verify. Now I’d rather wait half a day for an analyst’s report than risk making a bad decision with AI data.”&lt;/p&gt;

&lt;p&gt;Lisa’s frustration isn’t an anomaly. As large language models (LLMs) have become mainstream, natural language to SQL (NL2SQL) has emerged as a promising solution to democratize enterprise data access. Yet many organizations face a paradox: NL2SQL tools have high deployment rates, but low actual adoption, because business users simply don’t trust the results.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The NL2SQL Trust Gap: A Growing Enterprise Challenge&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Gartner’s 2024 report underscores this disconnect: over 60% of enterprises have deployed NL2SQL tools, but only 28% of business users can independently run queries and trust the outcomes. The root cause lies in the “black box” nature of most NL2SQL systems. When a user inputs a natural language question, they get a numerical result or table back – but no visibility into how the AI translated their request into a SQL query, which tables or fields it used, or whether the logic aligns with business rules.&lt;/p&gt;

&lt;p&gt;For years, organizations focused on boosting NL2SQL accuracy as the fix. But in real-world enterprise environments, this approach hits a wall: complex data models with dozens of interconnected tables, ambiguous business terminology (like “sales” that could mean gross vs. net), and evolving data schemas make 100% accuracy an unattainable goal. Worse, even when accuracy is high, users remain skeptical if they can’t see the “why” behind the result. This is where transparent query reasoning becomes the critical bridge between NL2SQL’s technical potential and its practical business value.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Three Core Barriers to NL2SQL Trust&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;To understand why users hesitate to rely on NL2SQL, we need to unpack three persistent trust barriers that business teams face daily:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The Reasoning Logic Black Box&lt;/strong&gt;: When a user asks for “member sales,” they don’t know if the AI mapped that term to the right field (e.g., actual paid amount vs. gross sales), how it joined the sales order table with the member profile table, or if it applied the correct filters for in-store transactions. If the result conflicts with their expectations, they can’t pinpoint where the breakdown happened – leading to distrust instead of action.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Unvalidated SQL Generation&lt;/strong&gt;: LLMs can generate syntactically correct SQL that still violates business logic. For example, an AI might incorrectly join a non-member order table to the sales data, or use the wrong aggregation function for recurring subscriptions. Since most business users lack SQL expertise, they can’t spot these flaws, forcing them to loop in data analysts for validation – defeating the purpose of democratizing data access and adding unnecessary communication overhead.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Ambiguous Result Boundaries&lt;/strong&gt;: A number without context is meaningless. Did the “member sales” figure include coupon discounts? Does it cover franchise stores or only direct locations? Without clear explanations of data sources, timeframes, and business rules, users can’t be sure if the result applies to their specific decision-making scenario. This ambiguity leads to hesitation, even if the underlying data is correct.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;Transparent Reasoning: Turning NL2SQL from Black Box to White Box&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Breaking through these barriers requires shifting from a “trust the AI” mindset to a “understand the AI” mindset. The solution lies in making the entire NL2SQL process transparent, verifiable, and contextual:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Visualize the Reasoning Chain&lt;/strong&gt;: Instead of hiding the AI’s thought process, show users every step: how their natural language question is parsed into key business dimensions (time, region, metric), how those dimensions map to semantic layers and underlying data tables, and how the final SQL query is constructed. This turns a black box into a “white box” where users can follow the logic and flag inconsistencies.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Automate SQL Validation&lt;/strong&gt;: Before executing a query, validate the generated SQL against the enterprise’s data governance rules and data lineage. This includes checking for logical errors (like incorrect table joins) and ensuring alignment with approved business metrics. If issues are found, surface them to users in plain language, not technical jargon.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Clarify Result Boundaries&lt;/strong&gt;: Alongside the query output, provide clear, actionable context: data source, timeframe, metric definition, filters applied, and any exclusions (e.g., “does not include franchise stores”). This helps users immediately understand the scope and limitations of the result.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;*&lt;em&gt;Arisyn + Intalink: Building a Trusted NL2SQL Ecosystem&lt;br&gt;
*&lt;/em&gt;&lt;br&gt;
Building this level of transparency requires a unified system that combines robust data governance foundations with intelligent query capabilities – exactly what the Arisyn and Intalink ecosystem delivers.&lt;/p&gt;

&lt;p&gt;Intalink serves as the trusted data relationship base, laying the groundwork for transparent NL2SQL. Its metadata management, automatic relationship discovery, and lineage analysis capabilities create a comprehensive “data map” of the enterprise’s data assets. For example, Intalink can identify that “member sales” corresponds to the &lt;code&gt;actual_paid_amount&lt;/code&gt; field in the &lt;code&gt;sales_orders&lt;/code&gt; table, and that this field must be joined with the &lt;code&gt;member_profiles&lt;/code&gt; table to filter for registered members. It also ensures that these relationships align with established business rules, eliminating invalid joins that could skew results.&lt;/p&gt;

&lt;p&gt;On top of this foundation, Arisyn delivers the transparent query capabilities that address business users’ trust concerns:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Full Query Reasoning Visualization&lt;/strong&gt;: When a user inputs a natural language question, Arisyn breaks down the reasoning process into plain-language steps. For Lisa’s query, it would show: “Your request is parsed into [Time: 2025 Q2, Region: East China, Channel: In-store, Metric: Member Sales] → mapped to the semantic layer’s Member Consumption metric → joins &lt;code&gt;sales_orders&lt;/code&gt;, &lt;code&gt;region_dimensions&lt;/code&gt;, and &lt;code&gt;member_profiles&lt;/code&gt; tables → SQL logic: group by region, filter for in-store locations, sum &lt;code&gt;actual_paid_amount&lt;/code&gt; for registered members.” Even users without SQL expertise can follow this chain to confirm that the AI understood their request correctly.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Intelligent SQL Generation &amp;amp; Validation&lt;/strong&gt;: After generating the SQL query, Arisyn leverages Intalink’s lineage data to validate the logic. For example, if the AI accidentally tries to join &lt;code&gt;sales_orders&lt;/code&gt; with a &lt;code&gt;guest_orders&lt;/code&gt; table, Arisyn flags this issue and asks: “This query includes non-member orders. Would you like to adjust to use the &lt;code&gt;member_profiles&lt;/code&gt; table instead?” It also compares the generated SQL to a library of pre-validated, analyst-approved queries to ensure alignment with business standards.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Proactive Result Boundary Explanation&lt;/strong&gt;: When presenting the final result, Arisyn automatically appends a context panel: “Data Source: Sales Order System (April 1 – June 30, 2025); Metric Definition: Member actual paid amount (excludes coupon discounts); Scope: East China direct stores only (excludes franchises).” This eliminates back-and-forth between business users and analysts to confirm data context.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Additionally, Arisyn’s dual semantic layer governance aligns business terminology with data models, reducing the ambiguity that often leads to NL2SQL errors. For example, it ensures that “sales” is consistently mapped to the correct field based on the user’s department (e.g., net sales for finance, gross sales for operations).&lt;/p&gt;

&lt;p&gt;*&lt;em&gt;Conclusion: Controllable Trust is the Key to NL2SQL Success&lt;br&gt;
*&lt;/em&gt;&lt;br&gt;
NL2SQL’s promise is to put data-driven decision-making into the hands of every business user. But that promise can only be realized if users trust the results. Transparent query reasoning isn’t about eliminating every possible AI error – it’s about giving users the visibility and control to verify, adjust, and confidently act on the data.&lt;/p&gt;

&lt;p&gt;The Arisyn and Intalink ecosystem creates an end-to-end solution that turns NL2SQL from a feared black box into a trusted tool. By combining a robust data relationship foundation with transparent reasoning, automated validation, and contextual result explanations, it empowers business users like Lisa to move from “afraid to use” to “confident to act.” In doing so, it unlocks the true value of enterprise data, enabling faster, more informed decisions without relying on overstretched data teams.&lt;/p&gt;

</description>
      <category>nl2sql</category>
      <category>ai</category>
      <category>llm</category>
      <category>data</category>
    </item>
    <item>
      <title>Before You Deploy AI for Enterprise Analytics, Build This Critical Infrastructure Layer: Data Relationships + Semantic Governance</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Thu, 21 May 2026 15:20:00 +0000</pubDate>
      <link>https://dev.to/arisyn/before-you-deploy-ai-for-enterprise-analytics-build-this-critical-infrastructure-layer-data-49a6</link>
      <guid>https://dev.to/arisyn/before-you-deploy-ai-for-enterprise-analytics-build-this-critical-infrastructure-layer-data-49a6</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgkdnthwbiie3ygzc83e2.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgkdnthwbiie3ygzc83e2.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Like many data engineers, I’ve watched enterprises rush to deploy LLMs for smart analytics: plugging in a natural-language query tool, connecting it to their data lake, and expecting instant, accurate insights. But more often than not, the result is frustration: the AI generates queries that join incompatible tables, uses outdated definitions for key metrics (like “monthly active users” differing between sales and marketing), or returns insights that don’t align with business reality.&lt;/p&gt;

&lt;p&gt;The mistake? Skipping the critical infrastructure layer that makes smart analytics trustworthy: data relationships and semantic governance. Let’s break down why this layer is non-negotiable, what it entails, and how to build it for your enterprise.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Hidden Bottleneck in Smart Analytics
&lt;/h2&gt;

&lt;p&gt;When teams hit roadblocks with AI-powered analytics, they often blame the model—“it’s not accurate enough” or “it doesn’t understand our business.” But the real issue is almost never the model itself. It’s the lack of context-rich, consistent data foundations.&lt;/p&gt;

&lt;p&gt;Consider these common enterprise pain points:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A retail company’s LLM generates a report on “customer lifetime value” but joins sales data with outdated support system records because no one documented that &lt;code&gt;customer_id&lt;/code&gt; in the CRM maps to &lt;code&gt;client_number&lt;/code&gt; in the support tool.&lt;/li&gt;
&lt;li&gt;A finance team spends three weeks reconciling revenue numbers because sales uses “gross revenue” while finance uses “net revenue”—and the AI has no way to distinguish between the two.&lt;/li&gt;
&lt;li&gt;An analytics engineer spends 70% of their time cleaning data instead of building insights, because there’s no clear lineage for key datasets (e.g., where does this “user_segment” field come from, and how is it transformed?).&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These problems stem from missing two core components: trusted data relationships that connect entities across systems, and semantic governance that standardizes how business terms are defined and used. Without them, even the most powerful LLM can’t produce reliable, actionable insights.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Exactly Is This Infrastructure Layer?
&lt;/h2&gt;

&lt;p&gt;Let’s break down the two pillars of this critical layer:&lt;/p&gt;

&lt;h3&gt;
  
  
  1. Data Relationships: Connecting the Dots Across Silos
&lt;/h3&gt;

&lt;p&gt;Data relationships aren’t just foreign keys in a database. They’re the contextual connections between entities (customers, orders, products) across every system in your enterprise. This includes:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Entity resolution&lt;/strong&gt;: Mapping the same entity across datasets (e.g., &lt;code&gt;customer_123&lt;/code&gt; in sales = &lt;code&gt;client_456&lt;/code&gt; in support).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data lineage&lt;/strong&gt;: Tracking where data comes from, how it’s transformed, and where it flows (e.g., the “monthly_revenue” metric in the data warehouse is derived from raw sales data minus returns in the ERP).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Contextual links&lt;/strong&gt;: Documenting business-specific connections (e.g., “Order 789 is linked to Campaign X, which targeted Segment Y”).&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;For AI tools, this layer acts as a roadmap: it tells the model which tables to join, how to resolve conflicting entity IDs, and how to trace insights back to their source. Without it, the AI is guessing—and guessing leads to wrong answers.&lt;/p&gt;

&lt;h3&gt;
  
  
  2. Semantic Governance: Aligning Technical Data with Business Context
&lt;/h3&gt;

&lt;p&gt;Semantic governance is the bridge between technical data fields and business language. It’s a living system that:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Defines standard business terms (e.g., “active user” = a user who logged in at least once in the last 30 days).&lt;/li&gt;
&lt;li&gt;Maps technical fields to these terms (e.g., &lt;code&gt;login_count_last_30d&lt;/code&gt; in the user database maps to “active user”).&lt;/li&gt;
&lt;li&gt;Enforces these definitions across all teams and tools (so sales, marketing, and finance all use the same “revenue” metric).&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This layer eliminates the “language barrier” between technical systems and business stakeholders—and between AI tools and the real world. When an LLM receives a query like “show me monthly active users for Q3,” it knows exactly which data fields to pull and how to calculate the metric correctly.&lt;/p&gt;

&lt;h2&gt;
  
  
  Practical Steps to Build This Layer
&lt;/h2&gt;

&lt;p&gt;Building this infrastructure doesn’t require a complete overhaul of your data stack. Start with these actionable steps:&lt;/p&gt;

&lt;h3&gt;
  
  
  For Data Relationships:
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Prioritize core entities&lt;/strong&gt;: Focus on the 3-5 entities that drive your most critical analytics (e.g., customers, orders, products). Map how these entities appear across your CRM, ERP, data warehouse, and other systems.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Automate + supplement lineage&lt;/strong&gt;: Use open-source tools like Apache Atlas or lineage trackers integrated with your data pipeline (e.g., dbt’s lineage feature) to capture automated lineage. Then add human context (e.g., “This &lt;code&gt;user_segment&lt;/code&gt; field is updated weekly via the marketing segmentation script”).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Store relationships in a graph or metadata platform&lt;/strong&gt;: Use a graph database (like Neo4j) or centralized metadata tool to make relationships accessible to AI tools. This lets the LLM query relationships dynamically instead of hardcoding them.&lt;/li&gt;
&lt;/ol&gt;

&lt;h3&gt;
  
  
  For Semantic Governance:
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Co-create a business glossary&lt;/strong&gt;: Involve data engineers, analysts, and business stakeholders to define terms. Avoid top-down mandates—make sure definitions reflect how the business actually uses the terms (e.g., “revenue” should be agreed upon by sales and finance).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Automate term mapping&lt;/strong&gt;: Use tools that scan your data catalog to suggest mappings between technical fields and glossary terms. For example, if your sales table has a &lt;code&gt;gross_rev&lt;/code&gt; field, map it to the glossary term “Gross Revenue.”&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Implement review workflows&lt;/strong&gt;: Set up a process to update terms as business needs change (e.g., if the definition of “active user” shifts, notify all teams and update the mappings in your glossary).&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Addressing Enterprise-Specific Challenges
&lt;/h2&gt;

&lt;p&gt;Building this layer comes with unique hurdles for large organizations:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Resistance to change&lt;/strong&gt;: Teams may be attached to their own definitions. Solution: Start with a high-impact use case (e.g., unifying sales and marketing metrics for quarterly reports) to show tangible value.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Scaling across teams&lt;/strong&gt;: With hundreds of systems, standardizing everything at once is impossible. Solution: Use a federated approach—let teams manage their own terms, but align on core entities and metrics.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Keeping the layer dynamic&lt;/strong&gt;: Business needs evolve, so your infrastructure can’t be static. Solution: Integrate governance into your CI/CD pipeline—when a new dataset is deployed, automatically check if it aligns with existing semantic standards.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Wrap-Up
&lt;/h2&gt;

&lt;p&gt;Smart analytics isn’t just about deploying the latest LLM—it’s about building a foundation where data is trusted, consistent, and context-rich. Data relationships and semantic governance aren’t just “nice-to-have” infrastructure; they’re the backbone that makes AI-generated insights reliable enough to drive business decisions.&lt;/p&gt;

&lt;p&gt;Before you invest in another AI tool, take a step back: assess how well your enterprise understands its data relationships and enforces semantic standards. Building this layer will save you hours of cleanup, reduce errors in analytics, and unlock the true potential of smart analytics for your business.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>data</category>
      <category>semantic</category>
      <category>governance</category>
    </item>
    <item>
      <title>Enterprise Data Intelligence in the AI Era: The Hard Part Is Not Choosing a Tool</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Mon, 18 May 2026 15:25:00 +0000</pubDate>
      <link>https://dev.to/arisyn/enterprise-data-intelligence-in-the-ai-era-the-hard-part-is-not-choosing-a-tool-1fa9</link>
      <guid>https://dev.to/arisyn/enterprise-data-intelligence-in-the-ai-era-the-hard-part-is-not-choosing-a-tool-1fa9</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F71x3t3wab2c0xh8kqi21.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F71x3t3wab2c0xh8kqi21.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;Over the past year, the most interesting part of AI has moved from model demos to enterprise systems.&lt;/p&gt;

&lt;p&gt;TechCrunch’s AI coverage spans generative AI, large language models, speech, vision, predictive analytics, AI companies, and ethical questions. Behind the daily news cycle, one trend is becoming clear: AI is moving from isolated capabilities into enterprise workflows.&lt;/p&gt;

&lt;p&gt;As a CTO, I am less interested in which model was released this week and more interested in a harder question:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Can AI actually enter the real data workflows of an enterprise?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;That question is still unresolved.&lt;/p&gt;

&lt;p&gt;An enterprise is not a chat window. It is a complex machine made of data sources, permissions, business processes, metrics, systems, teams, and accountability boundaries. Even the strongest model will struggle if it cannot understand the company’s data structure, business semantics, and governance rules.&lt;/p&gt;

&lt;p&gt;OpenAI COO Brad Lightcap made a similar point in a TechCrunch interview, saying that enterprise AI has not yet really penetrated business processes because enterprises are complex organizations with many people, teams, systems, tools, and layers of context.&lt;/p&gt;

&lt;p&gt;That is the reality.&lt;/p&gt;

&lt;p&gt;Bringing AI into the enterprise is not the same as connecting a chatbot to internal systems.&lt;/p&gt;

&lt;p&gt;The real question is whether the enterprise is ready to make its data world understandable to AI.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The first-principles question: what must AI understand?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;When companies start building AI data applications, they often begin with tool selection.&lt;/p&gt;

&lt;p&gt;Should we buy a BI tool?&lt;br&gt;
A data catalog?&lt;br&gt;
A semantic layer?&lt;br&gt;
An agent platform?&lt;br&gt;
An NL2SQL engine?&lt;br&gt;
A governance tool?&lt;br&gt;
A RAG system?&lt;br&gt;
A Copilot-style interface?&lt;/p&gt;

&lt;p&gt;These are valid questions, but they are not the first-principles question.&lt;/p&gt;

&lt;p&gt;The real question is:&lt;/p&gt;

&lt;p&gt;When a business user asks a question in natural language, how does the system move from that question to a trustworthy answer?&lt;/p&gt;

&lt;p&gt;Take this example:&lt;/p&gt;

&lt;p&gt;Which products contributed most to the profit decline of strategic customers in the East region this year?&lt;/p&gt;

&lt;p&gt;This question looks simple. It is not.&lt;/p&gt;

&lt;p&gt;It hides several layers of meaning:&lt;/p&gt;

&lt;p&gt;What does “this year” mean? Calendar year, fiscal year, or business reporting period?&lt;/p&gt;

&lt;p&gt;What is “East region”? Customer ownership region, sales territory, delivery region, or financial reporting region?&lt;/p&gt;

&lt;p&gt;What is a “strategic customer”? Revenue-based, contract-based, manually tagged, or account-tier based?&lt;/p&gt;

&lt;p&gt;What does “profit” mean? Gross profit, net profit, contract margin, project profit, or finance-adjusted profit?&lt;/p&gt;

&lt;p&gt;How are customers, products, contracts, orders, invoices, and profit detail tables connected?&lt;/p&gt;

&lt;p&gt;Does the current user have permission to access this data?&lt;/p&gt;

&lt;p&gt;If these questions are not answered systematically, AI can only guess.&lt;/p&gt;

&lt;p&gt;And in enterprise data intelligence, the most dangerous failure mode is not that the system is slow. It is that the system gives a fluent, confident, and wrong answer.&lt;/p&gt;

&lt;p&gt;Why traditional tools are not enough&lt;/p&gt;

&lt;p&gt;Traditional data tools are valuable, but most of them were designed for humans using data, not for AI understanding data.&lt;/p&gt;

&lt;p&gt;Data warehouses are good at storage, computation, and modeling.&lt;/p&gt;

&lt;p&gt;BI tools are good at reporting and visualization.&lt;/p&gt;

&lt;p&gt;Data catalogs are good at registering assets and metadata.&lt;/p&gt;

&lt;p&gt;Governance platforms are good at standards, permissions, quality, and compliance.&lt;/p&gt;

&lt;p&gt;ETL and ELT tools are good at data movement and transformation.&lt;/p&gt;

&lt;p&gt;These tools have supported enterprise digitization for years.&lt;/p&gt;

&lt;p&gt;But AI introduces a new requirement:&lt;/p&gt;

&lt;p&gt;In the past, humans read documentation, inspected schemas, and wrote SQL. Now AI needs to understand those things and act on them.&lt;/p&gt;

&lt;p&gt;That means enterprise data assets must not only be managed. They must become machine-understandable.&lt;/p&gt;

&lt;p&gt;Many traditional toolchains have structural gaps:&lt;/p&gt;

&lt;p&gt;Business semantics are disconnected from technical fields.&lt;/p&gt;

&lt;p&gt;Data lineage is disconnected from actual query paths.&lt;/p&gt;

&lt;p&gt;Metric definitions are disconnected from SQL generation.&lt;/p&gt;

&lt;p&gt;Permission systems are disconnected from AI tool usage.&lt;/p&gt;

&lt;p&gt;Data governance is disconnected from user-facing analytics.&lt;/p&gt;

&lt;p&gt;This is why many NL2SQL, intelligent query, and data agent projects work well in demos but break down in real business scenarios. They select the wrong tables. They infer the wrong fields. They generate unstable joins. They mix metric definitions. They ignore permission boundaries. They produce results that cannot be traced.&lt;/p&gt;

&lt;p&gt;The problem is not always that the model is weak.&lt;/p&gt;

&lt;p&gt;The problem is often that the enterprise has not provided enough reliable context.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The global trend: models must connect to enterprise context&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Recent enterprise AI coverage points toward the same conclusion: AI is no longer just about standalone models. It is about connecting models to enterprise data, tools, permissions, and workflows.&lt;/p&gt;

&lt;p&gt;TechCrunch noted in early 2026 that agents failed to live up to the hype in 2025 partly because it was hard to connect them to the systems where work actually happens. Protocols like MCP matter because they reduce the friction of connecting agents to databases, search engines, APIs, and external tools.&lt;/p&gt;

&lt;p&gt;Snowflake’s partnership with OpenAI reflects the same direction. TechCrunch reported that Snowflake customers would gain access to OpenAI models across major cloud providers, with the goal of building and deploying AI on top of trusted, secure, governed enterprise data.&lt;/p&gt;

&lt;p&gt;Glean is another example. TechCrunch described Glean’s strategy as becoming the connective layer between models and enterprise systems. Its CEO made the point directly: large language models are generic; they do not understand a company’s people, work, products, or internal context by themselves.&lt;/p&gt;

&lt;p&gt;The pattern is clear:&lt;/p&gt;

&lt;p&gt;Enterprise AI is not just a model race. It is a context engineering race.&lt;/p&gt;

&lt;p&gt;The companies that organize data, semantics, permissions, workflows, and tools into AI-readable context will have the better chance of turning AI into production capability.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Do not start with the chat interface&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Many enterprise data intelligence projects begin with an intelligent query interface.&lt;/p&gt;

&lt;p&gt;That is understandable. A chat interface is the easiest way to demonstrate AI.&lt;/p&gt;

&lt;p&gt;But from a CTO’s perspective, starting with the chat window is risky. The chat interface is only the entry point. It is not the capability.&lt;/p&gt;

&lt;p&gt;A more reliable implementation path has five layers.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Make data assets visible&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The enterprise must first know what data exists.&lt;/p&gt;

&lt;p&gt;This includes data sources, tables, fields, primary keys, row counts, update frequency, owners, quality status, and system ownership.&lt;/p&gt;

&lt;p&gt;Without this layer, AI does not know what it can use.&lt;/p&gt;

&lt;p&gt;Traditional catalogs and metadata platforms cover part of this, but AI needs a more structured and callable representation of fields, business objects, and data interfaces.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Make data relationships knowable&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Knowing which tables exist is not enough.&lt;/p&gt;

&lt;p&gt;The hard part of enterprise data is the relationship between tables.&lt;/p&gt;

&lt;p&gt;How does the customer table connect to the order table?&lt;/p&gt;

&lt;p&gt;How does the order table connect to invoices?&lt;/p&gt;

&lt;p&gt;How does a project connect to employee time records?&lt;/p&gt;

&lt;p&gt;Can a contract table directly connect to profit details?&lt;/p&gt;

&lt;p&gt;If not, which intermediate table is required?&lt;/p&gt;

&lt;p&gt;Traditionally, this knowledge lives in senior engineers’ heads, legacy SQL scripts, ETL jobs, and report logic.&lt;/p&gt;

&lt;p&gt;AI cannot rely on institutional memory. It needs structured relationship context.&lt;/p&gt;

&lt;p&gt;This is where a data relationship layer becomes important. In the Arisyn / Intalink architecture, Intalink is positioned as an enterprise data lineage and relationship discovery platform. Its documented capabilities include data source management, table management, relationship discovery, task execution, and relationship quality indicators such as co-occurrence count, distinct count, and inclusion ratio.&lt;/p&gt;

&lt;p&gt;The point is not to draw a nice lineage diagram.&lt;/p&gt;

&lt;p&gt;The point is to provide AI with a computable, verifiable, and callable map of how enterprise data connects.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Govern business semantics&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Data relationships explain how tables connect. They do not explain what the business means.&lt;/p&gt;

&lt;p&gt;Business users do not ask:&lt;/p&gt;

&lt;p&gt;SELECT SUM(amount) FROM fact_sales WHERE region = 'East';&lt;/p&gt;

&lt;p&gt;They ask:&lt;/p&gt;

&lt;p&gt;How are our strategic customers performing in the East region?&lt;/p&gt;

&lt;p&gt;Terms like “performance,” “strategic customer,” and “East region” are business concepts, not database columns.&lt;/p&gt;

&lt;p&gt;Enterprises need a semantic layer to manage metrics, dimensions, terminology, formulas, units, scope, versions, and governance rules.&lt;/p&gt;

&lt;p&gt;Arisyn is documented as an enterprise semantic-layer intelligent query engine. Its architecture includes natural language understanding, business semantic definitions, semantic mapping, terminology management, metric and dimension definitions, and version/gray-release management.&lt;/p&gt;

&lt;p&gt;A semantic layer does not exist to make terminology look organized.&lt;/p&gt;

&lt;p&gt;It exists to constrain AI before it generates SQL, selects data, or explains results.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. Make intelligent query explainable&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Once data assets, data relationships, and business semantics are in place, intelligent query finally has a reliable foundation.&lt;/p&gt;

&lt;p&gt;A trustworthy enterprise query system should not only return results. It should answer:&lt;/p&gt;

&lt;p&gt;Why were these tables selected?&lt;/p&gt;

&lt;p&gt;Why was this join path used?&lt;/p&gt;

&lt;p&gt;Which metric definition was applied?&lt;/p&gt;

&lt;p&gt;How was the SQL generated?&lt;/p&gt;

&lt;p&gt;What is the business definition of the result?&lt;/p&gt;

&lt;p&gt;Were there ambiguities?&lt;/p&gt;

&lt;p&gt;Was the user authorized to access this data?&lt;/p&gt;

&lt;p&gt;If the result looks unusual, what might explain it?&lt;/p&gt;

&lt;p&gt;Arisyn’s intelligent query flow includes intent recognition, synonym retrieval, clarification, table relationship discovery, SQL generation and validation, query execution, and result summarization. Its result display includes summary, reasoning, boundaries, SQL, data, charts, and timing details.&lt;/p&gt;

&lt;p&gt;For a CTO, explainability is not a nice-to-have.&lt;/p&gt;

&lt;p&gt;It is a production requirement.&lt;/p&gt;

&lt;p&gt;Without explanation, business users cannot trust the result.&lt;/p&gt;

&lt;p&gt;Without SQL, technical teams cannot review it.&lt;/p&gt;

&lt;p&gt;Without definitions, management cannot rely on it.&lt;/p&gt;

&lt;p&gt;Without boundaries, governance cannot control it.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;5. Build a feedback loop&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Enterprise data intelligence is not a one-time project. It is a system that must improve over time.&lt;/p&gt;

&lt;p&gt;Every failed query, field ambiguity, metric conflict, and user correction should feed back into semantic governance, relationship correction, knowledge supplementation, and test validation.&lt;/p&gt;

&lt;p&gt;Without a feedback loop, the system remains a demo.&lt;/p&gt;

&lt;p&gt;With a feedback loop, it gradually becomes production-grade.&lt;/p&gt;

&lt;p&gt;In the documented relationship between Intalink and Arisyn, Intalink provides data lineage, relationship discovery, and metadata management, while Arisyn builds semantic definitions, intelligent querying, and workflow orchestration on top. Together they form a layered data infrastructure and intelligent application architecture.&lt;/p&gt;

&lt;p&gt;This layered design turns one-off AI answers into a governable, auditable, and continuously improving data intelligence system.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Tool selection: stop comparing feature checklists&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Enterprise teams often evaluate tools by feature lists.&lt;/p&gt;

&lt;p&gt;Does it support natural language query?&lt;/p&gt;

&lt;p&gt;Does it support NL2SQL?&lt;/p&gt;

&lt;p&gt;Does it have lineage?&lt;/p&gt;

&lt;p&gt;Does it have a data catalog?&lt;/p&gt;

&lt;p&gt;Does it support permissions?&lt;/p&gt;

&lt;p&gt;Does it have agents?&lt;/p&gt;

&lt;p&gt;Does it support MCP?&lt;/p&gt;

&lt;p&gt;Does it have workflow orchestration?&lt;/p&gt;

&lt;p&gt;These questions matter, but they are not enough.&lt;/p&gt;

&lt;p&gt;A CTO should ask deeper questions.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Does the tool strengthen the existing data system, or bypass it?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Some AI tools produce fast demos by bypassing existing governance, permissions, and metric systems.&lt;/p&gt;

&lt;p&gt;That is dangerous.&lt;/p&gt;

&lt;p&gt;A good enterprise AI data tool should organize existing systems into AI-readable context, not replace them with an isolated shortcut.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Can it turn technical metadata into business semantics?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Managing tables and fields is not the same as supporting business questions.&lt;/p&gt;

&lt;p&gt;Can fields map to business metrics?&lt;/p&gt;

&lt;p&gt;Do metrics have versions?&lt;/p&gt;

&lt;p&gt;Do dimensions have valid scopes?&lt;/p&gt;

&lt;p&gt;Can business definitions be governed?&lt;/p&gt;

&lt;p&gt;Can ambiguity be detected and resolved?&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Does it understand table relationships, or only field names?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Many NL2SQL errors come from incorrect joins.&lt;/p&gt;

&lt;p&gt;If a system relies mainly on field-name similarity, it will fail in complex enterprise environments.&lt;/p&gt;

&lt;p&gt;Relationship discovery, relationship confidence, candidate paths, best-path selection, and relationship updates are foundational for intelligent querying.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Is the result explainable and auditable?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Enterprise data intelligence is not a consumer chatbot.&lt;/p&gt;

&lt;p&gt;Wrong results can affect business decisions.&lt;/p&gt;

&lt;p&gt;Wrong permissions can create compliance risks.&lt;/p&gt;

&lt;p&gt;Wrong definitions can create organizational conflict.&lt;/p&gt;

&lt;p&gt;The system must explain reasoning, SQL, data sources, metric definitions, and access boundaries.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Can it learn from failure?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Many intelligent query projects fail because the first production results are not accurate enough.&lt;/p&gt;

&lt;p&gt;But the real issue is not that the first answer is wrong. The issue is whether the system can understand why it was wrong and retain the correction.&lt;/p&gt;

&lt;p&gt;Without feedback, humans will always be firefighting.&lt;/p&gt;

&lt;p&gt;With feedback, the system can improve.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;My view of the enterprise data intelligence stack&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If I were designing an enterprise data intelligence architecture from scratch, I would not define it as an “AI query tool.”&lt;/p&gt;

&lt;p&gt;I would define it as a five-layer system.&lt;/p&gt;

&lt;p&gt;The first layer is the data asset layer: connecting data sources, extracting metadata, and maintaining table and field assets.&lt;/p&gt;

&lt;p&gt;The second layer is the data relationship layer: discovering and validating table relationships, field relationships, cross-source relationships, and join paths.&lt;/p&gt;

&lt;p&gt;The third layer is the semantic governance layer: managing business terms, metrics, dimensions, formulas, versions, and permission constraints.&lt;/p&gt;

&lt;p&gt;The fourth layer is the intelligent execution layer: handling intent understanding, query generation, tool calls, SQL validation, multi-step reasoning, and result generation.&lt;/p&gt;

&lt;p&gt;The fifth layer is the feedback and operations layer: collecting user feedback, diagnosing errors, supplementing knowledge, managing tickets, evaluating quality, and improving continuously.&lt;/p&gt;

&lt;p&gt;Each layer has a clear responsibility.&lt;/p&gt;

&lt;p&gt;The model should not decide business definitions by itself.&lt;/p&gt;

&lt;p&gt;The semantic layer should not guess data relationships.&lt;/p&gt;

&lt;p&gt;The relationship layer should not replace business explanation.&lt;/p&gt;

&lt;p&gt;The query layer should not bypass governance.&lt;/p&gt;

&lt;p&gt;The feedback layer should not depend on human memory.&lt;/p&gt;

&lt;p&gt;That is the architecture enterprise AI data systems need.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Conclusion: the stronger AI becomes, the more enterprises need data order&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The global AI trend is becoming clearer.&lt;/p&gt;

&lt;p&gt;Models will become stronger.&lt;/p&gt;

&lt;p&gt;Agents will become more common.&lt;/p&gt;

&lt;p&gt;Tool-calling standards will mature.&lt;/p&gt;

&lt;p&gt;Enterprise systems will become more deeply connected to AI.&lt;/p&gt;

&lt;p&gt;But the real dividing line will not be who adopts the newest model first.&lt;/p&gt;

&lt;p&gt;The real dividing line will be:&lt;/p&gt;

&lt;p&gt;Who can organize messy enterprise data into AI-understandable structure?&lt;/p&gt;

&lt;p&gt;Who can turn business language into governed semantics?&lt;/p&gt;

&lt;p&gt;Who can turn table relationships into verifiable connection maps?&lt;/p&gt;

&lt;p&gt;Who can make intelligent query explainable, auditable, and correctable?&lt;/p&gt;

&lt;p&gt;Those are the companies that will move AI from demo capability to production capability.&lt;/p&gt;

&lt;p&gt;The future of enterprise data intelligence will not be just a smarter BI tool.&lt;/p&gt;

&lt;p&gt;It will not be just a chatbot that writes SQL.&lt;/p&gt;

&lt;p&gt;It will be a new operating layer for enterprise data:&lt;/p&gt;

&lt;p&gt;Semantics to understand the business.&lt;/p&gt;

&lt;p&gt;Relationships to connect the data.&lt;/p&gt;

&lt;p&gt;Governance to define the boundaries.&lt;/p&gt;

&lt;p&gt;Agents to execute work.&lt;/p&gt;

&lt;p&gt;Feedback to improve over time.&lt;/p&gt;

&lt;p&gt;That is the real implementation path for enterprise data intelligence in the AI era.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>agents</category>
      <category>data</category>
    </item>
    <item>
      <title>AI Coding Agents Are Lowering the Barrier to Building Enterprise Data Apps - But the Real Barrier Is Moving</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Thu, 14 May 2026 16:00:00 +0000</pubDate>
      <link>https://dev.to/arisyn/ai-coding-agents-are-lowering-the-barrier-to-building-enterprise-data-apps-but-the-real-barrier-19ni</link>
      <guid>https://dev.to/arisyn/ai-coding-agents-are-lowering-the-barrier-to-building-enterprise-data-apps-but-the-real-barrier-19ni</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyie0f0bke7u2ffjws4pu.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyie0f0bke7u2ffjws4pu.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;br&gt;
In April 2026, GitHub published a beginner-focused series on GitHub Copilot CLI, showing how developers can use an AI coding assistant directly from the command line. GitHub describes Copilot CLI as bringing agentic AI capabilities into the terminal, where it can understand repositories, generate code, run tests, fix errors, and support iterative development without forcing developers to switch tools.&lt;/p&gt;

&lt;p&gt;This is more than a productivity feature.&lt;/p&gt;

&lt;p&gt;It signals a deeper shift: software development is moving from code-first interaction toward intent-driven execution.&lt;/p&gt;

&lt;p&gt;Instead of starting with files, functions, and command syntax, developers can now begin with natural language:&lt;/p&gt;

&lt;p&gt;Generate a data API for this analytics module.&lt;br&gt;
Review this SQL query for performance issues.&lt;br&gt;
Add tests for this data transformation.&lt;br&gt;
Convert this API response into a JSON format suitable for charts.&lt;/p&gt;

&lt;p&gt;For enterprise data teams, this looks like a major reduction in development friction.&lt;/p&gt;

&lt;p&gt;But there is a catch.&lt;/p&gt;

&lt;p&gt;AI coding agents reduce the barrier to writing code.&lt;br&gt;
They do not automatically solve the harder problem of understanding enterprise data.&lt;/p&gt;

&lt;p&gt;AI coding agents lower the barrier to engineering actions&lt;/p&gt;

&lt;p&gt;GitHub’s documentation describes Copilot CLI as a terminal-native AI coding assistant that brings agentic capabilities directly into the command line and can work autonomously on complex tasks while keeping users in control.&lt;/p&gt;

&lt;p&gt;That matters because the terminal is still one of the most important places where real engineering work happens.&lt;/p&gt;

&lt;p&gt;With an AI agent inside the CLI, developers can:&lt;/p&gt;

&lt;p&gt;understand unfamiliar repositories faster;&lt;br&gt;
generate or modify code from a prompt;&lt;br&gt;
run and fix tests;&lt;br&gt;
summarize project structure;&lt;br&gt;
automate focused tasks through non-interactive commands.&lt;/p&gt;

&lt;p&gt;GitHub also explains that Copilot CLI supports both interactive and non-interactive modes: interactive mode is useful for iterative, hands-on work, while non-interactive mode is designed for quick, focused prompts directly from the shell.&lt;/p&gt;

&lt;p&gt;This makes AI coding agents useful not only for senior developers, but also for junior developers, data engineers, analysts, and platform teams who need to move faster across unfamiliar projects.&lt;/p&gt;

&lt;p&gt;However, enterprise data applications are not normal applications.&lt;/p&gt;

&lt;p&gt;The difficult part is often not creating a route, rendering a chart, or writing a function.&lt;/p&gt;

&lt;p&gt;The difficult part is knowing what the data actually means.&lt;/p&gt;

&lt;p&gt;Enterprise data apps are hard because enterprise data is hard&lt;/p&gt;

&lt;p&gt;Imagine a business user asks for a new data app:&lt;/p&gt;

&lt;p&gt;Show the gross profit trend of strategic customers by region over the past six months, and identify customers with a significant decline.&lt;/p&gt;

&lt;p&gt;At first glance, this looks like a dashboard request.&lt;/p&gt;

&lt;p&gt;But behind the request are many hidden questions:&lt;/p&gt;

&lt;p&gt;What does “strategic customer” mean?&lt;br&gt;
Is “region” based on customer ownership, sales organization, delivery location, or finance reporting structure?&lt;br&gt;
Does “gross profit” come from orders, invoices, contracts, or finance-adjusted profit tables?&lt;br&gt;
Which tables contain the required data?&lt;br&gt;
How should customer, order, invoice, product, and profit tables be joined?&lt;br&gt;
Are there multiple valid join paths?&lt;br&gt;
Which metric definition is currently active?&lt;br&gt;
Does the requesting user have permission to see this data?&lt;/p&gt;

&lt;p&gt;An AI coding agent can generate code faster.&lt;/p&gt;

&lt;p&gt;But if it does not understand these business and data constraints, it may generate a working application that returns the wrong answer.&lt;/p&gt;

&lt;p&gt;That is the real challenge.&lt;/p&gt;

&lt;p&gt;As AI reduces the cost of code generation, the bottleneck shifts from coding to context.&lt;/p&gt;

&lt;p&gt;From code-driven development to context-driven development&lt;/p&gt;

&lt;p&gt;Traditional development is code-driven.&lt;/p&gt;

&lt;p&gt;A requirement becomes a specification.&lt;br&gt;
The specification becomes APIs, SQL, services, and UI components.&lt;/p&gt;

&lt;p&gt;AI coding agents push the process toward context-driven development:&lt;/p&gt;

&lt;p&gt;Natural language intent + codebase context + data context + business semantic context + tool execution = working data application&lt;/p&gt;

&lt;p&gt;This means future development productivity will depend not only on how well a team uses AI tools, but also on how well the enterprise prepares machine-readable context.&lt;/p&gt;

&lt;p&gt;For enterprise data apps, an AI agent needs at least four types of context.&lt;/p&gt;

&lt;p&gt;First, it needs business semantic context: metrics, dimensions, business terms, definitions, formulas, and valid scopes.&lt;/p&gt;

&lt;p&gt;Second, it needs data asset context: data sources, tables, fields, primary keys, field meanings, and data types.&lt;/p&gt;

&lt;p&gt;Third, it needs data relationship context: how tables connect, which fields are used for joins, and which relationship paths are trustworthy.&lt;/p&gt;

&lt;p&gt;Fourth, it needs governance context: permissions, versions, audit requirements, sensitive fields, and data quality status.&lt;/p&gt;

&lt;p&gt;Without these layers, the agent is mostly guessing.&lt;/p&gt;

&lt;p&gt;In simple projects, guessing may be acceptable.&lt;/p&gt;

&lt;p&gt;In enterprise data systems, guessing is dangerous.&lt;/p&gt;

&lt;p&gt;The semantic layer becomes the translator for AI coding agents&lt;/p&gt;

&lt;p&gt;When a business user says:&lt;/p&gt;

&lt;p&gt;“I want to analyze the decline in gross profit for strategic customers.”&lt;/p&gt;

&lt;p&gt;An AI coding agent should not immediately write SQL.&lt;/p&gt;

&lt;p&gt;It should first understand the business meaning behind the request.&lt;/p&gt;

&lt;p&gt;That is the role of the semantic layer.&lt;/p&gt;

&lt;p&gt;A semantic layer translates business language into governed data language. It manages metrics, dimensions, terminology, formulas, units, scopes, and versions.&lt;/p&gt;

&lt;p&gt;In the Arisyn architecture, Arisyn is positioned as an enterprise semantic-layer intelligent query engine. Its documented capabilities include natural language understanding, business semantic definitions, semantic mapping, terminology management, metric and dimension definitions, and version/gray-release management.&lt;/p&gt;

&lt;p&gt;For AI coding agents, this matters because the semantic layer can answer questions such as:&lt;/p&gt;

&lt;p&gt;What does this business term mean?&lt;br&gt;
Which metric definition is active?&lt;br&gt;
What dimensions are allowed?&lt;br&gt;
Which tables and fields represent this concept?&lt;br&gt;
Are there ambiguities?&lt;br&gt;
Is the current user allowed to query it?&lt;/p&gt;

&lt;p&gt;Without this layer, AI may automate misunderstanding.&lt;/p&gt;

&lt;p&gt;With this layer, AI can generate code under business constraints.&lt;/p&gt;

&lt;p&gt;The data relationship layer becomes the map for AI coding agents&lt;/p&gt;

&lt;p&gt;Enterprise data apps often need to combine multiple tables.&lt;/p&gt;

&lt;p&gt;A customer profitability dashboard may involve customer master data, contracts, orders, invoices, payments, product information, sales organization, and profit detail tables.&lt;/p&gt;

&lt;p&gt;The hard part is not writing SELECT.&lt;/p&gt;

&lt;p&gt;The hard part is choosing the correct join path.&lt;/p&gt;

&lt;p&gt;Intalink is documented as an enterprise data lineage and relationship discovery platform. Its capabilities include data source management, table management, relationship discovery, task execution, and relationship indicators such as co-occurrence count, distinct count, and inclusion ratio. It also discovers table relationships, field relationships, primary/foreign key relationships, and semantic relationships.&lt;/p&gt;

&lt;p&gt;In an AI coding agent workflow, this kind of layer becomes a data connection map.&lt;/p&gt;

&lt;p&gt;Instead of guessing:&lt;/p&gt;

&lt;p&gt;customer.id = order.customer_id&lt;/p&gt;

&lt;p&gt;the agent should ask the relationship layer:&lt;/p&gt;

&lt;p&gt;Which tables are actually connected?&lt;br&gt;
What fields connect them?&lt;br&gt;
How strong is the relationship?&lt;br&gt;
Are there multiple candidate paths?&lt;br&gt;
Which path matches the current business definition?&lt;br&gt;
Are there cross-system relationships?&lt;/p&gt;

&lt;p&gt;This reduces the risk of producing code that runs but returns misleading results.&lt;/p&gt;

&lt;p&gt;A new development workflow for enterprise data apps&lt;/p&gt;

&lt;p&gt;A future enterprise data app workflow may look like this:&lt;/p&gt;

&lt;p&gt;A business user describes the goal:&lt;/p&gt;

&lt;p&gt;I want a dashboard showing gross profit decline for strategic customers by region and product over the past six months.&lt;/p&gt;

&lt;p&gt;The AI coding agent does not immediately generate code.&lt;/p&gt;

&lt;p&gt;Instead, it performs a context-enriched development flow:&lt;/p&gt;

&lt;p&gt;Ask the semantic layer to clarify “strategic customer,” “gross profit,” “region,” and the time period.&lt;br&gt;
Ask the data relationship layer to identify valid table relationships.&lt;br&gt;
Generate SQL based on governed definitions and trusted join paths.&lt;br&gt;
Generate backend APIs.&lt;br&gt;
Generate frontend components.&lt;br&gt;
Generate tests.&lt;br&gt;
Run the project locally.&lt;br&gt;
Produce reviewable code changes.&lt;br&gt;
Ask for human clarification when ambiguity remains.&lt;/p&gt;

&lt;p&gt;The documented relationship between Intalink and Arisyn follows this kind of layered logic: Intalink provides data source management, table and field extraction, and technical relationship discovery, while Arisyn builds business semantics and supports intelligent querying and NL2SQL on top of that foundation.&lt;/p&gt;

&lt;p&gt;This is the real opportunity.&lt;/p&gt;

&lt;p&gt;AI coding agents are not just making developers faster.&lt;/p&gt;

&lt;p&gt;They are pushing enterprise software development toward a governed assembly line powered by structured context.&lt;/p&gt;

&lt;p&gt;What changes for enterprise teams?&lt;/p&gt;

&lt;p&gt;AI coding agents will not eliminate developers.&lt;/p&gt;

&lt;p&gt;But they will change the shape of enterprise data teams.&lt;/p&gt;

&lt;p&gt;In the past, data application development depended heavily on:&lt;/p&gt;

&lt;p&gt;data engineers to find tables and write SQL;&lt;br&gt;
backend engineers to build services;&lt;br&gt;
frontend engineers to build dashboards;&lt;br&gt;
analysts to explain requirements;&lt;br&gt;
governance teams to manage definitions, permissions, and quality.&lt;/p&gt;

&lt;p&gt;These roles will remain, but the collaboration model will change.&lt;/p&gt;

&lt;p&gt;Three capabilities become more important.&lt;/p&gt;

&lt;p&gt;The first is context engineering.&lt;br&gt;
Teams that can turn data sources, metadata, metrics, relationships, permissions, and business definitions into agent-readable context will get more value from AI coding tools.&lt;/p&gt;

&lt;p&gt;The second is agent review.&lt;br&gt;
Humans will need to review whether AI-generated code follows business definitions, data rules, security boundaries, and engineering standards.&lt;/p&gt;

&lt;p&gt;The third is data product thinking.&lt;br&gt;
When code becomes easier to generate, the scarce skill becomes defining the right problem, designing the right analysis path, and making the result useful for decisions.&lt;/p&gt;

&lt;p&gt;AI lowers the cost of implementation.&lt;/p&gt;

&lt;p&gt;It increases the value of correct problem definition.&lt;/p&gt;

&lt;p&gt;Conclusion: AI coding agents need enterprise context to be truly useful&lt;/p&gt;

&lt;p&gt;GitHub Copilot CLI shows that AI is moving deeper into the developer workflow: the terminal, the repository, the test loop, and eventually the pull request.&lt;/p&gt;

&lt;p&gt;This will make software development faster.&lt;/p&gt;

&lt;p&gt;But for enterprise data applications, the most important question is not:&lt;/p&gt;

&lt;p&gt;Can AI write code?&lt;/p&gt;

&lt;p&gt;The real question is:&lt;/p&gt;

&lt;p&gt;Can AI write code with the right enterprise data context?&lt;/p&gt;

&lt;p&gt;Without a semantic layer, AI does not know what business language means.&lt;br&gt;
Without a relationship layer, AI does not know how data connects.&lt;br&gt;
Without governance, AI does not know what can be trusted.&lt;br&gt;
Without feedback loops, AI does not know how to improve.&lt;/p&gt;

&lt;p&gt;So the future of enterprise data app development is not simply:&lt;/p&gt;

&lt;p&gt;Developer + Copilot.&lt;/p&gt;

&lt;p&gt;It is more likely to be:&lt;/p&gt;

&lt;p&gt;Business intent + AI coding agent + semantic layer + data relationship engine + governance + human review.&lt;/p&gt;

&lt;p&gt;That is how AI coding agents can truly lower the barrier to enterprise data application development.&lt;/p&gt;

</description>
      <category>agents</category>
      <category>ai</category>
      <category>cli</category>
      <category>github</category>
    </item>
    <item>
      <title>AI Automation Workflows Are Redefining Enterprise Data Engineering</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Wed, 13 May 2026 15:25:00 +0000</pubDate>
      <link>https://dev.to/arisyn/ai-automation-workflows-are-redefining-enterprise-data-engineering-3bad</link>
      <guid>https://dev.to/arisyn/ai-automation-workflows-are-redefining-enterprise-data-engineering-3bad</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkx5ydqtyrv1xq60w4smr.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkx5ydqtyrv1xq60w4smr.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;br&gt;
Anthropic’s recent AI engineering automation story has attracted a lot of attention. According to media coverage, a complex engineering workload that might have taken weeks was significantly compressed with AI assistance. The AI was able to continue working after engineers left, fixing bugs, running CI, creating changes, and moving pull requests forward.&lt;/p&gt;

&lt;p&gt;The important point is not the old question of whether AI will replace engineers. The real signal is this: AI is moving from a conversational assistant into an executable engineering workflow.&lt;/p&gt;

&lt;p&gt;For enterprise data engineering, this shift matters a lot.&lt;/p&gt;

&lt;p&gt;Most data engineering work is not just about writing SQL. It involves connecting data sources, understanding table structures, identifying field relationships, aligning business definitions, scheduling jobs, validating outputs, and handling failures. A typical data task may require several steps before any analysis can happen:&lt;/p&gt;

&lt;p&gt;A new system needs to be connected.&lt;br&gt;
Tables and fields need to be understood.&lt;br&gt;
Relationships across tables need to be identified.&lt;br&gt;
Business terms need to be mapped to physical data.&lt;br&gt;
Queries need to be generated, validated, and executed.&lt;br&gt;
Errors need to be traced, corrected, and documented.&lt;/p&gt;

&lt;p&gt;These tasks are repetitive, but they are not always simple. Enterprise data environments are often messy. Table names may not follow a standard. Field definitions may be inconsistent. Historical systems may contain hidden dependencies. Cross-system relationships may not be documented. This is why data engineering still depends heavily on human experience.&lt;/p&gt;

&lt;p&gt;AI automation becomes valuable when these tasks can be decomposed into an executable workflow.&lt;/p&gt;

&lt;p&gt;A practical workflow may look like this:&lt;/p&gt;

&lt;p&gt;The system first understands the user’s intent.&lt;br&gt;
Then it searches relevant data sources, tables, fields, and metadata.&lt;br&gt;
Next, it calls tools for relationship discovery, semantic mapping, SQL generation, validation, and execution.&lt;br&gt;
After that, it summarizes the result and checks whether the answer is reliable.&lt;br&gt;
If something is missing or ambiguous, it creates a trackable follow-up task.&lt;/p&gt;

&lt;p&gt;This is similar to what is happening in software engineering. In software development, AI can work with code repositories, tests, CI pipelines, and pull requests. In data engineering, AI needs a different foundation: metadata, data relationships, semantic definitions, job orchestration, permissions, and audit trails.&lt;/p&gt;

&lt;p&gt;Without this foundation, an AI system may look intelligent but still produce unreliable results. It may generate SQL, but not know whether the join path is correct. It may answer a business question, but not know whether the metric definition is approved. It may run a query, but not know whether the underlying data has changed.&lt;/p&gt;

&lt;p&gt;This is where platforms like Arisyn and Intalink become relevant.&lt;/p&gt;

&lt;p&gt;Intalink works closer to the data foundation layer. Its role is to manage data sources, tables, fields, metadata, relationship discovery, and data extraction tasks. In simple terms, it helps answer these questions: Where is the data? What tables exist? What fields are available? How are the tables connected? Which relationships are trustworthy?&lt;/p&gt;

&lt;p&gt;Arisyn sits closer to the semantic and execution layer. It uses natural language understanding, semantic mappings, workflow orchestration, parameter extraction, SQL generation, and result explanation to turn business questions into executable data analysis tasks. It helps answer a different question: How can a business user’s question be understood and converted into a reliable data query?&lt;/p&gt;

&lt;p&gt;Together, the two layers can support a more complete automation chain:&lt;/p&gt;

&lt;p&gt;Natural language question → semantic understanding → metadata narrowing → relationship discovery → SQL generation and validation → query execution → result explanation → knowledge improvement.&lt;/p&gt;

&lt;p&gt;The key value is not simply “using an LLM to write SQL.” The key value is turning the hidden middle layer of data engineering into callable, traceable, and reusable system capabilities.&lt;/p&gt;

&lt;p&gt;Of course, this does not mean enterprise data engineering can become fully autonomous overnight. In scenarios involving complex business definitions, strict permissions, or unstable data quality, humans still need to review definitions, validate results, and handle exceptions. AI should not bypass governance. It should make governance faster, more transparent, and easier to accumulate.&lt;/p&gt;

&lt;p&gt;The lesson from Anthropic’s case is clear: the next leap in engineering productivity will not come from a model alone. It will come from the combination of models, tools, workflows, and verification mechanisms.&lt;/p&gt;

&lt;p&gt;For enterprise data teams, the next competitive advantage may not only be the number of data engineers they have. It may be whether they can turn repetitive data tasks, relationship discovery, semantic interpretation, and execution processes into automated system capabilities.&lt;/p&gt;

&lt;p&gt;That is the real meaning of AI automation workflows for enterprise data engineering.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>data</category>
      <category>llm</category>
    </item>
    <item>
      <title>How Enterprise Data Governance Supports Security and Efficiency in the AI Agent Era</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Tue, 12 May 2026 20:15:00 +0000</pubDate>
      <link>https://dev.to/arisyn/how-enterprise-data-governance-supports-security-and-efficiency-in-the-ai-agent-era-19oa</link>
      <guid>https://dev.to/arisyn/how-enterprise-data-governance-supports-security-and-efficiency-in-the-ai-agent-era-19oa</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffb99nehsr994k8aou755.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffb99nehsr994k8aou755.png" alt=" " width="800" height="533"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;AI agents are moving beyond conversation. They are no longer limited to answering questions. They can call tools, access systems, read files, operate data, and complete business workflows across applications.&lt;/p&gt;

&lt;p&gt;This shift also explains why the discussion around “AI agent entry points” and “security infrastructure” is becoming more important. A recent 36Kr article about “Lobster Box” highlighted the growing need for end-cloud security infrastructure in the AI agent era, especially as agents increasingly rely on local scheduling, plugin-based execution, and data movement between devices and cloud environments.&lt;/p&gt;

&lt;p&gt;For enterprises, this issue is even more critical.&lt;/p&gt;

&lt;p&gt;Individual users may worry about privacy leakage. Enterprises face a broader set of risks: Can an agent access data it should not access? Can it respect different permission boundaries across departments, roles, and systems? Is the query generated by the agent aligned with the correct business definition? Is the data source trustworthy? If the result affects a business decision, can the company trace how the answer was produced?&lt;/p&gt;

&lt;p&gt;In other words, when AI agents enter enterprise data environments, the security question is not only whether the model is safe. The deeper question is whether the data access chain is governed.&lt;/p&gt;

&lt;p&gt;A reliable enterprise AI agent needs at least three foundational capabilities.&lt;/p&gt;

&lt;p&gt;The first is metadata management.&lt;br&gt;
An agent needs to know what data sources exist, what tables are available, what fields they contain, and what those fields mean in a business context. Without metadata, the agent can only guess. That often leads to incorrect table selection, wrong field usage, and inconsistent metric interpretation.&lt;/p&gt;

&lt;p&gt;The second is data lineage and relationship discovery.&lt;br&gt;
Enterprise data is usually distributed across ERP, CRM, finance systems, supply chain platforms, data warehouses, and data lakes. A simple business question may require multiple tables and several relationship paths. If an agent does not know how tables are connected, it may generate incorrect SQL or join unrelated data.&lt;/p&gt;

&lt;p&gt;The third is permission control, auditability, and traceability.&lt;br&gt;
The more powerful an agent becomes, the more boundaries it needs. Enterprises need to control not only who can ask questions, but also what data can be accessed, what actions can be executed, and how each result is logged and audited.&lt;/p&gt;

&lt;p&gt;From this perspective, the combination of Intalink and Arisyn represents a practical architecture for enterprise adoption.&lt;/p&gt;

&lt;p&gt;Intalink focuses on the underlying data governance layer. It is positioned as an enterprise data lineage and relationship discovery platform, supporting data source management, table and field management, relationship discovery, and relationship context for SQL generation. For AI agents, this works like an enterprise data relationship map before execution begins.&lt;/p&gt;

&lt;p&gt;The purpose of this map is not to expose more technical complexity to business users. Its value is to help agents guess less, make fewer mistakes, and avoid crossing data boundaries. For example, when a user asks, “Show me the latest invoice amount for each customer,” the system should not rely only on semantic similarity. It should use governed metadata, table relationships, field relationships, and lineage context to determine which tables can be joined, which fields should be used, and whether the relationship is reliable.&lt;/p&gt;

&lt;p&gt;Arisyn operates closer to the intelligent application layer. It uses semantic governance and natural language querying to translate business questions into executable analytical processes. Its capabilities include natural language understanding, intent recognition, clarification, relationship discovery, SQL generation and validation, query execution, and result summarization, with support for reasoning traces, SQL, data tables, charts, and execution details.&lt;/p&gt;

&lt;p&gt;This suggests that enterprise AI agents should not jump directly from a user question to a database query. A safer workflow looks like this:&lt;/p&gt;

&lt;p&gt;A user asks a business question.&lt;br&gt;
The semantic layer identifies metrics, dimensions, time ranges, and business definitions.&lt;br&gt;
The governance layer provides trusted data sources, tables, fields, and relationship paths.&lt;br&gt;
The agent generates and validates SQL.&lt;br&gt;
The system executes the query and records the result, reasoning process, and boundaries.&lt;/p&gt;

&lt;p&gt;The key idea is to place intelligent execution on top of data governance. The agent can complete tasks faster, but every step is constrained by semantics, relationships, and permissions.&lt;/p&gt;

&lt;p&gt;In the future, the main challenge of enterprise AI agent deployment may not be whether a company has access to powerful models. The real question may be whether the company has a governance foundation that allows agents to access data safely, understand business meaning correctly, and execute tasks reliably.&lt;/p&gt;

&lt;p&gt;Without that foundation, the stronger the agent becomes, the greater the risk.&lt;br&gt;
With that foundation, agents can move from impressive demos to real enterprise productivity.&lt;/p&gt;

&lt;p&gt;In the AI agent era, enterprise data intelligence is not only about making machines smarter. It is about enabling machines to do the right things within the right data boundaries.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>agents</category>
      <category>data</category>
    </item>
    <item>
      <title>AI Agents for Enterprise Data Analytics: From Chat Interfaces to Reliable Execution</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Tue, 12 May 2026 15:30:00 +0000</pubDate>
      <link>https://dev.to/arisyn/ai-agents-for-enterprise-data-analytics-from-chat-interfaces-to-reliable-execution-23hj</link>
      <guid>https://dev.to/arisyn/ai-agents-for-enterprise-data-analytics-from-chat-interfaces-to-reliable-execution-23hj</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ft4wvkyair1kxdbtysz6f.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ft4wvkyair1kxdbtysz6f.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The global AI conversation is changing. Companies are no longer asking only whether large language models are powerful. They are asking a more practical question: can AI agents actually enter enterprise workflows, connect to real data, understand business context, and produce reliable results?&lt;/p&gt;

&lt;p&gt;This shift matters a lot for enterprise data analytics.&lt;/p&gt;

&lt;p&gt;Most companies do not lack data. They already have databases, dashboards, BI tools, and reporting systems. The real problem is that data is fragmented across systems, business terms are inconsistent, metric definitions are unclear, and table relationships often live only in the heads of experienced data engineers.&lt;/p&gt;

&lt;p&gt;A business user may ask a simple question: “Which customers are growing the fastest?” or “Where is inventory risk concentrated?” But behind that question, a data team may need to identify the right tables, confirm metric definitions, write SQL, validate joins, and explain the results.&lt;/p&gt;

&lt;p&gt;This is why an AI agent for enterprise analytics cannot be just another chatbot. It needs at least three layers of capability.&lt;/p&gt;

&lt;p&gt;The first layer is business understanding.&lt;br&gt;
Natural language questions must be translated into structured analytical intent. The system needs to identify metrics, dimensions, time ranges, business entities, and possible ambiguity. For example, “sales growth” may refer to order value, contract value, revenue, or gross margin. Without a governed semantic layer, an AI system may produce answers that sound correct but are not aligned with the business definition.&lt;/p&gt;

&lt;p&gt;The second layer is data structure understanding.&lt;br&gt;
Enterprise data usually lives across multiple databases, schemas, and tables. An AI agent should not guess how tables are connected. It needs reliable metadata, trusted join paths, field relationships, and data lineage. This layer determines whether natural language can be turned into accurate SQL.&lt;/p&gt;

&lt;p&gt;The third layer is governance and traceability.&lt;br&gt;
Enterprises cannot rely on a system that is “sometimes right.” They need explainable reasoning, visible SQL, clear query boundaries, ambiguity handling, auditability, and a feedback loop that improves the knowledge base over time.&lt;/p&gt;

&lt;p&gt;From this perspective, the combination of Arisyn and Intalink represents a practical implementation path.&lt;/p&gt;

&lt;p&gt;Arisyn works as the intelligent analytics interface. It turns business questions into a structured reasoning process: intent recognition, synonym retrieval, clarification, relationship discovery, SQL generation and validation, query execution, and result summarization. Instead of returning only a final answer, it can expose the reasoning path, SQL, data table, visualization, and execution details.&lt;/p&gt;

&lt;p&gt;Intalink works as the underlying data relationship engine. It focuses on data source management, metadata management, table and field relationship discovery, lineage analysis, and relationship quality evaluation. For AI agents, this foundation is critical. Agents should not rely only on language reasoning when performing data analysis. They need a trusted relationship layer that tells them where the data is and how it can be connected.&lt;/p&gt;

&lt;p&gt;A more reliable enterprise AI analytics architecture may look like this:&lt;/p&gt;

&lt;p&gt;A business user asks a question.&lt;br&gt;
The semantic layer interprets the business meaning.&lt;br&gt;
The data relationship engine provides trusted table and field paths.&lt;br&gt;
The AI agent generates SQL based on governed semantics and relationship context.&lt;br&gt;
The system executes the query and returns results, logic, SQL, and boundaries together.&lt;/p&gt;

&lt;p&gt;The value of this architecture is not to replace data teams. It is to reduce repetitive work: searching for tables, confirming definitions, writing similar SQL again and again, and explaining basic data logic. Business teams get faster answers. Data teams keep governance and control. The enterprise gradually builds reusable assets: metrics, semantics, metadata, and relationship knowledge.&lt;/p&gt;

&lt;p&gt;In the next stage of enterprise AI, the key differentiator may not be only model intelligence. It may be the ability to connect AI agents with real enterprise data, governed business meaning, and trusted analytical execution.&lt;/p&gt;

&lt;p&gt;For data analytics, the breakthrough is not simply asking AI a question. The breakthrough is enabling AI to understand what the question means, where the data lives, how the data connects, whether the answer is trustworthy, and what analysis should happen next.&lt;/p&gt;

&lt;p&gt;That is how AI agents move from impressive demos to real enterprise adoption.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>nl2sql</category>
    </item>
    <item>
      <title>Enterprise AI Is Not Just About LLMs — It Is About Making Data Understandable</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Mon, 11 May 2026 07:30:32 +0000</pubDate>
      <link>https://dev.to/arisyn/enterprise-ai-is-not-just-about-llms-it-is-about-making-data-understandable-4n39</link>
      <guid>https://dev.to/arisyn/enterprise-ai-is-not-just-about-llms-it-is-about-making-data-understandable-4n39</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fedhrle498gijptn7bdwx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fedhrle498gijptn7bdwx.png" alt=" " width="799" height="448"&gt;&lt;/a&gt;Over the past year, the AI conversation has been dominated by larger models, cheaper inference, agentic workflows, and new integration standards such as MCP. The direction is clear: AI is moving from impressive demos to practical systems that can work inside real business environments.&lt;/p&gt;

&lt;p&gt;But for data engineers, one problem remains largely unchanged.&lt;/p&gt;

&lt;p&gt;The hard part of enterprise AI is not connecting to an LLM. The hard part is helping the model understand enterprise data.&lt;/p&gt;

&lt;p&gt;In many AI data projects, the first prototype looks simple. A user asks a question in natural language. The model generates SQL. The SQL runs against a database. A table or chart comes back.&lt;/p&gt;

&lt;p&gt;That works well in a controlled demo.&lt;/p&gt;

&lt;p&gt;In a real enterprise environment, things become messy very quickly.&lt;/p&gt;

&lt;p&gt;What does “revenue” mean? Is it based on orders, invoices, payments, or financial confirmation?&lt;br&gt;
Is “customer” the same entity across CRM, contracts, billing, and support systems?&lt;br&gt;
Can two tables be joined just because both contain a column named customer_id?&lt;br&gt;
If there are raw tables, summary tables, historical tables, and manually maintained Excel imports, which one should the model use?&lt;/p&gt;

&lt;p&gt;These are not purely language problems. They are data-context problems.&lt;/p&gt;

&lt;p&gt;LLMs are good at understanding text, but they do not automatically understand the hidden structure of enterprise data. They do not know which tables are trusted, which fields represent business concepts, which relationships are valid, or which joins are dangerous.&lt;/p&gt;

&lt;p&gt;This is why enterprise AI data applications need a stronger foundation beneath the model.&lt;/p&gt;

&lt;p&gt;A practical architecture usually needs at least three layers.&lt;/p&gt;

&lt;p&gt;The first layer is metadata.&lt;br&gt;
The system needs to know what data sources exist, what tables are available, what columns they contain, what data types they use, and which tables are allowed to participate in analysis. Without a reliable metadata layer, the model is reasoning with incomplete context.&lt;/p&gt;

&lt;p&gt;The second layer is relationship discovery.&lt;br&gt;
In real databases, many relationships are not declared as foreign keys. They may be hidden in values, naming conventions, legacy system migrations, or business processes. A field such as customer name, contract number, product code, or project ID may appear across multiple systems. Whether those fields can be joined should be validated by data patterns, not guessed by a model.&lt;/p&gt;

&lt;p&gt;The third layer is semantic governance.&lt;br&gt;
Business users do not ask for columns. They ask for concepts: active customers, inventory balance, project workload, gross margin, revenue contribution. Each concept has definitions, filters, dimensions, time windows, and sometimes permission rules. If these meanings are not governed, the generated SQL may be technically valid but business-wrong.&lt;/p&gt;

&lt;p&gt;A more reliable path for enterprise AI data applications looks like this:&lt;/p&gt;

&lt;p&gt;connect data sources → extract metadata → discover data relationships → define business semantics → interpret natural language → generate and validate SQL → explain results → collect feedback.&lt;/p&gt;

&lt;p&gt;This is where the separation between a relationship engine and a semantic query layer becomes important.&lt;/p&gt;

&lt;p&gt;A system like Intalink focuses on the lower layer: data source management, metadata extraction, lineage, and relationship discovery. A system like Arisyn sits above that layer: natural language understanding, semantic mapping, SQL generation, multi-step reasoning, and result explanation.&lt;/p&gt;

&lt;p&gt;The value of this architecture is not that it adds more tools. The value is that it separates different types of uncertainty.&lt;/p&gt;

&lt;p&gt;The relationship engine handles how data connects.&lt;br&gt;
The semantic layer handles what business concepts mean.&lt;br&gt;
The LLM handles user intent, query planning, and explanation.&lt;br&gt;
The feedback loop improves the knowledge base and semantic rules over time.&lt;/p&gt;

&lt;p&gt;This is much more stable than asking a model to generate SQL directly from a vague business question.&lt;/p&gt;

&lt;p&gt;The next stage of enterprise AI will not be defined only by model size. It will be defined by how well AI systems can connect to real enterprise systems, understand structured data, respect business semantics, and produce results that engineers and business users can trust.&lt;/p&gt;

&lt;p&gt;In other words, enterprise AI does not start with the model.&lt;/p&gt;

&lt;p&gt;It starts with reliable data context.&lt;/p&gt;

&lt;p&gt;If LLMs are becoming the new interface for work, then metadata, lineage, relationship discovery, and semantic governance are the infrastructure that makes that interface usable.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>data</category>
      <category>llm</category>
    </item>
    <item>
      <title>The Missing Layer in Your Data Stack Why Semantic Intelligence Matters More Than Another BI Tool</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Fri, 17 Apr 2026 15:51:00 +0000</pubDate>
      <link>https://dev.to/arisyn/the-missing-layer-in-your-data-stackwhy-semantic-intelligence-matters-more-than-another-bi-tool-2i7f</link>
      <guid>https://dev.to/arisyn/the-missing-layer-in-your-data-stackwhy-semantic-intelligence-matters-more-than-another-bi-tool-2i7f</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flbh4r6gc80bbjljmtad2.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flbh4r6gc80bbjljmtad2.jpg" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;You’ve invested in the data warehouse. You’ve wired up the pipelines. You’ve licensed the BI platform. Your dashboards look great in the demo.&lt;/p&gt;

&lt;p&gt;And yet — your business teams still can’t get a straight answer out of their own data.&lt;/p&gt;

&lt;p&gt;This isn’t a tooling problem. It’s a semantic problem. And adding another BI tool won’t fix it.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Paradox of the Modern Data Stack&lt;/strong&gt;&lt;br&gt;
The modern data stack has solved a lot of hard problems. Storage is cheap. Compute is elastic. Pipelines are observable. Schemas are documented.&lt;/p&gt;

&lt;p&gt;But there’s a gap nobody talks about — the gap between what data says and what the business means.&lt;/p&gt;

&lt;p&gt;Ask three analysts “what was our revenue last quarter?” and you’ll get three different numbers. Not because the data is wrong. Because “revenue” means something different in each team’s model. Recognized revenue. Booked revenue. Collected revenue. Each is technically correct. None of them agree.&lt;/p&gt;

&lt;p&gt;This is the last-mile problem of the data stack: data reaches the warehouse, but it never reaches a shared understanding.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What a Semantic Layer Actually Is (and Isn’t)&lt;/strong&gt;&lt;br&gt;
Before going further, let’s be precise — because this term gets misused constantly.&lt;/p&gt;

&lt;p&gt;A semantic layer is not:&lt;/p&gt;

&lt;p&gt;· A BI tool with a friendly UI&lt;br&gt;
· A metadata catalog that documents your tables&lt;br&gt;
· A search index over your data assets&lt;br&gt;
· A natural language wrapper around SQL&lt;br&gt;
A semantic layer is a governed translation layer that sits between raw data structures and the business logic that depends on them. It maps business vocabulary to technical representations, understands the relationships between entities, and enforces governance policies — all as first-class concerns, not afterthoughts.&lt;/p&gt;

&lt;p&gt;The key word is governed. Without governance, you just have a mapping file. With governance, you have a semantic operating layer.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Technical Architecture (In Plain Terms)&lt;/strong&gt;&lt;br&gt;
A proper semantic layer has three core components working together:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. The Semantic Model&lt;/strong&gt;&lt;br&gt;
Business concepts — “Customer,” “Revenue,” “Churn Rate,” “Active SKU” — are defined as semantic objects with precise, versioned definitions. Each object maps to one or more physical data structures, with full lineage attached. When the definition of “Active Customer” changes, the change is versioned, audited, and propagated — not silently overwritten in a dashboard config file.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Relationship-Aware Query Logic&lt;/strong&gt;&lt;br&gt;
This is where most BI tools fall short. Flat SQL joins can answer simple questions. But real business questions traverse relationships: “Which product lines had the highest return rate in regions where NPS also dropped last quarter?”&lt;/p&gt;

&lt;p&gt;That question touches products, returns, regions, NPS surveys, and time — across at least three different source systems. A semantic engine understands these relationships structurally and resolves the query path automatically, without a data engineer writing a custom join.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Governance as a First-Class Citizen&lt;/strong&gt;&lt;br&gt;
Access control in most data stacks is enforced at the infrastructure level — who can query which table. Semantic governance operates at the meaning level: who can access which business concept, under which policy, with which context. Row-level security is expressed in business terms. Audit trails attach to semantic objects, not just raw queries.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;A Concrete Example&lt;/strong&gt;&lt;br&gt;
Here’s what the query pipeline looks like for a natural language question:&lt;/p&gt;

&lt;p&gt;“Show me revenue growth by region compared to last year”&lt;/p&gt;

&lt;p&gt;Without a semantic layer, this lands in an analyst’s queue. With one:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1.Intent interpretation&lt;/strong&gt; — the engine identifies “revenue growth,” “region,” and “YoY comparison” as semantic concepts&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2.Term resolution&lt;/strong&gt; — “revenue” maps to fact_sales.net_revenue (per the governed definition); “region” maps to dim_geography.sales_region&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3.Relationship traversal&lt;/strong&gt; — the engine resolves the join path across three tables automatically&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4.Governance check&lt;/strong&gt; — the requesting user’s role is validated against the semantic access policy for revenue data&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;5.Result + lineage&lt;/strong&gt; — the answer is returned with its full semantic provenance: which definitions were used, which relationships were traversed, which version of the metric was applied&lt;/p&gt;

&lt;p&gt;The business user gets an answer in seconds. The answer is explainable. And it’s the same answer every time — because it’s derived from a governed semantic model, not a one-off query.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What This Means for Your Data Team&lt;/strong&gt;&lt;br&gt;
The practical impact is significant:&lt;/p&gt;

&lt;p&gt;· Analyst bottleneck shrinks. Routine business questions are answered directly, without a ticket.&lt;br&gt;
· Metric consistency improves. One governed definition of “revenue” across every tool, every team, every dashboard.&lt;br&gt;
· Explainability becomes the default. Every result carries its reasoning — which matters enormously when a CFO asks “where did this number come from?”&lt;br&gt;
· Governance scales. Policies are defined once at the semantic level and enforced everywhere, rather than duplicated across dozens of BI reports.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Semantic Layer Is Infrastructure, Not a Feature&lt;/strong&gt;&lt;br&gt;
The data stack conversation has matured significantly over the past decade. We’ve moved from “how do we store data?” to “how do we move data?” to “how do we model data?”&lt;/p&gt;

&lt;p&gt;The next question is: how do we make data understandable?&lt;/p&gt;

&lt;p&gt;That’s not a BI problem. It’s not a pipeline problem. It’s a semantic problem — and it requires a semantic solution.&lt;/p&gt;

&lt;p&gt;The organizations that build a governed semantic layer aren’t just improving their dashboards. They’re building the infrastructure that makes every downstream data product — BI, AI, embedded analytics, executive reporting — more reliable, more consistent, and more trustworthy.&lt;/p&gt;

&lt;p&gt;That’s not a feature you add to your stack. That’s the layer your stack has been missing.&lt;/p&gt;

&lt;p&gt;Arisyn is a semantic layer platform built for enterprise data teams that need governed, explainable, and queryable intelligence across complex data landscapes.&lt;/p&gt;

</description>
      <category>dataengineering</category>
      <category>semanticlayer</category>
      <category>datagovernance</category>
      <category>naturallanguagequery</category>
    </item>
    <item>
      <title>Arisyn: Building an Enterprise Semantic Layer Between Natural Language and SQL</title>
      <dc:creator>Arisyn</dc:creator>
      <pubDate>Thu, 16 Apr 2026 15:40:00 +0000</pubDate>
      <link>https://dev.to/arisyn/arisyn-building-an-enterprise-semantic-layer-between-natural-language-and-sql-10fi</link>
      <guid>https://dev.to/arisyn/arisyn-building-an-enterprise-semantic-layer-between-natural-language-and-sql-10fi</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fa61qu4s9mjdo5du45qnk.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fa61qu4s9mjdo5du45qnk.png" alt=" " width="799" height="403"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Most Text-to-SQL tools answer one question at a time. Arisyn does something more ambitious — it builds a permanent semantic bridge between business language and your data warehouse.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Problem with One-Off NL2SQL&lt;/strong&gt;&lt;br&gt;
You’ve seen the demos. Ask an AI “What were our sales last quarter?” and it generates a perfect SQL query. Impressive.&lt;/p&gt;

&lt;p&gt;But here’s what those demos don’t show: what happens the next week, when someone asks the same question slightly differently?&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3phlz6ezuy5kzyj345i8.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3phlz6ezuy5kzyj345i8.png" alt=" " width="800" height="125"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Without a semantic layer, you’re relying on the AI to correctly interpret each phrasing every single time. That’s fragile. Inconsistent. Hard to audit.&lt;/p&gt;

&lt;p&gt;Arisyn takes a fundamentally different approach.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What Is Arisyn?&lt;/strong&gt;&lt;br&gt;
Arisyn is an enterprise-grade semantic layer and intelligent query platform. Instead of treating each natural language query as an isolated event, it maintains a persistent layer of business semantics that maps business concepts to specific data fields, tables, and calculations.&lt;/p&gt;

&lt;p&gt;The platform address: &lt;a href="http://8.152.97.100:23030/" rel="noopener noreferrer"&gt;http://8.152.97.100:23030/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Core philosophy: Define your business terms once. Query them naturally forever.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Architecture: Four Layers, One Goal&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fve9rbqxm0x53seemwgxx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fve9rbqxm0x53seemwgxx.png" alt=" " width="796" height="79"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This layered design means every capability — from query execution to version control to ticket management — has its proper place.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Six-Step Reasoning Chain&lt;/strong&gt;&lt;br&gt;
When a user types “query inventory balance total”, Arisyn executes a six-step reasoning pipeline:&lt;/p&gt;

&lt;p&gt;1.Intent Recognition — What type of query is this?&lt;br&gt;
2.Synonym Retrieval — Match business terminology to known semantic definitions&lt;br&gt;
3.Clarification Judgment — Does the query have enough information? (e.g., time range)&lt;br&gt;
4.Table Relationship Discovery — Discover candidate join paths via TeraLink integration&lt;br&gt;
5.SQL Generation &amp;amp; Validation — Generate SQL, validate syntax&lt;br&gt;
6.Execution &amp;amp; Aggregation — Run the query, aggregate results&lt;br&gt;
The critical difference: every step’s output is visible to the user. You see the generated SQL. You see which tables were joined. You see execution timing per step. This is an auditable AI — essential in enterprise contexts.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Dual Semantic Layers: Business ↔ Data&lt;/strong&gt;&lt;br&gt;
This is where Arisyn separates itself from generic NL2SQL tools.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Business Semantic Layer (Business User View)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwraydmbgfmmre4717pvj.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwraydmbgfmmre4717pvj.png" alt=" " width="800" height="148"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Data Semantic Layer (Data Administrator View)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe2wsx5b4d0tpned35ih9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe2wsx5b4d0tpned35ih9.png" alt=" " width="800" height="151"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The admin configures the mapping once. After that, any natural language phrasing that maps to this semantic definition will produce consistent, correct SQL.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Versioning &amp;amp; Gradual Rollout&lt;/strong&gt;&lt;br&gt;
This is an enterprise feature rarely seen in NL2SQL tools:&lt;/p&gt;

&lt;p&gt;Full version history for every mapping rule&lt;br&gt;
Gradual rollout with configurable percentage (0–100%)&lt;br&gt;
Per-customer rollout lists for multi-tenant deployments&lt;br&gt;
Draft → Active → Archived state management&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Who Is Arisyn For?&lt;/strong&gt;&lt;br&gt;
· Business users who need data without knowing SQL&lt;br&gt;
· Operations teams running recurring analysis (customer intel, inventory structure)&lt;br&gt;
· Data analysts who want to verify/accelerate SQL writing&lt;br&gt;
· Data administrators maintaining semantic standards across the org&lt;br&gt;
· IT/DevOps monitoring system health and managing tickets&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Bigger Picture: Why Semantic Layers Matter&lt;/strong&gt;&lt;br&gt;
The most valuable thing Arisyn does isn’t the AI query — it’s semantic persistence.&lt;/p&gt;

&lt;p&gt;Most AI tools have no memory. Ask the same thing two weeks later with different wording, and you might get a different answer.&lt;/p&gt;

&lt;p&gt;With a semantic layer:&lt;/p&gt;

&lt;p&gt;· Every correct query adds to organizational knowledge&lt;br&gt;
· Business terms have authoritative, versioned definitions&lt;br&gt;
· Data governance becomes systematic, not ad-hoc&lt;br&gt;
As enterprise data teams grapple with more users needing access to more data, the semantic layer isn’t a nice-to-have — it’s the architecture that makes data governance and data democratization compatible.&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
