Lineage Overview
Lineage is an AI-driven, code-aware lineage intelligence platform that provides comprehensive end-to-end visibility across data, applications, and transformation processes. It derives lineage by capturing logic directly from SQL, code, and transformation pipelines—rather than relying solely on metadata tables—to provide an accurate and explainable view of data dependencies.
The platform automatically generates lineage graphs that illustrate how data elements, processes, and systems are connected. Lineage traces data movement down to the individual column level, showing exactly which input fields contribute to specific outputs. This granular view enables users to explore dependencies, understand upstream and downstream relationships, and assess how changes propagate across assets at both the process and field level.
Lineage also connects data flows across databases, ETL/ELT pipelines, analytics engines, BI tools, and application components. This unified visibility strengthens governance, supports modernization initiatives, and enhances decision‑making by revealing the operational pathways that affect reporting, compliance, and enterprise processes.
Lineage Core Principles
Lineage is built on three core principles—truth, intelligence, and control—that support accurate dependency understanding, enterprise visibility, and proactive governance.
|
Truth |
Intelligence |
Control |
|
MCP Connectors ingest lineage signals directly from enterprise systems, platforms, & runtimes |
Connects lineage across data platforms, ETL/ELT, analytics, BI, and workflows |
Quantifies downstream impact of changes at column, process, and system levels |
|
Lineage captured directly from SQL, code, and transformation logic, not inferred metadata |
Navigable, human-readable lineage paths that make complex transformations understandable |
Embeds lineage into wave planning, phased migration, and conversion execution |
|
Precise tracing down to individual columns and fields |
Surfaces lineage paths that power critical reports, processes, and decisions |
AI-assisted verification reduces manual effort while scaling lineage across the estate |
|
Deterministic lineage graphs spanning data elements, applications, workflows, and systems |
From technical artifacts to explainable relationships |
From isolated decisions to portfolio-level governance |
Key Capabilities
The Lineage module includes the following key capabilities:
- End-to-End Enterprise Lineage: Generates comprehensive lineage graphs highlighting dependencies between data elements, applications, workflows, and systems.
- Column-Level Lineage: Traces lineage down to individual columns, showing which input fields influence which outputs.
- Code-Aware (Process) Dependency Extraction: Captures lineage directly from code, SQL, and transformation logic, not only from metadata tables.
- Cross-Domain Traceability: Connects lineage across data, workflows, ETL/ELT, analytics, and BI engines.
- Business-Critical Data Flow Identification: Identifies data flows and lineage paths that support critical business processes and reports.
- Impact & Change Analysis: Quantifies how modifications in one asset propagate across others, including column and process level impacts.
- Explainable Trace Paths: Provides navigable lineage explanations with trace paths and transformations.
- Integration with Modernization Lifecycle: Embeds lineage into wave plan – a phased migration roadmap & conversion execution phases of modernization.