FY27-Q1 ABM Ad Copy Framework

GF #1 Procter & Gamble Greenfield

Databricks · Airflow · Azure · ML platforms AI factory investment at scale No active data catalog identified

Account value prop — internal context, not ad copy

P&G has invested heavily in an AI factory running Databricks and Airflow pipelines across a global data estate. The problem: without data contracts enforced at pipeline runtime, models train on whatever data arrives — degraded schemas, stale records, missing fields. Real-time metadata from DataHub streams lineage from every pipeline execution, so AI teams catch data quality issues in seconds rather than discovering them after a bad model run.

⚡ Signal: AI factory build-out creates urgent need for data contract enforcement upstream of model training.

1

Problem Name the pain — earn attention

AI factory. No data contract layer.

Databricks and Airflow pipelines power your AI factory. Without contracts, models train on whatever arrives — corrupt data included.

Headline: 6 words ✓ Body: 19 words ✓

2

Solution Make the fix feel within reach

Data contracts enforced as pipelines execute.

Every Airflow run and Databricks job validated against freshness and schema SLAs in real time, before models consume degraded data.

Headline: 6 words ✓ Body: 20 words ✓

3

Why DataHub Specific, credible reason to click

Streaming metadata. Not 24-hour batch lag.

Purview scans on schedule. DataHub streams. When a pipeline fails at 2am, your AI factory knows in seconds — not at dawn.

Headline: 6 words ✓ Body: 20 words ✓

OSS #1 Parker Hannifin OSS Tier 1

Databricks · Airflow · Azure Synapse Filtration Group + Curtis Instruments acquisitions Multi-stack lineage gap post-merger

Account value prop — internal context, not ad copy

Parker Hannifin has absorbed two major acquisitions — Filtration Group and Curtis Instruments — each bringing its own data stack. The result is three separate data environments (Databricks, Airflow, Azure Synapse) with no unified lineage layer across them. DataHub's OSS foundation maps column-level lineage across all three stacks simultaneously, and it's portable — so when the next acquisition arrives, the metadata layer moves with it.

⚡ Signal: Active post-merger data consolidation creates immediate need for cross-stack lineage unification.

1

Problem Name the pain — earn attention

Two acquisitions. Zero unified lineage layer.

Filtration Group and Curtis Instruments added two more data stacks. No one can trace lineage across all three environments.

Headline: 6 words ✓ Body: 19 words ✓

2

Solution Make the fix feel within reach

One lineage graph. All three stacks, live.

DataHub maps column-level lineage across Databricks, Airflow, and Synapse — streaming as each inherited pipeline executes.

Headline: 7 words ✓ Body: 16 words ✓

3

Why DataHub Specific, credible reason to click

Lineage that survives the next acquisition, too.

DataHub's OSS foundation keeps your metadata layer portable. When architecture shifts mid-merger, your lineage goes with it.

Headline: 7 words ✓ Body: 17 words ✓

GF #2 Johnson & Johnson Greenfield

Airflow · dbt · Snowflake · Databricks Population Analytics — 15PB clinical data FDA audit traceability requirement

Account value prop — internal context, not ad copy

J&J's Population Analytics team manages 15 petabytes of clinical trial data flowing through Airflow, dbt, Snowflake, and Databricks. FDA submissions require full data provenance — and today that traceability is reconstructed manually. DataHub streams column-level lineage from clinical source systems through every dbt transformation in real time, so the chain of custody is always current when auditors ask.

⚡ Signal: FDA audit requirements make real-time provenance a compliance imperative, not a nice-to-have.

1

Problem Name the pain — earn attention

Fifteen petabytes. No real-time lineage layer.

Your Population Analytics team traces clinical data by hand across Airflow, dbt, Snowflake, and Databricks. FDA audits don't wait.

Headline: 6 words ✓ Body: 20 words ✓

2

Solution Make the fix feel within reach

Automated provenance from trial data to report.

Column-level lineage flows from source through every dbt transformation — verified, auditable, live. No manual reconciliation before submissions.

Headline: 7 words ✓ Body: 18 words ✓

3

Why DataHub Specific, credible reason to click

Real-time lineage for 15PB of clinical data.

DataHub streams metadata as trials run, not hours later. Every downstream consumer sees chain-of-custody that's current — not yesterday's snapshot.

Headline: 7 words ✓ Body: 19 words ✓

GF #3 Verizon Greenfield

Kafka · Airflow · BigQuery 200PB across Verizon + Frontier stacks Alation deployed — batch scan limitation

Account value prop — internal context, not ad copy

Verizon's Frontier acquisition doubled their data estate to 200PB across two separate telecom stacks. Alation is deployed but relies on scheduled batch scans — meaning lineage across Kafka, Airflow, and BigQuery reflects yesterday, not now. DataHub streams metadata in real time as pipelines run, giving teams accurate cross-stack lineage immediately — not after the nightly scan cycle completes.

⚡ Signal: Frontier integration creates active cross-stack lineage gap that Alation's batch model cannot bridge.

1

Problem Name the pain — earn attention

Frontier doubled your data. Not your visibility.

200PB across two telecom stacks. Alation's batch scans don't reflect what changed overnight in Airflow, Kafka, or BigQuery.

Headline: 7 words ✓ Body: 19 words ✓

2

Solution Make the fix feel within reach

Unified metadata across both stacks, live.

DataHub streams lineage from Kafka through Airflow to BigQuery — across Verizon and Frontier infrastructure — as pipelines run.

Headline: 6 words ✓ Body: 18 words ✓

3

Why DataHub Specific, credible reason to click

Alation batches. DataHub streams. Speed matters.

When your team needs to know which Frontier pipeline feeds a consumer dashboard, DataHub answers in seconds — not after the nightly scan.

Headline: 5 words ✓ Body: 20 words ✓

OSS #2 IQVIA OSS Tier 1

Snowflake · dbt · Airflow Active cloud modernization program Governance lagging migration pace

Account value prop — internal context, not ad copy

IQVIA's Data Architecture team is executing a major cloud modernization — migrating to Snowflake with dbt and Airflow powering transformation pipelines. The pattern: infrastructure moves fast, governance arrives six months later. DataHub inverts this by attaching lineage metadata to every new pipeline at ingestion, so governance isn't retrofitted after the migration settles — it's built in from the start.

⚡ Signal: Active cloud migration creates a governance lag opportunity — DataHub closes the gap in real time.

1

Problem Name the pain — earn attention

Cloud migration. Governance arriving six months late.

Your Data Architecture team is modernizing fast — Snowflake, dbt, Airflow. But metadata governance lags behind every migration wave.

Headline: 6 words ✓ Body: 18 words ✓

2

Solution Make the fix feel within reach

Governance that migrates with every pipeline.

DataHub ingests Snowflake, dbt, and Airflow metadata in real time — so every new cloud pipeline arrives with lineage already attached.

Headline: 6 words ✓ Body: 19 words ✓

3

Why DataHub Specific, credible reason to click

Built for migrations. Not retrofitted after.

Most catalogs document what's already settled. DataHub tracks what's actively moving — so your cloud modernization doesn't inherit technical debt.

Headline: 5 words ✓ Body: 19 words ✓

GF #4 State Street Greenfield

Snowflake · Airflow · dbt Alpha Data Platform — $380B new mandates Compliance traceability requirement

Account value prop — internal context, not ad copy

State Street's Alpha Data Platform is onboarding new institutional mandates at record pace — $380B in new AUM requiring data workflows that compliance teams can actually trace. Without real-time lineage, custody and compliance teams cannot verify which pipelines touch which mandate's data. DataHub streams column-level lineage through Snowflake, Airflow, and dbt as each mandate's data moves — so investment SLAs are traceable in real time, not reconstructed after the nightly batch.

⚡ Signal: Alpha Platform onboarding acceleration creates urgent compliance lineage gap.

1

Problem Name the pain — earn attention

$380B in new mandates. Governance pending.

The Alpha Data Platform is onboarding at record pace. Without real-time lineage, compliance teams can't trace what each mandate actually touches.

Headline: 6 words ✓ Body: 20 words ✓

2

Solution Make the fix feel within reach

Live lineage for every investment workflow.

Column-level lineage streams through Snowflake, Airflow, and dbt as each mandate's data moves — no gaps, no batch lag.

Headline: 6 words ✓ Body: 18 words ✓

3

Why DataHub Specific, credible reason to click

Investment SLAs can't run on batch metadata.

DataHub streams lineage as portfolios update. When custody teams trace a data issue, answers come in seconds — not after the nightly scan.

Headline: 7 words ✓ Body: 20 words ✓

OSS #3 Charles Schwab OSS Tier 1

Snowflake · BigQuery · Redshift · Pub/Sub · Airflow TD Ameritrade (Forge) integration ongoing Three-warehouse metadata silos

Account value prop — internal context, not ad copy

Schwab runs three separate data warehouses — Snowflake, BigQuery, and Redshift — plus Pub/Sub and Airflow for streaming and orchestration. The TD Ameritrade (Forge) integration adds new data flows across all three. No single catalog can see across all three warehouses simultaneously. Snowflake Horizon covers Snowflake. Collibra covers documentation. DataHub is the only platform that maps real-time lineage across all three warehouses at once — live, not scheduled.

⚡ Signal: Forge integration creates active multi-warehouse lineage gap that single-platform tools cannot bridge.

1

Problem Name the pain — earn attention

Three warehouses. Zero unified lineage view.

Snowflake, BigQuery, Redshift — each with its own metadata silo. When Forge data arrives, no one knows what feeds what.

Headline: 6 words ✓ Body: 19 words ✓

2

Solution Make the fix feel within reach

One lineage layer across all three warehouses.

DataHub maps column-level dependencies from Pub/Sub through Airflow to BigQuery, Snowflake, and Redshift — simultaneously, in real time.

Headline: 7 words ✓ Body: 17 words ✓

3

Why DataHub Specific, credible reason to click

Snowflake Horizon only sees one warehouse.

Your Forge integration spans all three. DataHub is the only catalog that crosses Snowflake, BigQuery, and Redshift at once — live.

Headline: 6 words ✓ Body: 18 words ✓

OSS #4 Intel OSS Tier 1

Airflow · Snowflake · Databricks · Kafka Platform efficiency mandate No unified metadata layer across four tools

Account value prop — internal context, not ad copy

Intel is running Airflow, Snowflake, Databricks, and Kafka simultaneously — each generating metadata that no team can trace across platforms. An active efficiency mandate requires consolidation. DataHub's OSS foundation ingests lineage from all four platforms simultaneously without custom scripts or manual documentation, and because it's open source, the metadata layer is owned by Intel — not locked to any vendor that might itself be consolidated.

⚡ Signal: Internal efficiency mandate creates urgency around eliminating duplicated tooling and manual metadata work.

1

Problem Name the pain — earn attention

Four platforms. One metadata consolidation mandate.

Airflow, Snowflake, Databricks, Kafka — each generating metadata no one can trace. Your efficiency mandate needs one answer.

Headline: 5 words ✓ Body: 18 words ✓

2

Solution Make the fix feel within reach

One real-time metadata layer across all four.

DataHub ingests lineage from every platform simultaneously — so data teams see dependencies without manual documentation or custom scripts.

Headline: 7 words ✓ Body: 18 words ✓

3

Why DataHub Specific, credible reason to click

Open source survives any restructuring.

Enterprise catalogs lock you in. DataHub's OSS foundation means your metadata stays yours — regardless of which platforms get consolidated.

Headline: 5 words ✓ Body: 18 words ✓

OSS #5 Nike OSS Tier 1 HOT 97 / 100

128 active devs Airflow · dbt · Databricks · Snowflake 60 champions · highest density "Win Now" turnaround $5B marketing analytics Entry: amit.ankalkoti@nike.com

Account value prop — internal context, not ad copy

Nike runs Airflow orchestration, Databricks Lakehouse, Snowflake analytics, and dbt transformations simultaneously — but without a unified metadata layer, lineage lives in four separate silos. When a pipeline breaks, engineers trace it manually across platforms. The "Win Now" turnaround and $5B marketing investment demand data that's reliable at speed. DataHub's native connectors for all four platforms — plus real-time streaming metadata — unify lineage without migration or retraining. The OSS pilot path (2–3 weeks) leverages 128 existing developer champions already evaluating the platform.

⚡ Urgency: 128 devs active May 6–8, 2026 · HOT evaluation in flight · Win Now mandate creates budget authority NOW · "Add to FY27 strategic accounts list immediately" flagged in the model

1

Problem Name the pain — earn attention

Four platforms. One metadata blind spot.

Airflow jobs, Databricks notebooks, Snowflake tables, dbt models — all generating siloed metadata. Your team debugs by hand.

Headline: 6 words ✓ Body: 17 words ✓

2

Solution Make the fix feel within reach

Live lineage as every DAG runs.

Column-level lineage streams from Airflow through Databricks to Snowflake in real time — no batch reconstruction, no manual tracing.

Headline: 6 words ✓ Body: 18 words ✓

3

Why DataHub Specific, credible reason to click

Because stale metadata breaks more than pipelines.

When a DAG fails and metadata is hours old, debugging takes days. DataHub updates in seconds — so your team finds the root cause, not the symptom.

Headline: 7 words ✓ Body: 20 words ✓

OSS #6 Adobe OSS · DEPLOY HOT · Active Opp 93 / 100

192 active devs Databricks · dbt · Snowflake · AI Tooling OSS telemetry confirmed Semrush acquisition $1.9B · Apr 2026 Entry: vchandra@adobe.com (active May 5)

Account value prop — internal context, not ad copy

Adobe runs DataHub OSS across a 90-team Databricks environment at DEPLOY stage — the upgrade conversation is already open. The Semrush acquisition (April 2026, $1.9B) adds 3,000+ new data sources that need immediate governance. DataHub Cloud closes the gap the OSS deployment leaves: managed ingestion, automated data contracts, proactive quality detection. CEO Narayen stepping down creates a decision window before new leadership freezes platform decisions. Motion: validate OSS deployment against Cloud value, not land-and-replace.

⚡ Urgency: Semrush closes Q3 2026 — 3,000+ sources need governance BEFORE integration. CEO transition = 60-day decision window. Murthy Chandrapaty active May 5, 2026.

1

Problem Name the pain — earn attention

Semrush added three thousand new data sources.

Your 90 Databricks teams already run fast. Now 3,000+ Semrush sources need the same lineage and governance — immediately.

Headline: 6 words ✓ Body: 18 words ✓

2

Solution Make the fix feel within reach

Governance that scales with every acquisition.

DataHub's streaming metadata ingests every new source as it onboards — column-level lineage attached from day one, not after migration settles.

Headline: 6 words ✓ Body: 19 words ✓

3

Why DataHub Specific, credible reason to click

You're already running DataHub. Cloud completes it.

192 of your engineers are on OSS. DataHub Cloud adds the automated governance and data contracts your 90-team mesh actually needs.

Headline: 6 words ✓ Body: 20 words ✓

Ad Copy Framework — FY27 Q1