Back to Case Studies
Automotive / Retail

One Number the Whole Group Trusts: Unifying CDK, Reynolds, and Tekion into a Single Source of Truth

June 1, 2026
10 min read

Grow a dealer group by acquisition and you inherit its data the hard way: one rooftop on CDK, another on Reynolds, the newest on Tekion. Three systems, three definitions of "customer," three versions of the truth, and no number leadership fully trusts.

Multi-DMSCDKReynoldsTekionDatabricksdbtIdentity ResolutionLakehouseData EngineeringAutomotive

The Problem: A Dealer Group with Three Versions of the Truth

Grow a dealer group by acquisition and you inherit its data the hard way. One rooftop runs CDK. Another runs Reynolds and Reynolds. The newest acquisition runs Tekion. Three systems, three definitions of "customer," three versions of the truth, and a leadership team that cannot get one trustworthy number across the group without a person and a spreadsheet in the middle.

The visible cost is the analyst time. Month-end becomes three to five days of exporting, stitching, and reconciling, and the GM still does not fully trust the figure that lands in the meeting. The invisible cost is larger: the same household gets mailed across rooftops because no system knows they are the same person, redundant reporting tools are paid for store by store, and decisions on inventory, ad spend, and staffing get made on data that is days stale.

This is a reference architecture, built and validated on AutoDataLab: DataStackX's synthetic dealer-group dataset of 40 rooftops, ~250,000 customers, and live CDK, Reynolds, and Tekion feeds with their real-world quirks preserved. The architecture, pipelines, and data contracts are production-grade and real. The dealer group and the dollar figures are modeled on industry benchmarks and our team's 20+ years of combined dealer-data experience, not a specific client. We scope your group's actual numbers in the first working session.

Why It Is Genuinely Hard: Three DMS Realities

This is not a "connect the APIs" problem. Each DMS models the dealership differently, down to how it decides a customer exists. A credible unification absorbs those quirks without losing data, and that is exactly where generic data teams stall and where dealer-native experience earns its keep.

DMSLineageThe quirks that break naive integrations
CDK GlobalMainframeALL-CAPS tables, codes instead of values, dates stored as YYYYMMDD numbers, and a customer who "does not exist" until they buy (service-only customers live in a separate table)
Reynolds & Reynolds90s relationalOne invoice table for both sales and service, customer IDs that collide across rooftops (must dedupe with dealer ID), and inconsistent NULL versus "N/A" handling
TekionCloud-nativeUUID keys and ISO timestamps (modern), but JSONB blobs that hide real complexity and soft-delete flags downstream logic must respect

A "customer count" that naively unions these three systems can be off by 20 to 30%: it double-counts the household that bought at one rooftop and services at another, and drops the service-only customers CDK never promoted to the master file.

What We Built: A Governed Foundation, Not Another Dashboard

Every DMS feeds one lakehouse through a source-adapter layer. Data flows through three governed tiers (raw, conformed, and analytics-ready), with identity resolution stitching customers into one identity across systems and households. KPIs are defined once, so "gross," "unit," and "service retention" mean the same thing in every rooftop and every report.

Data contracts and automated tests guard every layer, so a drifting feed or a broken number is caught before it reaches a meeting. The same codebase runs on a laptop and in the cloud; the difference is configuration, not a rewrite, which keeps the foundation ready for enterprise procurement and for every analytics use case that comes after it.

From "Three Days and a Debate" to One Trusted Number

What changesBefore: fragmented DMSAfter: single source of truth
Month-end group reporting3 to 5 days of manual stitchingAutomated overnight refresh
"How many unique customers?"Days of work, still wrongInstant, deduped, household-aware
Service retention across rooftopsNot measurableTracked daily, group-wide
Trust in the numberDebated in every meetingOne agreed definition
A new report request1 to 2 weeks of analyst workSelf-serve in hours
Adding a rooftop or DMSMonths of reworkPlug in one adapter

The headline operational result is roughly a 70% reduction in reporting cycle time and manual effort, with one governed definition of every core KPI and a clean base for everything downstream.

The ROI, in Dollars

Modeled for a representative 10-rooftop group (around 13,000 vehicles retailed and 150,000 repair orders per year, around $9 to $10M annual ad spend). Every figure is a hard, countable saving: a real line item a controller can tie to a payroll cost, a software contract, or a marketing invoice. No projected revenue, no "better decisions," no soft value.

Value driverHow it is recoveredConservativeUpside
Reporting labor redeployedAbout 1 to 1.5 FTE of controller and analyst time freed from manual stitching and reconciliation$120K$180K
Redundant tooling rationalizedPer-rooftop reporting add-ons, DMS report modules, and overlapping dashboards consolidated$40K$90K
Duplicate-marketing waste eliminatedIdentity resolution removes the 15 to 30% of mail and CRM spend hitting the same household across rooftops$50K$120K
Total hard annual savings~$210K~$390K

We deliberately excluded the larger "better decisions" upside (smarter inventory, pricing, and ad-spend calls), even though it is real. That value is earned by the analytics this foundation unlocks and is scoped as separate engagements. Keeping it out means the $210K to $390K is a floor you can defend in a board meeting, not a forecast you have to believe.

How Fast: An AI-Accelerated Build

The build runs in weeks, not quarters, because we do not start from a blank repo. AutoDataLab gives us proven CDK, Reynolds, and Tekion adapter patterns, an identity-resolution and KPI framework, and AI-assisted delivery that scaffolds the pipeline, generates the dbt models and tests, and proposes the source mappings.

PhaseDurationWhat lands
Discovery and accessAbout 1 weekDMS landscape mapped, KPI definitions agreed, data access secured, ROI numbers put against your group
AI-accelerated buildAbout 4 weeksAdapters, the three governed tiers, identity resolution, data contracts, and your core trusted KPIs, live in production on your cloud
Harden and expandOngoingYour team owns it with documentation and runbooks; we harden edge cases, add deeper domains, and sequence the first analytics win

The four-week build targets your core sales, service, and customer KPIs and assumes data access is in place; deeper domains follow in the expand phase.

The Real Prize: Everything It Unlocks

A trusted foundation is what makes the high-value work possible. Lead scoring, co-op fund recovery, F&I product propensity, service-retention modeling, used-car pricing: each is a seven-figure opportunity, and each is dead on arrival without clean, unified, governed data underneath it. The foundation is the land; the analytics roadmap is the expand.

What This Demonstrates

This reference build shows the full data-foundation stack for a multi-rooftop dealer group: source adapters that absorb each DMS's quirks, identity resolution and household stitching, a Kimball model with data contracts and automated tests, and a single governed KPI layer, all production-grade and cloud-switchable.

It is grounded in 20+ years of combined experience building these systems for the US automotive industry, including platforms that ran reporting and analytics across 4,000+ dealerships. The same pattern applies to any multi-system business carrying fragmented customer and operational data: retail groups, healthcare networks, and financial services.

Not Sure Where to Start?

Book a free 30-minute strategy session with a senior data architect — no pitch, no obligation.

Schedule Your Free Strategy Session

Not Sure Where to Start? Start Here.

We offer a free 30-minute strategy session with a senior data or AI architect — not a sales rep. Bring your current challenge, your stack, or just a vague sense that your data situation needs to improve. We'll give you an honest assessment of where to begin.

No pitch. No obligation. Just a useful conversation.

Typically responds within 1 business day · Available for India, US, UK & Canada