What Is Data Privacy?
A customer files a data subject access request. The privacy team starts the clock. Thirty days. The legal team reviews the policy. The data team starts pulling records. The CRM exports cleanly. The product database exports cleanly. The marketing platform exports cleanly.
On day twenty-eight, a data scientist mentions that a copy of the customer's records is in a feature store nobody on the privacy team knew existed.
I have seen this on the security side, in rbac-audit-first walks through a Kubernetes platform. The audit shows the controls. The controls are present. The exposure is in a service account that was created six months ago by a team that did not register it with the platform team, running in a namespace the audit was not configured to look at. The control library is good; the inventory is wrong.
Data privacy programs fail this same way. The policy is well written. The DSAR runbook is well documented. The data inventory is wrong, because the inventory cannot keep up with the rate at which copies of the data are being made by teams that do not consider themselves data teams.
Step One — The Wrong Assumption
"We have a privacy program. We have a DSAR runbook. We are compliant."
"We have policies. We have controls. We have a data inventory. The DSAR will be straightforward." — Privacy review, every organization, the day before the first real DSAR
The privacy program is not the problem. The policy is reasonable, the runbook covers the documented systems, the controls map cleanly to GDPR Articles or CCPA Sections. None of this is theater. The program does what it says it does for the systems it knows about.
The systems it knows about are the systems the privacy team was told about during the inventory exercise three years ago. Since then, the company has shipped two ML platforms, three new analytics environments, an AI feature store, four data lakes, and a vector database that nobody calls a database. Each of these contains copies of customer data. None of them are on the inventory. None of them appear in the DSAR runbook. The first DSAR that touches any of them will surface the gap.
Step Two — The Partial Signal
Three of four systems respond to the DSAR. The fourth was never inventoried.
The runbook lists the systems to query for a DSAR: CRM, transactional databases, marketing platforms, support ticketing. These are the systems on the inventory. They respond. They produce records. The team meets the documented procedure.
What the documented procedure does not cover is the half-dozen places that customer data was copied into for legitimate operational reasons during the last two years — a feature store maintained by the ML team for fraud detection; an embedding index maintained by the AI team for personalization; a sandbox maintained by analytics for ad-hoc queries; a snapshot in a cold-storage bucket that was supposed to be deleted but is not. None of these are documented. None of them are on the runbook. The DSAR meets the procedure and misses the actual exposure.
This is the partial signal in privacy programs: the controls list goes green, the audit passes, and the substantive reach of the DSAR was always going to be limited by the quality of the inventory, which was never as complete as the program assumed it was.
Step Three — The Failed Fix
You commission an inventory refresh. Six months later it is already stale.
The team responds correctly. They commission a fresh inventory exercise: every system, every database, every ML platform, every analytics environment, every cold-storage bucket. They produce a current map. They update the runbook. They train the team. Six months pass. The map is already wrong.
The reason it is wrong is structural. New copies of customer data get created continuously, by teams whose primary job is not data privacy, in tooling that was procured without a privacy review. Each individual creation is reasonable in context. The aggregate is a privacy posture that decays at the speed the business adopts new tooling, which is faster than any inventory refresh cycle can catch up.
The team can refresh the inventory annually. The exposure is daily. The fix did not fix anything; it produced a snapshot of a position that the organization had moved past before the snapshot was reviewed.
Fig. 1 — The runbook ran perfectly. The inventory it ran against was the trailing indicator.
Step Four — The Real Failure
It was never a documentation gap. It was that nobody owned the inventory as a continuous job.
The actual failure is in the absence of a control function whose job is to know, continuously, every place customer data lives. Most privacy programs assume that responsibility belongs to the inventory exercise. The exercise produces a deliverable; the deliverable goes stale; the responsibility was never operational, so nothing in the organization fights the staleness.
What is missing is a control that fires when a new system is provisioned, when a new data flow is established, when a copy is made for a new use case — a control that registers the system, classifies the data, and binds it to the privacy posture before the copy is in production. Without that control, the inventory will always be the trailing indicator. The DSAR will always find the gap that was opened after the last review.
This is the same lesson on a different stack. The control library is the precondition. The operating discipline that keeps the controls accurate is the actual product. Programs that ship the first and skip the second produce a privacy posture that looks correct in the deck and fails on the day someone files a request.
Step Five — The Definition
Now the definition lands.
Data privacy is the continuous ability to know, control, and account for every use of personal information in the organization — not a list of policies or a set of one-time controls, but an operational discipline that produces an accurate inventory at the speed the business creates new copies.
Most definitions describe data privacy through its rights: the right to access, the right to be forgotten, the right to data portability. These are real and they are codified in GDPR, CCPA, LGPD, and twenty other regimes. The rights describe what the program owes the customer. The program's ability to honor them depends on something the rights themselves do not specify, which is the program's accuracy about where the customer's data actually is.
The rights are the front door. The inventory is the foundation. Programs that polish the front door without working the foundation fail in predictable, embarrassing, and expensive ways.
What Solix Enforces
The control fires when the copy is made, not when the request is filed.
What Solix's data privacy and governance layer enforces is the registration of new data flows at the boundary they cross — whether that boundary is the move from production to a feature store, the embedding of records into a vector index, the snapshot into a sandbox, or the export to a third party. The classification, the lineage, the access policy, and the retention rule are bound to the data at the moment of the copy, not retroactively reconstructed during the next inventory refresh.
For SAP ECC and Oracle E-Business Suite decommissioning, for legacy application retirement, and for AI training pipelines that consume historical records, the same model applies: the data leaves under policy, or it does not leave. The DSAR has somewhere to look because the privacy posture was a property of the boundary, not a property of a deck.
Three things to do this week
- Run a DSAR drill on a real (consenting) employee identity. Pick someone willing. Issue the DSAR internally. Watch where the request can and cannot reach. The systems that surprise you are the ones that will surprise you on the day a customer's request lands. Do this drill quarterly, not annually.
- Map the gap between your privacy inventory and your asset register. If the security team has an asset register that lists more systems than your privacy inventory, the difference is your privacy gap. The exercise of producing the diff is uncomfortable. The diff is the point.
- Establish the control that fires when a new system is provisioned. Whether through change management, terraform policy, or a procurement gate, build a path where a new system that will hold customer data cannot be provisioned without registering with the privacy program. Without this control, every other control trails the business indefinitely.
References
- Forrester Research — The Forrester Wave™: Privacy Management Software, Q4 2025. Report ID RES188585
- Forrester Research — The Privacy Management Software Landscape, Q2 2025. Report ID RES184404
- Gartner Peer Insights, market category — Digital Communications Governance and Archiving Solutions. Reviewed 2026
About the author
Barry Kunst is VP of Marketing at Solix Technologies. He writes about enterprise data lifecycle, application retirement, and modernization in systems that have outlived their original mandate. Earlier in his career he supported IBM zSeries ecosystems for CA Technologies' multi-billion-dollar mainframe business, with first-hand exposure to lifecycle risk at scale.
- Solix Leadership
- Forbes Technology Council
- MIT
Find him at:
What you can do with Solix
Enter to win a $100 Amex Gift Card
Related Resources
Explore related resources to gain deeper insights, helpful guides, and expert tips for your ongoing success.
-
-
On-Demand WebinarThe Power of Less: How Data Minimization Drives Data Privacy Compliance
Watch On-Demand Webinar -
Why SOLIXCloud
SOLIXCloud offers scalable, secure, and compliant cloud archiving that optimizes costs, boosts performance, and ensures data governance.
-
Common Data Platform
Unified archive for structured, unstructured and semi-structured data.
-
Reduce Risk
Policy driven archiving and data retention
-
Continuous Support
Solix offers world-class support from experts 24/7 to meet your data management needs.
-
On-demand AI
Elastic offering to scale storage and support with your project
-
Fully Managed
Software as-a-service offering
-
Secure & Compliant
Comprehensive Data Governance
-
Free to Start
Pay-as-you-go monthly subscription so you only purchase what you need.
-
End-User Friendly
End-user data access with flexibility for format options.
