Data Governance Foundations
Expert-defined terms from the Professional Certificate in Dama International Data Governance course at London School of Planning and Management. Free to read, free to share, paired with a professional course.
Accountability – The obligation of individuals or groups to explain and j… #
Related terms: Responsibility, Stewardship. Example: A data stewards reports quarterly on data quality metrics to senior management. Challenge: Ensuring clear lines of accountability across decentralized business units.
Actionable Insight – Information derived from data analysis that can be d… #
Related terms: Analytics, Decision Support. Example: Sales forecast indicating a 10% increase in demand for a product line, prompting inventory adjustments. Challenge: Translating raw data into insights that are timely and relevant to stakeholders.
Adverse Impact Assessment – A systematic evaluation of potential negative… #
Related terms: Bias Mitigation, Fairness. Example: Testing a credit scoring model for disparate impact on minority applicants. Challenge: Obtaining sufficient demographic data while respecting privacy regulations.
Aggregation – The process of summarizing detailed data into higher‑level… #
Related terms: Roll‑up, Summarization. Example: Summing daily sales transactions to produce monthly revenue figures. Challenge: Maintaining data lineage so users understand how aggregates were derived.
Algorithmic Transparency – The practice of documenting and communicating… #
Related terms: Explainability, Model Governance. Example: Publishing a model card that describes input variables, weighting, and expected performance. Challenge: Balancing intellectual property protection with the need for stakeholder trust.
Archiving – The long‑term storage of data that is no longer actively used… #
Related terms: Retention, Preservation. Example: Moving legacy financial records to a secure, immutable storage tier after seven years. Challenge: Ensuring archived data remains accessible and readable as technology evolves.
Asset Register – An inventory that lists all data assets, their owners, c… #
Related terms: Data Catalog, Metadata Repository. Example: A spreadsheet that records each database, its purpose, and the appointed data owner. Challenge: Keeping the register up to date in fast‑changing environments.
Authority Matrix – A governance tool that defines who has decision‑making… #
Related terms: RACI, Governance Framework. Example: A matrix showing that the Chief Data Officer approves data‑sharing agreements, while line managers approve data quality rules. Challenge: Preventing overlap or gaps that could lead to conflicting decisions.
Audit Trail – A chronological record of all actions performed on data, in… #
Related terms: Logging, Provenance. Example: System logs that capture who updated a customer address and when. Challenge: Storing audit logs securely while ensuring they are tamper‑proof and searchable.
Authority – The formal power granted to an individual or body to define,… #
Related terms: Governance, Mandate. Example: A data governance council empowered to approve data‑access requests. Challenge: Aligning authority with organizational culture to avoid resistance.
Baseline Data Quality – The initial measurement of data quality dimension… #
) Against which future improvements are tracked. Related terms: Data Profiling, Quality Metrics. Example: Reporting that 85% of customer records contain a valid email address at project start. Challenge: Establishing a realistic baseline without exhaustive profiling.
Business Glossary – A curated list of business terms, definitions, and sy… #
Related terms: Data Dictionary, Semantic Layer. Example: Defining “Customer Lifetime Value” with a precise formula and agreed‑upon calculation method. Challenge: Achieving consensus among diverse business units.
Business Rules – Formal statements that dictate how data must be captured… #
Related terms: Validation Rules, Policy. Example: “All sales orders must have a shipping date that is later than the order date.” Challenge: Managing rule changes without disrupting downstream systems.
Change Management – The structured approach to transitioning individuals,… #
Related terms: Adoption, Stakeholder Engagement. Example: Conducting workshops to train data stewards on new metadata standards. Challenge: Overcoming entrenched habits and legacy practices.
Classification – The act of assigning data to categories based on sensiti… #
Related terms: Sensitivity Labels, Data Tiering. Example: Tagging personal health information as “Highly Sensitive” to trigger encryption controls. Challenge: Automating classification at scale while minimizing false positives.
Compliance – Adherence to external laws, regulations, and internal polici… #
Related terms: Regulatory Requirements, Audit. Example: Ensuring GDPR‑required consent records are retained for the statutory period. Challenge: Keeping up with evolving legislation across multiple jurisdictions.
Consent Management – The processes and technologies used to capture, stor… #
Related terms: Privacy, Opt‑in/Opt‑out. Example: A web portal allowing users to withdraw consent for marketing emails. Challenge: Synchronizing consent status across legacy systems.
Data Architecture – The high‑level design of data structures, flows, and… #
Related terms: Enterprise Architecture, Data Modeling. Example: A layered architecture separating raw, curated, and presentation data zones. Challenge: Aligning architecture with rapid cloud adoption and hybrid environments.
Data Asset – Any collection of data that has value to the organization, s… #
Related terms: Data Inventory, Information Resource. Example: The customer master file used by sales, marketing, and support. Challenge: Identifying hidden or shadow assets that reside outside official inventories.
Data Catalog – A searchable repository that provides metadata, lineage, a… #
Related terms: Metadata Management, Asset Register. Example: A web portal where analysts locate the “Sales_2023” table and view its schema and owners. Challenge: Maintaining accurate metadata as pipelines evolve.
Data Classification Policy – A documented set of rules that dictate how d… #
Related terms: Classification, Security Controls. Example: Policy stating that “Confidential” data must be encrypted at rest and in transit. Challenge: Enforcing policy consistently across cloud and on‑premise resources.
Data Consumer – An individual or system that accesses, analyzes, or utili… #
Related terms: Data Producer, Stakeholder. Example: A business analyst extracting sales trends for a quarterly report. Challenge: Providing appropriate access while preserving data security.
Data Custodian – The technical owner responsible for the safe storage, tr… #
Related terms: Data Steward, IT Operations. Example: Database administrators who manage backups, patching, and performance tuning. Challenge: Balancing operational efficiency with governance controls.
Data Governance Council – A cross‑functional body that sets strategic dir… #
Related terms: Steering Committee, Authority Matrix. Example: Monthly meetings where the CDO, legal counsel, and business unit leaders review data‑risk assessments. Challenge: Ensuring the council has sufficient authority and representation to act decisively.
Data Governance Framework – The overall structure that defines roles, pro… #
Related terms: Governance Model, Control Environment. Example: A framework that integrates data quality, privacy, and security into a unified approach. Challenge: Adapting the framework to different business domains without creating silos.
Data Governance Maturity Model – A roadmap that assesses an organization’… #
Related terms: Capability Assessment, Continuous Improvement. Example: Moving from “Ad‑hoc” to “Defined” stage by establishing a formal data stewardship program. Challenge: Selecting metrics that accurately reflect progress.
Data Governance Policy – A high‑level document that articulates the organ… #
Related terms: Policy Statement, Standard. Example: A policy mandating that all personal data be stored in jurisdictions with adequate protection. Challenge: Translating policy language into actionable procedures.
Data Governance Program – The collection of initiatives, activities, and… #
Related terms: Program Management, Strategic Plan. Example: Launching a data‑quality improvement campaign alongside a privacy compliance audit. Challenge: Coordinating efforts across multiple departments and timelines.
Data Governance Roles – Defined responsibilities such as Data Owner, Data… #
Related terms: RACI, Role‑Based Access. Example: Assigning the Marketing Director as Data Owner for the “Campaign_Leads” dataset. Challenge: Avoiding role ambiguity that leads to gaps in accountability.
Data Governance Strategy – The long‑term plan that aligns data initiative… #
Related terms: Roadmap, Vision. Example: A strategy to achieve “Zero Data Breaches” by 2028 through mature governance. Challenge: Securing executive sponsorship and budget for sustained effort.
Data Integration – The process of combining data from disparate sources i… #
Related terms: ETL, Data Federation. Example: Merging CRM and ERP data to create a single customer 360 profile. Challenge: Reconciling inconsistent data models and handling latency.
Data Lineage – The visual or documented trace of data’s origin, transform… #
Related terms: Provenance, Traceability. Example: A lineage diagram showing that the “Revenue” metric originates from the “Sales_Transactions” table, is aggregated, and fed into the “Executive Dashboard”. Challenge: Capturing lineage in real time for dynamic pipelines.
Data Lifecycle – The series of stages that data passes through, from crea… #
Related terms: Retention, Disposition. Example: Data is captured during transaction processing, used for reporting, then archived after five years. Challenge: Enforcing lifecycle policies consistently across cloud services.
Data Literacy – The ability of individuals to read, work with, and commun… #
Related terms: Training, Skill Development. Example: A workshop teaching business users how to interpret data visualizations. Challenge: Scaling literacy programs to reach all employee levels.
Data Management – The set of disciplines that ensure data is accurate, av… #
Related terms: Data Governance, Data Operations. Example: Implementing master data management to maintain a single source of truth for products. Challenge: Integrating management practices across legacy and modern platforms.
Data Owner – The business individual accountable for the quality, securit… #
Related terms: Data Steward, Accountability. Example: The Finance VP owns the “General Ledger” dataset and approves access requests. Challenge: Aligning ownership with incentives and performance metrics.
Data Profiling – The systematic analysis of data to assess its structure,… #
Related terms: Data Quality Assessment, Discovery. Example: Scanning a customer table to identify missing phone numbers and duplicate records. Challenge: Performing profiling on large, streaming datasets without impacting performance.
Data Quality – The degree to which data is fit for its intended purpose,… #
Related terms: Data Cleansing, Metrics. Example: An accuracy rate of 98% for product SKUs after a cleansing run. Challenge: Maintaining quality in rapidly changing environments.
Data Quality Dashboard – A visual interface that displays key data‑qualit… #
Related terms: Scorecard, Monitoring. Example: A dashboard showing monthly completeness percentages for critical master data tables. Challenge: Selecting meaningful KPIs that drive corrective action.
Data Quality Rules – Prescriptive statements that define acceptable data… #
Related terms: Validation Rules, Business Rules. Example: “Postal code must be 5 digits for US addresses.” Challenge: Managing rule proliferation and ensuring they remain up‑to‑date.
Data Quality Standards – Formalized benchmarks that specify the minimum a… #
Related terms: Policy, Target. Example: Setting a 99% completeness target for key customer attributes. Challenge: Balancing ambitious standards with realistic operational capabilities.
Data Retention – The policy‑driven timeframe for which data must be kept… #
Related terms: Disposition, Compliance. Example: Retaining financial transaction logs for seven years to satisfy audit requirements. Challenge: Automating deletion while preserving evidence for potential investigations.
Data Security – The set of controls and practices designed to protect dat… #
Related terms: Encryption, Access Control. Example: Implementing role‑based access to restrict sensitive HR data to HR personnel only. Challenge: Ensuring security measures do not hinder legitimate data use.
Data Steward – The domain‑level manager responsible for defining data def… #
Related terms: Data Owner, Data Custodian. Example: A data steward for “Product Information” maintains the product master and resolves data‑issue tickets. Challenge: Providing stewards with sufficient authority and resources.
Data Stewardship – The ongoing activities performed by data stewards to o… #
Related terms: Governance, Operational Management. Example: Conducting monthly data‑issue reviews and updating the business glossary. Challenge: Embedding stewardship into daily workflows rather than treating it as a side project.
Data Subject – An individual whose personal data is processed under priva… #
Related terms: Personal Data, Consent. Example: A customer whose email address is stored for marketing communications. Challenge: Providing mechanisms for data subjects to exercise their rights (access, erasure, portability).
Data Synchronization – The process of ensuring that data copies across mu… #
Related terms: Replication, Integration. Example: Updating the CRM system whenever a new order is entered in the ERP. Challenge: Handling conflicts and latency in distributed environments.
Data Taxonomy – A hierarchical classification scheme that organizes data… #
Related terms: Ontology, Classification. Example: A taxonomy that groups “Financial Data” → “Revenue” → “Recurring Revenue”. Challenge: Keeping the taxonomy aligned with evolving business terminology.
Data Transparency – The principle that data handling practices, provenanc… #
Related terms: Algorithmic Transparency, Trust. Example: Publishing a data‑usage statement that explains how customer data supports personalization. Challenge: Balancing transparency with confidentiality of proprietary processes.
Data Validation – The act of checking data against defined rules to ensur… #
Related terms: Data Quality Rules, Input Controls. Example: Rejecting a record where the “Date of Birth” field contains a future date. Challenge: Implementing validation at scale without creating bottlenecks.
Data Visualization – The graphical representation of data to aid comprehe… #
Related terms: Dashboard, Reporting. Example: A bar chart showing quarterly sales growth by region. Challenge: Preventing misinterpretation through poor chart selection or lack of context.
Data Warehouse – A centralized repository optimized for analytical queryi… #
Related terms: OLAP, ETL. Example: A star schema containing fact tables for sales and dimension tables for products and time. Challenge: Managing schema evolution while preserving backward compatibility.
Data‑as‑a‑Service (DaaS) – A delivery model where data is provided on dem… #
Related terms: API, Cloud Data. Example: A market data provider offering real‑time pricing feeds through a RESTful API. Challenge: Ensuring consistent quality and latency guarantees across service tiers.
Decision Rights – The authority granted to individuals or groups to make… #
Related terms: Authority Matrix, Governance. Example: The Data Protection Officer holds the decision right to approve data‑processing agreements. Challenge: Documenting rights clearly to avoid decision paralysis.
De‑identification – The process of removing or obscuring personal identif… #
Related terms: Anonymization, Pseudonymization. Example: Replacing Social Security Numbers with randomly generated tokens. Challenge: Ensuring de‑identified data cannot be re‑identified through linkage attacks.
Disposal – The secure destruction or irreversible deletion of data that i… #
Related terms: Data Erasure, Retention. Example: Using cryptographic shredding to destroy backup tapes after the retention period expires. Challenge: Verifying complete removal across all storage media, including cloud snapshots.
Domain‑Driven Data Governance – An approach that assigns governance respo… #
Related terms: Data Stewardship, Federated Model. Example: Each product line owns its master data, while a central council sets enterprise‑wide policies. Challenge: Coordinating cross‑domain standards without creating bottlenecks.
Enterprise Data Model – A comprehensive representation of the organizatio… #
Related terms: Data Architecture, Logical Model. Example: A model that maps “Customer”, “Order”, and “Product” entities with their cardinalities. Challenge: Keeping the model current as new applications are introduced.
Entity‑Relationship Diagram (ERD) – A visual tool that depicts data entit… #
Related terms: Data Modeling, Diagram. Example: An ERD showing a one‑to‑many relationship between “Customer” and “Invoice”. Challenge: Translating complex ERDs into physical database designs without loss of intent.
Ethical Data Use – The practice of handling data in ways that respect mor… #
Related terms: Privacy, Fairness. Example: Using anonymized location data for traffic planning rather than targeted advertising. Challenge: Defining ethical boundaries in emerging technologies like AI.
Executive Sponsorship – The active support and advocacy of senior leaders… #
Related terms: Stakeholder Engagement, Governance Council. Example: The CEO champions a data‑quality improvement program and allocates budget. Challenge: Maintaining sponsor interest over long‑term projects.
External Data Source – Data that originates outside the organization, suc… #
Related terms: Data Integration, Third‑Party Data. Example: Purchasing demographic data from a census bureau to enrich customer profiles. Challenge: Verifying data provenance and ensuring compliance with licensing terms.
Federated Governance – A distributed model where individual business unit… #
Related terms: Domain‑Driven Governance, Authority Matrix. Example: Regional offices manage local data catalogs, but must follow corporate data‑classification standards. Challenge: Preventing policy drift and ensuring consistent enforcement.
File‑Based Data – Data stored in flat files such as CSV, JSON, XML, or Ex… #
Related terms: Structured Data, Data Ingestion. Example: Importing a CSV of vendor contacts into the CRM system. Challenge: Managing schema changes and ensuring data quality without a formal database structure.
GDPR (General Data Protection Regulation) – The EU regulation that sets s… #
Related terms: Privacy, Data Subject Rights. Example: Implementing a mechanism to delete a user’s data upon request within 30 days. Challenge: Aligning GDPR obligations with existing legacy systems.
Governance Framework – The collection of policies, standards, processes,… #
Related terms: Data Governance Framework, Control Environment. Example: A framework that defines data‑owner responsibilities, quality metrics, and audit procedures. Challenge: Ensuring the framework is flexible enough to accommodate new data types.
Information Governance – The broader discipline that encompasses data gov… #
Related terms: Data Governance, Records Management. Example: Coordinating policies for both electronic documents and email archives. Challenge: Integrating disparate governance practices into a unified approach.
Ingestion Pipeline – The automated workflow that captures, transforms, an… #
Related terms: ETL, Data Integration. Example: A streaming pipeline that reads sensor data, validates format, and writes to a time‑series database. Challenge: Ensuring reliability and handling schema evolution without downtime.
Integrity – The assurance that data remains accurate, consistent, and una… #
Related terms: Data Quality, Security. Example: Using checksums to detect corruption in transferred files. Challenge: Maintaining integrity when data traverses multiple heterogeneous systems.
International Data Transfer – The movement of data across national border… #
Related terms: Cross‑Border Compliance, Regulation. Example: Transferring EU customer data to a US‑based analytics platform under SCCs. Challenge: Monitoring regulatory changes that affect transfer mechanisms.
Metadata – Data that describes other data, providing context such as orig… #
Related terms: Data Catalog, Business Glossary. Example: Metadata fields that capture the creator, creation date, and sensitivity label of a dataset. Challenge: Capturing metadata automatically for unstructured data sources.
Metadata Management – The processes and tools used to create, store, main… #
Related terms: Metadata, Data Catalog. Example: Using a metadata repository to enforce naming conventions and lineage capture. Challenge: Aligning metadata standards across multiple technology stacks.
Master Data Management (MDM) – A set of practices and technologies that c… #
Related terms: Data Integration, Data Quality. Example: Consolidating duplicate customer records into a unified master profile. Challenge: Resolving conflicts and aligning governance across source systems.
Non‑Repudiation – A security property that ensures an action or transacti… #
Related terms: Integrity, Audit Trail. Example: A signed API request that proves the originator cannot later dispute having sent the data. Challenge: Implementing non‑repudiation in high‑throughput environments without excessive overhead.
Open Data – Data that is made publicly available without restrictions on… #
Related terms: Data Sharing, Transparency. Example: Publishing transportation statistics as CSV files on a government portal. Challenge: Balancing openness with privacy and intellectual‑property concerns.
Operational Data Store (ODS) – A database designed to integrate data from… #
Related terms: Data Warehouse, ETL. Example: An ODS that consolidates daily sales feeds for real‑time dashboarding. Challenge: Keeping the ODS synchronized with source systems while minimizing latency.
Owner‑Operator Model – A governance arrangement where the data owner defi… #
Example: The Marketing Director (owner) sets data‑retention rules, while the IT team (operator) configures automated deletion. Challenge: Ensuring clear communication to avoid misaligned expectations.
Privacy Impact Assessment (PIA) – A systematic review that evaluates how… #
Related terms: Risk Assessment, GDPR. Example: Conducting a PIA before launching a new mobile app that collects location data. Challenge: Integrating PIAs into agile development cycles without causing delays.
Procedural Governance – The aspect of governance that focuses on defined… #
Related terms: Process Management, Policy. Example: A documented workflow for approving data‑sharing agreements. Challenge: Keeping procedures up‑to‑date as technology and regulations evolve.
Quality Scorecard – A reporting tool that aggregates multiple data‑qualit… #
Related terms: Data Quality Dashboard, KPI. Example: A scorecard showing completeness, validity, and timeliness percentages for core master data. Challenge: Selecting balanced metrics that reflect both technical and business perspectives.
Regulatory Compliance – The state of adhering to laws, regulations, stand… #
Related terms: Compliance, Audit. Example: Demonstrating HIPAA compliance for protected health information storage. Challenge: Managing overlapping and sometimes conflicting regulatory regimes.
Risk Management – The systematic identification, assessment, and mitigati… #
Related terms: Risk Assessment, Controls. Example: Conducting a risk assessment to evaluate the likelihood of a data breach in a cloud environment. Challenge: Quantifying intangible risks such as reputational damage.
Role‑Based Access Control (RBAC) – An access‑management method that assig… #
Related terms: Access Control, Authorization. Example: Granting the “Analyst” role read‑only access to sales data while restricting write privileges. Challenge: Designing role hierarchies that reflect real‑world responsibilities without over‑provisioning.
Scalable Governance – Governance practices and technologies that can expa… #
Related terms: Automation, Federated Model. Example: Using AI‑driven classification to automatically label millions of documents. Challenge: Maintaining governance quality as automation introduces new points of failure.
Security Classification – The labeling of data based on its sensitivity a… #
Related terms: Classification, Access Control. Example: Marking internal financial forecasts as “Confidential” to enforce encryption and restricted access. Challenge: Consistently applying classification across heterogeneous data stores.
Service Level Agreement (SLA) – A contract that defines the expected perf… #
Related terms: Contract, Performance Metrics. Example: An SLA guaranteeing 99.9% Uptime for a data‑lake API. Challenge: Aligning SLAs with realistic operational capabilities and governance requirements.
Single Source of Truth (SSOT) – The concept that a particular data asset… #
Related terms: Master Data, Data Integration. Example: Using a unified product master to drive pricing, inventory, and e‑commerce sites. Challenge: Gaining consensus on which system should be the SSOT and maintaining its integrity.
Stakeholder Engagement – The process of involving relevant parties #
business users, IT, legal, compliance—in governance activities to secure buy‑in and collaboration. Related terms: Executive Sponsorship, Communication. Example: Holding quarterly workshops with data owners to review policy changes. Challenge: Addressing competing priorities and resource constraints.
Standard Operating Procedure (SOP) – A documented, step‑by‑step instructi… #
Related terms: Procedural Governance, Documentation. Example: An SOP for handling data‑subject access requests, from receipt to response. Challenge: Keeping SOPs current in fast‑changing regulatory environments.
Strategic Data Management – The alignment of data initiatives with long‑t… #
Related terms: Data Strategy, Roadmap. Example: Prioritizing data‑quality projects that enable predictive analytics for market expansion. Challenge: Demonstrating ROI to secure executive support.
Subject‑Matter Expert (SME) – An individual with deep knowledge of a spec… #
Related terms: Data Steward, Domain Expert. Example: A product manager who clarifies the meaning of “SKU” for data modeling. Challenge: Allocating SME time without disrupting core responsibilities.
Surrogate Key – An artificial identifier, often autogenerated, used as a… #
Related terms: Primary Key, Data Modeling. Example: Assigning an integer ID to each customer record regardless of external customer numbers. Challenge: Managing key generation in distributed environments to avoid collisions.
Tagging – The practice of attaching descriptive labels or metadata to dat… #
Related terms: Metadata, Classification. Example: Applying a “PII” tag to columns containing personal identifiers. Challenge: Ensuring tags are applied consistently and updated as data evolves.
Technical Debt – The accumulated cost of shortcuts, outdated architecture… #
Related terms: Legacy Systems, Refactoring. Example: Maintaining hand‑coded data‑validation scripts that lack version control. Challenge: Prioritizing debt remediation while delivering new functionality.
Third‑Party Risk Management – The assessment and mitigation of risks asso… #
Related terms: Vendor Management, Compliance. Example: Conducting security questionnaires for a cloud‑storage provider. Challenge: Enforcing consistent standards across a diverse supplier ecosystem.
Time‑Series Data – Data points collected sequentially over time, often us… #
Related terms: Streaming Data, Analytics. Example: Recording temperature readings from IoT sensors every minute. Challenge: Managing high‑velocity ingestion while preserving data quality and lineage.
Transparency Report – A public disclosure that outlines how an organizati… #
Related terms: Data Transparency, Privacy. Example: Publishing an annual report detailing the number of data‑subject requests fulfilled. Challenge: Providing sufficient detail without exposing sensitive operational information.
Trust Framework – A set of principles, standards, and technical mechanism… #
Related terms: Data Sharing, Governance. Example: A consortium of banks adopting a common trust framework for secure data exchange. Challenge: Achieving consensus on controls and auditability across independent organizations.
Unstructured Data – Information that does not conform to a predefined dat… #
Related terms: Data Types, Metadata. Example: An archive of customer support call recordings. Challenge: Extracting meaningful metadata and applying governance policies without a fixed schema.
User‑Generated Content (UGC) – Data created by end users, often through s… #
Related terms: Privacy, Content Moderation. Example: Customer reviews posted on an e‑commerce site. Challenge: Monitoring for compliance with content policies and privacy regulations.