Metadata and Data Governance
Expert-defined terms from the Masterclass Certificate in AI for Metadata course at London School of Planning and Management. Free to read, free to share, paired with a globally recognised certification pathway.
**Artificial Intelligence (AI)** #
**Artificial Intelligence (AI)**
Concept #
Artificial intelligence is the simulation of human intelligence processes by machines, especially computer systems. These processes include learning (the acquisition of information and rules for using the information), reasoning (using the rules to reach approximate or definite conclusions), and self-correction.
**Big Data** #
**Big Data**
Concept #
Big data refers to extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behavior and interactions, and that cannot be processed and analyzed by traditional data-processing software.
**Data Catalog** #
**Data Catalog**
Concept #
A data catalog is a structured inventory of data assets in an organization, making it easier for data professionals and business users to find, understand, and use data. It provides information about the data's origin, usage, lineage, and relationships, and enables data governance and stewardship.
**Data Dictionary** #
**Data Dictionary**
Concept #
A data dictionary is a collection of descriptions of the data elements or attributes in a data model, database, or software application system. It provides detailed information about the data, including definitions, data types, formats, units of measure, and relationships to other data elements.
**Data Governance** #
**Data Governance**
Concept #
Data governance is the overall management of the availability, usability, integrity, and security of data in an organization. It includes the development and implementation of policies, procedures, and standards to manage and use data effectively and efficiently.
**Data Lake** #
**Data Lake**
Concept #
A data lake is a large, centralized repository that allows you to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure it, and then apply transformations and analyses as needed.
**Data Lineage** #
**Data Lineage**
Concept #
Data lineage is the life-cycle of data, including where it comes from, how it moves and transforms over time, and where it goes. It provides information about the origin, usage, and movement of data, and helps organizations understand how data is used and how it impacts business processes and decisions.
**Data Mart** #
**Data Mart**
Concept #
A data mart is a subset of an organization's data warehouse that is designed to serve a particular business unit or team. It contains a focused and curated set of data that is relevant to the specific needs of the team, and makes it easier for them to access and analyze the data.
**Data Model** #
**Data Model**
Concept #
A data model is a conceptual representation of data structures and relationships in a system. It provides a blueprint for how data is stored, organized, and accessed, and helps ensure that data is consistent, accurate, and reliable.
**Data Quality** #
**Data Quality**
Concept #
Data quality refers to the overall quality and completeness of data in an organization. It includes the accuracy, consistency, timeliness, and relevance of data, and ensures that data is reliable and trustworthy.
**Data Security** #
**Data Security**
Concept #
Data security is the protection of data from unauthorized access, use, disclosure, disruption, modification, or destruction. It includes the development and implementation of policies, procedures, and technologies to ensure the confidentiality, integrity, and availability of data.
**Data Stewardship** #
**Data Stewardship**
Concept #
Data stewardship is the management and oversight of data assets in an organization. It includes the development and implementation of policies, procedures, and standards to ensure the effective and efficient use of data, and the appointment of data stewards to manage and monitor data quality, security, and compliance.
**Deep Learning** #
**Deep Learning**
Concept #
Deep learning is a subset of machine learning that is based on artificial neural networks with representation learning. It can learn and represent data with multiple levels of abstraction, and is used for tasks such as image and speech recognition, natural language processing, and game playing.
**Machine Learning** #
**Machine Learning**
Concept #
Machine learning is a method of data analysis that automates the building of analytical models. It is based on the idea that systems can learn from data, identify patterns, and make decisions with minimal human intervention.
**Metadata** #
**Metadata**
Concept #
Metadata is data that provides information about other data. It includes details such as the data's origin, format, structure, content, and context, and helps users understand, locate, and use the data effectively.
**Neural Networks** #
**Neural Networks**
Concept #
Neural networks are a type of machine learning algorithm that are inspired by the structure and function of the human brain. They consist of interconnected nodes or "neurons" that can learn and represent data with multiple levels of abstraction, and are used for tasks such as image and speech recognition, natural language processing, and game playing.
In conclusion, this glossary provides a comprehensive overview of the key terms… #
It serves as a valuable resource for learners and practitioners who want to understand and apply these concepts in their work. By providing clear, concise explanations and related terms, this glossary helps users navigate the complex landscape of data management and governance, and enables them to make informed decisions about their data assets.