Data governance is a methodology ensures data is in the proper condition to support business initiatives and operations. Aligning data governance to business initiatives has many benefits. \n

    \n
  • Justify funding for the data governance program \n
  • Motivate participation by the business communities \n
  • Drive the priority of data governance activities \n
  • Drive the level of data integration required across participating business areas \n
  • Help to determine the right operating model, especially the level of centralization and decentralization required. \n","sortDate":"2023-07-12","headlineUrl":"https://aws.amazon.com/what-is/data-governance/?trk=faq_card","id":"faq-hub#what-is-data-governance","category":"Analytics","primaryCTA":"https://portal.aws.amazon.com/gp/aws/developer/registration/index.html?pg=what_is_header","headline":"What is Data Governance? "},"metadata":{"tags":[{"id":"GLOBAL#tech-category#analytics","name":"Analytics","namespaceId":"GLOBAL#tech-category","description":"Analytics","metadata":{}},{"id":"faq-hub#faq-type#what-is","name":"what-is","namespaceId":"faq-hub#faq-type","description":"

    what-is","metadata":{}}]}}]},"metadata":{"auth":{},"testAttributes":{}},"context":{"page":{"pageUrl":"https://aws.amazon.com/what-is/data-governance/"},"environment":{"stage":"prod","region":"us-east-1"},"sdkVersion":"1.0.129"},"refMap":{"manifest.js":"289765ed09","what-is-header.js":"2e0d22c000","what-is-header.rtl.css":"ccf4035484","what-is-header.css":"ce47058367","what-is-header.css.js":"004a4704e8","what-is-header.rtl.css.js":"f687973e4f"},"settings":{"templateMappings":{"category":"category","headline":"headline","primaryCTA":"primaryCTA","primaryCTAText":"primaryCTAText","primaryBreadcrumbText":"primaryBreadcrumbText","primaryBreadcrumbURL":"primaryBreadcrumbURL"}}}

Data governance includes the processes and policies that ensure data is in the proper condition to support business initiatives and operations. Modern organizations collect data from various sources at scale to enhance operations and service delivery. However, data-driven decision-making is effective only when data meets required quality and integrity standards. \n

Data governance determines roles, responsibilities, and standards for data usage. It outlines who can take what action, upon what data, using what methods, and in what situations. With more data being used to support artificial intelligence (AI) and machine learning (ML) use cases, it has become critical that all data usage meets regulatory and ethical requirements. Data governance balances data security with tactical and strategic objectives to ensure maximum effectiveness.","id":"seo-faq-pairs#whats-data-governance","customSort":"1"},"metadata":{"tags":[{"id":"seo-faq-pairs#faq-collections#data-governance","name":"data-governance","namespaceId":"seo-faq-pairs#faq-collections","description":"

data-governance","metadata":{}}]}},{"fields":{"faqQuestion":"Why is data governance important?","faqAnswer":"

Data governance programs have historically been employed to lock down data in silos to prevent data leakage or misuse. However, the consequence of data silos is that legitimate users must navigate barriers to get access to data when they need it. Inadvertently, data-driven innovation gets stifled. \n

In a 2024 survey of 350 CDOs and CDO-equivalent roles, MIT CDOIQ found that 45% of Chief Data Officers identify data governance as a top priority. These data leaders want to establish a data governance framework that lets them make data available to the right people and applications when needed while keeping the data safe and secure with appropriate controls in place.  \n

Balances access and control \n

You have two levers to make governance an enabler of innovation: access and control. The key to success is finding the right balance between the two—each organization's balancing point is different. When you exercise too much control, the data gets locked up in silos, and users are not able to access the data when they need it. This stifles creativity and leads to the creation of shadow IT systems that leave data out of date and unsecured. In contrast, when you provide too much access, data risks becoming unregulated across applications and data stores, increasing unauthorized access risk and impacting data quality. \n

Data governance processes balance access with control, giving users trust and confidence in the data. They promote appropriate discovery, curation, protection, and data sharing, encouraging innovation while safeguarding the data.","id":"seo-faq-pairs#why-data-gov-important","customSort":"2"},"metadata":{"tags":[{"id":"seo-faq-pairs#faq-collections#data-governance","name":"data-governance","namespaceId":"seo-faq-pairs#faq-collections","description":"

data-governance","metadata":{}}]}},{"fields":{"faqQuestion":"What are the benefits of data governance?","faqAnswer":"

Data governance offers a structured framework for managing data across an organization. Here are some key benefits. \n

Improves data quality \n

Data governance establishes standards for data accuracy, completeness, and consistency. You get relevant, current, easy-to-interpret data that is trusted by all stakeholders. This high-quality data reduces errors and generates accurate and timely insights for strategic and operational decision-making. \n

Supports data-driven culture \n

An effective data governance strategy fosters a culture that values data, encouraging all employees to use and understand data in their work. It motivates business community participation and drives data integration across participating business areas. Alignment between data engineers and business users boosts the organization’s overall data literacy and analytical capabilities. \n

Increases operational efficiency \n

Data governance helps to determine the right operating model, especially the level of centralization and decentralization required. You can establish consistent data management practices that streamline operations. Clearly defined data ownership and access rights facilitate collaboration across departments, ensuring everyone works with the same, reliable data sources. Align efforts across teams to reduce duplication, lower operational costs, and improve productivity. \n

Supports regulatory compliance \n

Data governance frameworks take a proactive approach to risk management, ensuring that data practices align with legal and industry regulations. You can prevent unauthorized access by centrally defined policies for who can access or modify data. Data governance tools support compliance with privacy regulations to protect sensitive data.","id":"seo-faq-pairs#what-are-the-benefits-of-data-governance","customSort":"3"},"metadata":{"tags":[{"id":"seo-faq-pairs#faq-collections#data-governance","name":"data-governance","namespaceId":"seo-faq-pairs#faq-collections","description":"

data-governance","metadata":{}}]}},{"fields":{"faqQuestion":"Who builds data governance?","faqAnswer":"

Building a robust data governance strategy requires many job functions. \n

Executive sponsors \n

They identify and establish data governance principles, standards, and policies across the organization. They also understand many business initiatives on the corporate roadmap and can help determine priorities to drive data governance activities. \n

Data stewards \n

They are from the business and are involved in the day-to-day details of projects. They help understand the data issues that are likely to cause challenges with targeted business initiatives. They also implement the data governance process in their projects and ensure data is managed appropriately. They monitor employee and customer compliance and escalate any issues if they arise. \n

Data owners \n

They make policies about the data, including who should have access to it and under what circumstances, how to interpret and apply regulations, and key term definitions. They are also responsible for your data sets' technical administration and access controls. \n

Data engineers \n

They are from IT and select and implement the best data governance tools to secure data, integrate data from various sources, manage data quality, and find the right data.","id":"seo-faq-pairs#who-builds-data-gov","customSort":"4"},"metadata":{"tags":[{"id":"seo-faq-pairs#faq-collections#data-governance","name":"data-governance","namespaceId":"seo-faq-pairs#faq-collections","description":"

data-governance","metadata":{}}]}},{"fields":{"faqQuestion":"What are the styles of data governance?","faqAnswer":"

Your data governance program should balance centralization and decentralization (including self-service). Throughout your organization, you’ll have a mix of centralized, federated, and decentralized governance—again, depending on the business requirements. You should empower domain teams as much as possible while maintaining coherence across domains (such as the ability to link data together).   \n

Centralized data governance \n

Central organizations are ultimately responsible for mission statements, policies, tool choices, and more. However, day-to-day actions are often pushed into lines of business (LOB). \n

Federated data governance \n

Federated data governance empowers individual business units or initiatives to operate in the way that best matches their needs. However, a smaller centralized team focuses on solving problems that repeat frequently, including enterprise-wide data quality tools, for example. \n

Self-serve or decentralized data governance \n

Each department does what it needs for the specific project while aligning with centralized policies. Each project uses any tools or processes from other projects where there is a fit-for-use. As topics like data mesh (itself decentralized) increase in popularity, so does self-service data governance. ","id":"seo-faq-pairs#styles-data-gov","customSort":"5"},"metadata":{"tags":[{"id":"seo-faq-pairs#faq-collections#data-governance","name":"data-governance","namespaceId":"seo-faq-pairs#faq-collections","description":"

data-governance","metadata":{}}]}},{"fields":{"faqQuestion":"How does data governance work?","faqAnswer":"

Data governance requires people, processes, and technology solutions across a range of capabilities. \n

\"how \n

Curate data at scale to limit data sprawl \n

Curating your data at scale means identifying and managing your most valuable data sources, including databases, data lakes, and data warehouses. You can limit the proliferation and transformation of critical data assets. Curating data also means ensuring that the right data is accurate, fresh, and free of sensitive information so users can have confidence in data-driven decisions and the data feeding applications. \n

Capabilities:  Data quality management, data integration, and master data management \n

Discover and understand your data in context. \n

Understanding your data in context means that all users can discover and comprehend the meaning of their data so they can use it confidently to drive business value. With a centralized data catalog, data can be found easily, access can be requested, and data can be used to make business decisions. \n

Capabilities: data profiling, data lineage, and data catalogs \n

Protect and securely share your data with control and confidence. \n

Protecting your data means striking the right balance between data privacy, security, and access. It’s essential to govern data access across organizational boundaries, using tools that are intuitive for both business and engineering users. \n

Capabilities: Data lifecycle, data compliance, and data security \n

Reduce business risk and improve regulatory compliance. \n

Reducing risk means understanding how that data is being used and by whom. AWS services help you monitor and audit data access—including access through ML models to help ensure data security and regulatory compliance. Machine learning also requires auditing transparency to ensure responsible use and simplified reporting. \n

Capabilities: usage auditing for data and ML \n

 ","id":"seo-faq-pairs#how-does-data-gov-work","customSort":"6"},"metadata":{"tags":[{"id":"seo-faq-pairs#faq-collections#data-governance","name":"data-governance","namespaceId":"seo-faq-pairs#faq-collections","description":"

data-governance","metadata":{}}]}},{"fields":{"faqQuestion":"What are data governance best practices?","faqAnswer":"

The key to effective data governance is to attach to already-funded business initiatives. Make sure your team understands which data domains, sources, and elements are needed to support those initiatives. \n

Next steps on AWS

Check out additional product-related resources
Learn about AWS Analytics Services 
Sign up for a free account

Instant get access to the AWS Free Tier.

Sign up 
Start building in the console

Get started building in the AWS management console.

Sign in