what-is","metadata":{}}]}}]},"metadata":{"auth":{},"testAttributes":{}},"context":{"page":{"pageUrl":"https://aws.amazon.com/what-is/data-mart/"},"environment":{"stage":"prod","region":"us-east-1"},"sdkVersion":"1.0.129"},"refMap":{"manifest.js":"289765ed09","what-is-header.js":"2e0d22c000","what-is-header.rtl.css":"ccf4035484","what-is-header.css":"ce47058367","what-is-header.css.js":"004a4704e8","what-is-header.rtl.css.js":"f687973e4f"},"settings":{"templateMappings":{"category":"category","headline":"headline","primaryCTA":"primaryCTA","primaryCTAText":"primaryCTAText","primaryBreadcrumbText":"primaryBreadcrumbText","primaryBreadcrumbURL":"primaryBreadcrumbURL"}}}

data mart","metadata":{}}]}},{"fields":{"faqQuestion":"How does a data mart compare to other types of data storage systems?","faqAnswer":"

Companies use several different types of data storage systems for data management and analytics. Let’s look at some common types of data storage to understand the context in which companies use data marts. \n

Database \n

A database is organized storage that computer systems use to store, search, retrieve, and analyze information. There are various types of databases, such as relational databases. A relational database stores information in tables consisting of rows and columns. Data in different tables is connected by a unique identifier known as a key. Keys are the non-repetitive values in specific columns. \n

Data mart vs. database \n

A data mart serves as the front-facing element for a department’s data.  You can use a data mart to retrieve and analyze information. Meanwhile, a database collects, manages, and stores information. You can then use tools to process, format, and transfer the stored information to a data mart.  \n

Data warehouse   \n

A data warehouse is an extensive database system that stores information for an entire business. It collects raw information from various sources, such as business software and social media feeds, and processes it into structured data stored in a tabular format. Businesses can connect an enterprise data warehouse to business intelligence tools to make smarter decisions.  \n

Data mart vs. data warehouse \n

A data mart shares many of the qualities of a data warehouse. Where they differ is that a data warehouse contains enterprise-wide data about various topics. Meanwhile, a data mart stores information closely related to a specific subject. For example, a data warehouse might store information for the marketing, human resources, procurement, and customer support departments. However, a data mart might store only transactional data relevant to a single department. The appeal of building a data mart is that departments who manage their data marts have complete control over the loading and management of their data.  \n

Many organizations are using technologies like data sharing to publish their data marts to a central data warehouse.  By doing so they can be more agile by distributing ownership and isolating workloads.  Similarly, data sharing allows departmental data marts to consume data shared from a data warehouse or other data marts. \n

Data lake  \n

A data lake is data storage that holds raw and unstructured information. It does not store information in files and folders. Instead, it stores unprocessed information in a flat hierarchy on massive storage. Data lakes store different types of raw information, including text documents, images, videos, and audio.  \n

Data analysts use data lakes to conduct predictive analysis from unstructured data. For example, a data lake might store texts from social media reviews that businesses can use for sentiment analysis. Data analysts can use sentiment analysis to detect negative opinion trends for a company.  \n

Data mart vs. data lake \n

Because data lakes store unprocessed data, some of the information might be duplicates or might not be meaningful to the company. Meanwhile, a data mart stores processed data that meets a specific need. A data lake could be the source of a data mart. Businesses determine data trends by looking at historical data in data marts, but they use data lakes to analyze the stored information deeply.  \n

OLAP \n

Online Analytical Processing (OLAP) is a method to represent data in multiple dimensions. For example, data analysts use an OLAP cube to simultaneously show sales revenue based on months, cities, and products. OLAP data structures are wide, with fields classified as either facts or dimensions and result in data duplication.  This contrasts with conventional relational databases, which favor narrow structures and little data duplication. \n

Data mart vs. OLAP cube \n

OLAP is a specific information storage strategy which denormalizes data into wide tables. OLAP simplifies complex representations of multidimensional data. Some data marts might use OLAP to structure their information, but others use conventional, normalized structures. Business analysts benefit from OLAP structures to visualize information from a data mart.  \n

Operational data store \n

An operational data store (ODS) is information storage that acts as an intermediary between data sources and the data warehouse. Data analysts use the ODS to provide near-real-time reporting about transactional data. The ODS supports simple queries and provides only a limited amount of information. For example, the ODS might store sales records only for the past 12 hours.  \n

Data mart vs. ODS  \n

A data mart extracts subject-oriented information from a data warehouse, but an ODS sends information into the data warehouse for processing. Data marts offer historical information that you can analyze, but an ODS provides an updated view of current operations. For example, you can use a data mart to identify sales patterns for the past quarter but receive hourly sales figure updates from the ODS. ","id":"seo-faq-pairs#how-does-a-data-mart-compare-to-other-types-of-data-storage-systems","customSort":"2"},"metadata":{"tags":[{"id":"seo-faq-pairs#faq-collections#data-mart","name":"data-mart","namespaceId":"seo-faq-pairs#faq-collections","description":"

data mart","metadata":{}}]}},{"fields":{"faqQuestion":"Why is a data mart important? ","faqAnswer":"

These are some good reasons that companies might use a data mart.  \n

Retrieve data more efficiently \n

By using a data mart, companies can access specific information more efficiently. Compared to a data warehouse, a data mart contains relevant and detailed information that a department accesses frequently. Therefore, business managers don’t need to search the entire data warehouse to generate performance reports or graphics. \n

Streamline decision-making \n

Companies can create a subset of data from a data warehouse with a data mart. Employees within the department can then analyze the data and make decisions based on the same set of information.  \n

Control information more effectively \n

A data mart gives employees highly granular access privileges. This means the company can authorize a certain person to view or retrieve specific data. It helps companies to improve data governance and enforce information access policies. For example, you can use data marts to provide user access to employees for specific information in a data warehouse. \n

Manage data flexibly \n

A data mart is smaller and contains fewer tables than a data warehouse. This means data engineers can manage and change information in a data mart without causing major database changes.","id":"seo-faq-pairs#why-is-a-data-mart-important","customSort":"3"},"metadata":{"tags":[{"id":"seo-faq-pairs#faq-collections#data-mart","name":"data-mart","namespaceId":"seo-faq-pairs#faq-collections","description":"

data mart","metadata":{}}]}},{"fields":{"faqQuestion":"How does a data mart work? ","faqAnswer":"

A data mart turns raw information into structured, meaningful content for a specific business department. To do this, data engineers set up a data mart to receive information either from a data warehouse or directly from external data sources.  \n

When it is connected to a data warehouse, the data mart retrieves a selection of information that is relevant to a business unit. Often, the information contains summarized data and excludes unnecessary or detailed data.  \n

ETL  \n

Extract, transform, and load (ETL) is a process for integrating and transferring information from various data sources into a single physical database. Data marts use ETL to retrieve information from external sources when it does not come from a data warehouse. The process involves the following steps. \n

AWS Data Mart next steps

Check out additional product-related resources
View free offers for Analytics services in the cloud 
Sign up for a free account

Instantly get access to the AWS free tier. 

Sign up 
Start building in the console

Get started building in the AWS Management Console.

Sign in