Introducción a BI y Data Warehouse
¿Qué es BI y Data Warehousing?
Niveles de analítica y jerarquía del conocimiento
Conceptos de BI: Data Warehouse, Data Mart, Dimensiones y Hechos
Base de datos OLTP vs. OLAP
Metodologías de Data Warehouse
Quiz: Introducción a BI y Data Warehouse
Modelos dimensionales
Data Warehouse, Data Lake y Data Lakehouse: ¿Cuál utilizar?
Tipos de esquemas dimensionales
Dimensiones lentamente cambiantes
Dimensión tipo 1
Dimensión tipo 2
Dimensión tipo 3
Tabla de hechos (fact)
Configuración de herramientas para Data Warehouse y ETL
Modelado dimensional: identificación de dimensiones y métricas
Modelado dimensional: diseño de modelo
Quiz: Modelos dimensionales
ETL para inserción en Data Warehouse
Documento de mapeo
Creación del modelo físico
Extracción: querys en SQL
Extracción en Pentaho
Transformación: dimensión de cliente
Carga: dimensión de cliente
Soluciones ETL de las tablas de dimensiones y hechos
Parámetros en ETL
Orquestar ETL en Pentaho: job
Revisión de todo el ETL
Quiz: ETL para inserción en Data Warehouse
Cierre
Reflexiones y cierre
You don't have access to this class
Keep learning! Join and start boosting your career
In order to create an effective dimensional model, it is essential to start from well-defined business questions. A business question is one that asks the customer's or organization's needs, which we hope to solve with data. The more business questions we manage to formulate, the more information we will gather and the more robust and adequate model we will obtain to answer multiple situations.
We will not always have the data needed to answer all the business questions. We may have to request permissions or the data may simply not exist. It is crucial to be transparent with the organization about current limitations and how they might be overcome in the future. In addition, it is essential to be clear about the vision of the model and its constraints to manage customer and internal team expectations.
Once the business questions have been formulated, the next step is to identify dimensions and metrics. Recall that the dimensions are the perspectives from which we want to analyze the metrics; the latter are the quantifiable aspects of the data.
Consider this business question, "How many units of each product have been sold to each customer in a given time?"
In this way, we identify that to know the units sold, the dimensions of product, time and customer are needed.
Assume the question, "How much hiring has been done by area in a specific country?"
The correct identification of dimensions and metrics allows us to create a model that provides useful answers for the business.
Let's look at another example to better understand how to work with business questions.
Let's consider the question, "How much have the discounts and net sales been in quantities and values by month and day?"
Measures or metrics:
Dimensions:
Another important question could be, "How much has net sales grown or declined as of the March 2013 cutoff for each salesperson?"
Indicators:
Dimensions:
To understand the market, we could ask, "What is the best-selling product per day, by category?"
Measurements:
Dimensions:
Finally, when asked the question: "Who is the customer who has purchased the most units in the last year?", they are identified:
Measurements:
Dimensions:
By clearly identifying the dimensions and metrics, we can design a well-structured data model that facilitates decision making based on deep and accurate analytics.
Contributions 1
Questions 3
Want to see more contributions, questions and answers from the community?