Basics

Business Intelligence with Power BI

Business Intelligence: It's the ability to transform data and information, and in turn, this information into knowledge, to speed up time in terms of decision-making

BI Flow

  • ETL

    • Data extraction from data sources

    • Data transformation (source consolidation, cleaning and transformation of data)

    • Data load into Power BI data warehouse

    • Tools: PowerCenter, SQL Server Integration Services (SSIS), Oracle Data Integrator (ODI), Power Query (in Power BI)

  • Modeling

    • Relationships

    • KPI's

    • Optimization (to answer all business questions)

    • Tools: Erwin Data Modeler, PowerDesigner (SAP modeling tool), Power Pivot (in Power BI)

      • Data Warehouse: Usually serves as the central database of a company or, in other words, the database where all the useful data of an organization is stocked

      • Data Mart: Stores concise and specific data sets used for analysis for a specific department or line of business, such as the sales department

  • Reporting (putting into operation):

    • Data visualization

    • Reports

    • Dashboards

    • Storytelling

    • Tools: Power BI, Qlik, Tableau, and so on

      • Dashboard: A data dashboard is a businesses tool used to help track, analyze, and display data, usually to gain deeper insight into the overall wellbeing of the organization, a department, or even a specific process. In Power BI, it corresponds to the report

      • Balanced scorecard: It's a tool for monitoring the strategic decisions taken by the company based on indicators previously established and that should often permeate through at least four aspects – financial, customer, internal processes, and learning & growth -. In Power BI, it corresponds to the dashboard

Power BI: It's an integral and comprehensive Business Intelligence solution that provides a detailed view of the most critical data within an organization

Business Suite

  • Power BI Desktop: Desktop tool oriented to the development environment, which allows us to connect to various data sources and create reports

  • Power BI Service: Service in the cloud that allows us to establish the entire collaborative environment

  • Power BI Mobile: Mobile application that allows us to view and interact with a report from any mobile device (Android & Apple)

Power BI components

  • Power Query ETL: Extraction, transformation, and loading of data

  • Power Pivot (DAX) Modeling: Answer all business questions

Power BI architecture

Power BI Free architecture

  • Includes 1 GB of storage

  • Isn't possible to collaborate with other users simultaneously (reports, dashboards, and dataset), but it's possible to publish on websites (without security)

  • Report sharing is only possible in public mode (without security)

Power BI PRO architecture

  • Includes 10 GB of storage

  • Can be shared with internal users as long as they also have a PRO license

  • Optional gateway

  • Can actualize data up to 8 times per day

  • The creator (on Desktop), and user (on Service) must have got, each one, a license

Power BI PREMIUM architecture

  • Includes 100 TB of storage

  • Can be shared with internal users without requiring them to have Power BI PRO

  • Greater scalability and performance than shared capacity in Power BI Service

  • Can actualize data up to 48 times per day

  • The creator (on Desktop), and user (on Service) must have got, each one, a license

Notice: None of this can be started with a personal email account, but only with a business or with an educational email account

Connection types

Connection types

Power BI allows us to connect to a wide variety of data sources, from Excel files, SQL Server databases, websites, and so on

Connection types

  • Live Connection or Dynamic: Read from SSAS or from a Power BI Service dataset, in other words, the data is stored outside Power BI (like Direct Query)

  • Direct Query: The data isn't copied since each interaction requests a query to the database

  • Import: The data is copied locally within the Power BI model (this is the most common type)

  • Composite Models: Combines Import and Direct Query technologies. It also allows us to use multiple datasets

Notices:

  • With Import connectivity, the PBIX file and the dataset are hosted in memory, it's the fastest of all connectivity types, but the Power BI dataset must be scheduled to refresh, and we should always keep our memory usage in mind. With PRO licensing, the Power BI dataset cannot exceed 1GB, however, with Power BI PREMIUM this can reach levels of 400GB

  • With DirectQuery connectivity, no data is ingested and stored in the Power BI model. Power BI will only store the metadata of the underlying data, therefore the table names, relationships, field names, etc., but not the actual data. By not storing any data in the Power BI model means we can store larger volumes of data and there's less chance that we'll run into any of the data volume limitations against the Power BI dataset, such as the 1GB dataset that comes with Pro licensing. DirectQuery is only available with database sources

In this connectivity model, queries will be generated and sent directly to the underlying data source to request data before displaying. The data remains on source side, near real time is available, and no scheduled refresh is required, but it can have an impact on the reporting consumers and on the performance of the underlying data source when too many users are generating queries at the same time. Power BI does have some Query Reduction methods to ease the number of queries to source

Using DirectQuery limits the capabilities of Power BI Desktop. For instance, Time-Intelligence DAX functions such as same period last year are not available for the purpose of limiting the complexity of queries sent to the underlying data source and some transformations in Power Query are restricted, such as removing duplicates and changing to some data types. Also, Power BI only allows DirectQuery to return 1M rows from the underlying data source unless we use Power BI PREMIUM

  • It is very common to find LiveConnection being confused with DirectQuery due to both connectivity types not storing any data in the Power BI model. However, they are two very different connectivity types, and they cannot be used interchangeably. Using a LiveConnection means no data is stored in the Power BI model, therefore all interaction with a report using a LiveConnection will directly query the existing Analysis Services model

LiveConnection can be used with SQL Server Analysis Services (Tabular models and Multidimensional Cubes), Azure Analysis Services (Tabular Models), and Power BI Datasets hosted in the Power BI Service. As these sources are analytical engines, the overall performance will be much higher compared to DirectQuery. Furthermore, all these sources can offer a semantic layer that acts as a single version of the truth, offering the business a golden layer of high integrity, well-governed data, therefore it's commonly used in enterprise deployments of Power BI, but only with named data sources

Using a LiveConnection does have some restrictions from an authoring perspective. However, as all the work has been done in Analysis Services or an existing Power BI dataset, we're simply taking advantage of an existing enterprise-level model, therefore it makes sense that restrictions exist on data transformation and data modeling capabilities. Also, some limitations exist on the DAX measures that can be written, as only Report Level Measures are available in it. This means the measures are stored in the Power BI report and not written back to the Analysis Services model. All reporting capabilities are available for use

A LiveConnection establishes a connection to a semantic layer such as Analysis Services or a Power BI dataset. In an enterprise scenario, it's important to have a semantic layer placed between your data source systems and Power BI, as it simplifies the data in the Data Warehouse by making it more usable for the business. Also, it aligns complex data with familiar business terms and removes the need to do any data preparation, therefore offering a complete & consolidated view across the organization

  • With the Composite connectivity models, we can combine data from multiple DirectQuery data sources and mix data from an Import data source. For instance, we can establish a DirectQuery connection to the sales fact table and multiple dimension tables from a SQL Data Warehouse and ingest data using an Import from an Excel spreadsheet that contains budget values. In the Power BI Desktop application, you'll notice the storage mode is set to Mixed in the bottom right, as the below shows

Last updated