Tableau is the most powerful, secure, flexible, and fastest-growing data visualization tool used in the Business Intelligence Industry. It elevates people with the power of data. It is the best way to change or transform the raw set of data into an easily understandable format with zero technical skills and limited coding knowledge. Let’s start with data blending in Tableau.
What is Data Blending?
Blending means mixing or combining things. Usually, creating a new dataset required lots of effort, and it will be time-consuming. You may have sets of related data from different datasets and needed to analyze together using common fields/dimensions, and for this some time, it might be required to write code, query, or use special applications.
In a simple language, we can explain Data Blending a process which helps the business analyst to combine data from multiple datasets into a functional dataset.
Advantage of Data Blending Over traditional ETL model
Data Blending offers tantamount advantages over the traditional ETL model.
One method is to use joins to set up a new data connection over the traditional ETL data warehouse. First, we need to identify “left” and “right” tables out of two tables and then run a query on the same, which will return the records from the entire left table. Our query will produce a join that displays all data from the left-table and additional rows from the right table. In return, this query adds duplicate rows each time it finds a matching field in the left-table.
This is one of the significant disadvantages of joins because of many duplications.
Let us try to understand the same using below example
Left Join: Left join returns complete set of records/rows from the left table and including data from a right table which because each row has corresponding matching rows in the left table.
For example, suppose you have the following tables Table A and Table B. Columns in Table A (DeptId, DepartmentName) and Columns in Table A ( EmpId, EmpName, DeptID).
Blending Data in Tableau
Data blending can be used to blend data from 2 separates data sources that you want to analyze together on a single sheet in Tableau.
To exemplify data blending in Tableau, I am going to use Salesforce and Excel (2 Different Datasets) with one common field among them and will blend them within the tableau.
Step 1: Connect to Data Source both Salesforce and Excel
- First register on www.salesforce.com to get a trial subscription and navigate to account object in Sales application.
- Open Tableau and select connector as Salesforce connector.
- Provide login credentials provide during registration and click on Login
- Once you received verification code in your registered email provide same and click on verify
- In the next step, allow access. You will be connected to Salesforce, drag, and drop “Account” table from Table section.
- Click on Sheet 1. You can preview dimensions and measures on the left side along with current datasets.
- Click on Data New Data Source, select the second data connector, and connect to the second set of data. In our case, we will be connecting to the Excel dataset.
- Once the second dataset is added, you can preview both datasets added in the data section.
- If Tableau found common fields between both datasets, then it will automatically blend datasets. If not, then we need to edit the connection and configure it manually.
- Click on Data Edit Relationships Choose Custom from dialog box Click “Add” Select common fields from Primary Data Source Field, and Secondary Data Source Field Click on Click on “Ok” again to close the relationship dialog box.
- Once the relationship dialog box is closed, you will note that a small link image appears next to the relationship field in dimension. In our case, it will be “Name.” This indicates the relationship field between two data sources.
Data Blending Limitations
- Data Blending is limited while working with Non-additive aggregates like MEDIAN, COUNTD, and RAWSQLAGG.
- Data blending will affect visualization & data query speed.
- Data Source size will be limited after Data Blending.
- Cube data cannot be used as a Secondary data source for blending data in Tableau.
Starting a career? Search here!
Source- Grey Campus