Setting Up Data Fusion
To fuse data in Zoomdata, you must first connect to the data sources that you want joined. After successful connection, use the Fusion Source Editor to bring together the disparate datasets as a new type of data source . Once the datasets have been joined together, you will be able to explore and analyze the fused dataset in visualizations and dashboards.
For an overview of Zoomdata’s data fusion capability, refer to the article Overview of How Zoomdata Handles Data Fusion . Zoomdata is able to join all of its available pre-built data connectors (from big data sources to flat files, see Figure 1).
Data Fusion Setup Checklist
Use the following checklist to prepare and plan for test together data sources:
- Ensure the targeted data sources are already connected to Zoomdata. If not, refer to the Connecting to Data section of our Support Portal for instructions on connecting the desired data sources.
- Verify that there are matching attributes across the targeted data sources (for example, Sellers) and fact data (Sales) as shown in Figure 2.
- Identify the attributes that will be fused (using the Fusion Editor) as shown in Figure 3.
Data Fusion Setup
The Data Fusion ( ) “connector” setup uses a workflow different from other data source connectors. This section will review some of the new interfaces in preparation for the step-by-step instructions that follows (for setting up data fusion).
The Data Sources Page
This page lets you select the data sources to be joined together. A comprehensive list of all available data sources that can be joined and accessible by the user is displayed in Select Data Source(s) to Create Fusion (see Figure 4), (1). There is a Search bar to help you locate sources when there are a large amount of data sources available for fusion. A checkbox is provided next to each data source for easy selection or deselection of the source.
The right section (see Figure 4, (2) provides a breakdown of the data sources you have selected and some basic information and options including:
- Data source type
- Name of the data source
A user-defined field (to custom label the data source, as shown in Figure 5). Note that the only special character that can be used is the underscore (_) symbol.
Delete: the option to remove a selected data source
Download: the option to obtain the configuration/connection information (if exporting to another Zoomdata instance)
You can sort this list of data sources by either the Data Source Name or Source Alias columns.
The Editor page is where you join the disparate data sources together into one dataset. The Data Source(s) Reference section to the left of the page (as shown in Figure 6) identifies all the attributes and metrics available for each data source (by clicking the ( ) to the right of the data source name).
In order to build a fused attribute, you need to select attributes from the list of available data sources and drag over to the fusion editor workspace (as shown in Figure 7).
Fields from disparate data sources can be joined together in the following ways (as highlighted in Figure 8 and explained in the table below):
|1||Create new fused attributes||Fused attributes appear in a chart's attribute list like any other attribute|
|2||Setup a new form in an existing fused attribute||The form allows attributes from disparate data sources to be fused together|
|3||Build existing form by adding new attributes||Fuses the attributes which are similar from the disparate sources|
When dragging and dropping a new attribute into the workspace, use the guide areas (outlined with dashed lines) to position the attribute to the desired location (as shown in Figure 9).
Once you have created fused attributes and forms, you can provide custom labels that will be used in the chart canvas. Attributes can be deleted from the workspace, as needed, by selecting the icon next to each attribute.
In addition, you can specify whether an attribute is ‘Unique’ (which defines the collection of source fields in which the attribute form’s values are unique per record).
The Fields page for Data Fusion differs from other connectors due to the Fused Attributes that are created. Figure 10 shows a screen capture of the data fusion Fields page with the Fused Attributes section displayed.
This Fields page contains three sections:
- Fused Attributes
- All Other Fields
Shortcut text links (at the top of the page) lets you quickly jump to each of the sections on the page (as shown in Figure 10).
The Fused Attributes section lists all of the attributes that you joined in the Editor . The following user-defined options are available:
- Visible: lets you toggle a particular form to be either visible in a chart’s attributes list or not
- Default: identifies the form(s) that you want to start with when visualizing fused attributes
- Enable Inner Join: compares the rows across all tables (for the fused attributes) and returns the ones that match
All Other Fields lists the fields that are available in each of the data sources you selected for fusion (as shown in Figure 11). These attributes and metrics are also accessible in your charts. These fields also have user-defined options:
- Visible: toggle the visibility of the field
- Label: provide custom name for the field
- Default: set the default parameter for the field (value, time format, and so on)
Use the search bar to help you locate specific fields in this section. In addition, specify whether the Volume metric field should be visible by default and provide a custom label, if desired.
The Calculations section will be available only after you have set up the data fusion source. You can create calculations for fused attributes just like any other available metric or attribute.
Setting Up the Data Fusion Connector
Perform the following steps to configure the connector and set up data fusion.
- Log into Zoomdata as an Administrator.
- Select the Sources menu option.
- Select the Fusion connector icon ( ).
- Name your connector and add a description (if desired).
- Select Next to continue.
page, select the checkbox for each data source you want to join.
Your selections will be added to the fusion list to the right. You can provide an alias for each data source, delete a source that is not needed, and download the configuration details for the data source (if available and you need to export to another Zoomdata instance).For the Source Alias column, the only special character that can be used is the underscore (_) symbol.
- Use the Fusion Editor to join the datasets together.
You can join fields from disparate data sources in the following ways:
- New Fused Attribute
- New form in an existing fused attribute
- Build existing form by adding new attributes
In addition, you can specify whether an attribute is ‘Unique’ (which defines the collection of source fields in which the attribute form's values are unique per record).
- Select Next to continue.
- On the Fields page, review the Fused Attributes fields.
- Determine whether to make the field visible for users.
- Select the fields that will serve as the Default forms for display of an attribute.
- Determine whether to enable inner join for each attribute.
- Select Next to continue.
- Set default parameters for chart styles as needed.
to save your work.
If you need help setting up the default chart parameters, refer to the article Example Chart Setup for more info.