Setting Up Data Fusion
To fuse data in Zoomdata, you must first connect to the data sources that you want to be joined. After successful connection, use the Fusion Source Editor to bring together the disparate datasets as a new type of data source . Once the datasets have been joined together, you are able to explore and analyze the fused dataset in visualizations and dashboards.
For an overview of Zoomdata’s data fusion capability, refer to the article Data Fusion Overview . Zoomdata is able to join all of its available pre-built data connectors (from big data sources to flat files, see Figure 1).
Data Fusion Setup Checklist
Use the following checklist to prepare and plan for test together data sources:
- Ensure the targeted data sources are already connected to Zoomdata. If not, refer to the Connecting to Data section of our Support Portal for instructions on connecting the desired data sources.
- Verify that there are matching attributes across the targeted data sources (for example, Sellers) and fact data (Sales) as shown in Figure 2.
- Identify the attributes that will be fused (using the Fusion Editor) as shown in Figure 3.
Data Fusion Setup
The Data Fusion ( ) connector setup uses a workflow different from other data source connectors. This section will review some of the new interfaces in preparation for the step-by-step instructions that follows (for setting up data fusion).
CONFIGURING THE CONNECTION
For details about what is provided on each page of the connection process, review the article Source Connection Workflow . Depending on your needs, you can either follow the steps in order from start to finish or jump to a specific section in the connection process:
- Log into Zoomdata.
- Click the Sources menu option.
Click the Fusion connector icon.
- Name your connector and add a description (if desired).
- Click Next to continue to the next setup page.
This page lets you select the data sources to be joined together. A comprehensive list of all available data sources that can be joined and accessible by the user is displayed in Select Data Source(s) to Create Fusion (see Figure 6), (1). There is a Search bar to help you locate sources when there are a large amount of data sources available for fusion.
Select the checkbox for each data source you want to join. Your selections will be added to the fusion list to the right. You can provide an alias for each data source, delete a source that is not needed, and download the configuration details for the data source (if available and you need to export to another Zoomdata instance).
The right section (see Figure 6, (2) provides a breakdown of the data sources you have selected and some basic information and options including:
- Data source type
- Name of the data source
A user-defined field (to custom label the data source, as shown in Figure 5). Note that the only special character that can be used is the underscore (_) symbol.
Delete: the option to remove a selected data source
Download: the option to obtain the configuration/connection information (if exporting to another Zoomdata instance)
You can sort this list of data sources by either the Data Source Name or Source Alias columns.
The Editor page is where you join the disparate data sources together into one dataset. The Data Source(s) Reference section to the left of the page (as shown in Figure 8) identifies all the attributes and metrics available for each data source (by clicking the ( ) to the right of the data source name).
In order to build a fused attribute, you need to select attributes from the list of available data sources and drag over to the fusion editor workspace (as shown in Figure 7).
Fields from disparate data sources can be joined together in the following ways (as highlighted in Figure 10 and explained in the table below):
|1||Create new fused attributes||Fused attributes appear in a chart's attribute list like any other attribute|
|2||Setup a new form in an existing fused attribute||The form allows attributes from disparate data sources to be fused together|
|3||Build existing form by adding new attributes||Fuses the attributes which are similar from the disparate sources|
When dragging and dropping a new attribute into the workspace, use the guide areas (outlined with dashed lines) to position the attribute to the desired location (as shown in Figure 11).
Once you have created fused attributes and forms, you can provide custom labels to use in the chart canvas. Attributes can be deleted from the workspace, as needed, by selecting the icon next to each attribute.
In addition, you can specify whether an attribute is ‘Unique’ (which defines the collection of source fields in which the attribute form’s values are unique per record).
The Fields page for Data Fusion differs from other connectors due to the Fused Attributes that are created. Figure 10 shows a screen capture of the data fusion Fields page with the Fused Attributes section displayed.
This Fields page contains three sections:
- Fused Attributes
- All Other Fields
Shortcut text links (at the top of the page) lets you quickly jump to each of the sections on the page (as shown in Figure 12).
The Fused Attributes section lists all of the attributes that you joined in the Editor . The following user-defined options are available:
- Visible: lets you toggle a particular form to be either visible in a chart’s attributes list or not
- Default: identifies the form(s) that you want to start with when visualizing fused attributes
- Enable Inner Join: compares the rows across all tables (for the fused attributes) and returns the ones that match
All Other Fields lists the fields that are available in each of the data sources you selected for fusion (as shown in Figure 13). These attributes and metrics are also accessible in your charts. These fields also have user-defined options:
- Visible: toggle the visibility of the field
- Label: provide custom name for the field
set the default parameter for the field:
Numeric types including Money,
Number,and Integer - ability to select a default aggregation function
- Time fields - ability to define the default time pattern and granularity; if the time field provides granularities of hour, minute and second, then a time zone label may be applied
- Numeric types including Money,
On the Fields page you can:
- Review the Fused Attributes fields:
- Determine whether to make the field visible for users.
- Select the fields that will serve as the Default forms for display of an attribute.
- Determine whether to enable inner join for each attribute.
- Edit the Volume Metric settings, as needed.
Work with Calculations, if available and as needed.
If you are setting up a new connection, the Calculations section will not be available until after the connection is saved.
- Click Next to continue.
On the Charts page, you can:
- Edit Global Default Settings .
- Select the Standard and, if available, Custom chart styles to be used with the data source.
- Set default parameters (group, sub-group, colors, sorting, and so on) for each chart style.
- Not all chart types may be available for your fused source (these will be grayed out in the list)
- Learn more about how to customize a chart .
Click Finish to save your changes. Once your data connection has been established, it is listed under the My Data Sources section of the page.