Zoomdata Version

Connecting to Amazon Redshift

Zoomdata's Amazon Redshift connector lets you access the data available in the Amazon Redshift storage for visualization and exploration within the Zoomdata Client. Zoomdata supports Amazon Redshift versions 1.0.1 and later.

BEFORE YOU BEGIN

Before you can establish a connection between Amazon Redshift and Zoomdata, you must verify that the size of the maximum transmission unit (MTU) on your Zoomdata Server is set to 1500.

The MTU size determines the maximum size, in bytes, of a packet that can be transferred in one Ethernet frame over your network connection. If your MTU size is too large for the connection, you might experience incomplete query results, your query might hang, or the connection might be dropped altogether(1).
(1) Excerpted from aws documentation for Amazon Redshift: " Queries Appear to Hang in the Client and Do Not Reach the Cluster ."

To review the MTU value, use the "ip" command:

$ ip addr show eth0

If you need to edit the MTU value and set the size to 1500, use the following "ip" command:

$ ip link set dev eth0 mtu 1500
Redshift returns whole numbers for aggregates on columns of type DECIMAL and NUMERIC types which have a 0 scale (in other words, 0 decimal places).

Prerequisites

The table below lists information on the features that are supported by Amazon Redshift:

Supports Distinct Count? Yes
Supports Live Mode/ Playback? Yes
SparkIt Capable? Yes
Supports Group-by Time? Yes
Supports Multi Group-by Charts? Yes
Supports Histogram? Yes
Supports Box Plot? Yes
Custom SQL Capable? Yes
Supports Last Value? No
Supports Partition? No

 

Starting with Zoomdata v2.5.3, the Amazon Redshift connector requires a JDBC driver to be configured before you can connect to your data source. You can download the driver from the vendor's site. Be aware that the JDBC Driver for Redshift contains more than one jar file. You need to place them in the same location to avoid any issues.

If you are upgrading, keep in mind you will need to configure the JDBC driver (refer to the article Upgrading Zoomdata Server to v2.5 for instructions). For more information, see Adding a JDBC Driver.

CONFIGURING THE CONNECTION

For details about what is provided on each page of the connection process, review the article Source Connection Workflow . Depending on your needs, you can either follow the steps in order from start to finish or jump to a specific section in the connection process:

Start

  1. Log into Zoomdata.
  2. Click the Sources menu item.

Figure 1

  1. Click the Redshift icon.

General Page

  1. Specify the name of your source and add a description (if desired).

Figure 2

  1. Click Next to continue to the next setup page.

Connection Page

This page defines the connection source for Zoomdata to be able to access the data source. If this is the first time setting up a connection, then you need to input the necessary credentials. If a validated connection already exists, you are given the option to use it.

  1. To create a new connection, select the Input New Credentials option.
  2. Enter a unique name for the connection (to help distinguish between other connections in this Zoomdata account).
  3. Specify the JDBC URL. You can find the URL on the Configuration tab of a cluster under Cluster Database Properties . The format varies slightly based on the type of database being connected. For Redshift, use the following format: jdbc:redshift://HOSTNAME:PORT/DATABASE_NAME
    If authentication has been set up, provide the User Name and Password.
  4. Click Validate .
    If successfully validated, the connection is saved.


Figure 3

Tables Page

The Tables page lets you select the schema and collection to connect with and provides a preview of the selected collection. In addition, caching options and toggling the availability of the fields can be done on this page.

  1. Select the schema, if available, and then select the desired collection to connect to Zoomdata.

Figure 4

  1. Create a Custom SQL query, if needed.
Zoomdata wraps your SQL query into a SELECT statement. If specific statements inside the wrapped query are not supported by your data source, the query will not be executed.
  1. Toggle the caching option, as needed.
  2. Toggle the availability of the fields, as needed.
  3. Click Next to continue.

Figure 5

Fields Page

The Fields page lets you (1) configure attribute options, (2) create custom labels for the fields in your data source (that will be displayed in the charts), (3) manage the Volume metric, and (4) work with Calculations.

  1. Determine whether the field should be visible or not to the user.
  2. Create unique label names, as needed, for each Label field.
When you create a data source, the specific number of distinct values for the attribute fields are saved in Zoomdata depending on the data sample from your data set. You can filter the data on your chart by these values. While editing a data source, if you want to use all distinct values in the filter (that is from whole data source), click the Refresh button in the Statistics column.
  1. For the Type column, you have the option to edit the field type (although usually you won't need to do this).
  2. For the Configure column, numeric and time-based fields may be edited:
    • Numeric types including Money, Number and Integer - ability to select a default aggregation function
    • Time fields - ability to define the default time pattern and granularity; if the time field provides granularities of hour, minute and second, then a time zone label may be applied
  1. Select fields for Distinct Counts as needed.
  2. Refresh the connection to a particular field, as desired.
  3. Configure Filter Display settings for fields.
  4. Edit the Volume Metric settings, as needed.
  5. Work with Calculations , if available and as needed.
    If you are setting up a new connection, the Calculations section will not be available until after the connection is saved.
  6. Click Next to continue.

Figure 6

Refresh Page

The Refresh page lets you schedule asynchronous jobs to update the source metadata. For guidance to set up a refresh schedule, refer to the article Using the Zoomdata Scheduler .

Charts Page

On the Charts page, you can:

  1. Edit Global Default Settings .
  2. Select the Standard and, if available, Custom chart styles to be used with the data source.
  3. Set default parameters (group, sub-group, colors, sorting, and so on) for each chart style.


Figure 7

Learn more about how to customize a chart .

Click Finish to save your changes. Once your data connection has been established, it will be listed under the My Data Sources section of the page.