Connecting to Hive on Tez
Zoomdata supports Hive versions:
1.1.0 - 1.2.1
. In addition, the supported driver version is:
The table below lists information on the features that are supported by Hive on Tez:
|Supports Distinct Count?||Yes|
|Supports Live Mode/ Playback?||Yes|
|Supports Group-by Time?||Yes|
|Supports Multi Group-by Charts?||Yes|
|Supports Box Plot?||Yes|
|Custom SQL Capable?||Yes|
|Supports Last Value?||Yes|
CONFIGURING THE CONNECTION
For details about what is provided on each page of the connection process, review the article Source Connection Workflow . Depending on your needs, you can either follow the steps in order from start to finish or jump to a specific section in the connection process:
- General Page
When the connector server is set up by the Zoomdata Administrator, certain parameters on the Connectors page may be customized. The instructions below assumes that the default configuration parameters were kept. If this is not the case, then your Connection page may differ slightly from the screen captures and references provided.
- Tables Page
- Fields Page
- Refresh Page
- Charts Page
Log into Zoomdata.
Administrators and users with appropriate access privileges can connect data sources in Zoomdata.
Hive on Tez
Specify the name of your source and add a description (if desired).
- Click Next to continue to the next setup page.
This page defines the connection source for Zoomdata to be able to access the data source. If this is the first time setting up a connection, then you need to input the necessary credentials. If a validated connection already exists, you are given the option to use it.
- To create a new connection, select the Input New Credentials option.
- Enter a unique name for the connection (to help distinguish between other connections in this Zoomdata account).
- Specify the JDBC URL. If authentication has been set up, provide the User Name and Password.
- If required, specify the Hive/YARN queue name in the Queue Name field.
- Specify the server timezone.
If successfully validated, the connection is saved.
The Tables page lets you select the schema and collection to connect with and provides a preview of the selected table. In addition, caching options and toggling the availability of the fields can be done on this page.
- Select the schema, if available, and then select the desired table to connect to Zoomdata.
- Create a Custom SQL query, if needed.
- Toggle the caching options (SparkIt and Caching), as needed.
- Toggle the availability of the fields, as needed.
- Click Next to continue.
The Fields page lets you (1) configure attribute options, (2) create custom labels for the fields in your data source (that will be displayed in the charts), (3) manage the Volume metric, and (4) work with Calculations.
- Determine whether the field should be visible or not to the user.
- Create unique label names, as needed, for each Label field.
- For the Type column, you have the option to edit the field type (although usually you won't need to do this).
- Configure the partition settings. For the partitioned fields you can select one of the following options:
- Date - this option is available for the Time field type. If you select this option, the list of the partitioned columns will be displayed in the Configure column.
column, numeric and time-based fields may be edited:
- Numeric types including Money, Number and Integer - ability to select a default aggregation function
- Time fields - ability to define the default time pattern and granularity; if the time field provides granularities of hour, minute and second, then a time zone label may be applied
- Select fields for Distinct Counts as needed.
- Refresh the connection to a particular field, as desired.
- Configure Filter Display settings for fields.
- Edit the Volume Metric settings, as needed.
, if available and as needed.
If you are setting up a new connection, the Calculations section will not be available until after the connection is saved.
- Click Next to continue.
The Refresh page lets you schedule asynchronous jobs to update the source metadata. For guidance to set up a refresh schedule, refer to the article Using the Zoomdata Scheduler .
On the Charts page, you can:
- Edit Global Default Settings
- Select the Standard and, if available, Custom chart styles to be used with the data source
- Set default parameters (group, sub-group, colors, sorting, and so on) for each chart style
Click Finish to save your changes. Once your data connection has been established, it is listed under the My Data Sources section of the page.
Issue: If you run into a warning message that is displayed when you try to open a dashboard based on a Hive on Tez data source, access this troubleshooting article to resolve.