Zoomdata Version

Data Sources Quick Reference Sheet: Capabilities and Limitations

Zoomdata connects to a variety of data sources from Cloudera to Search to SQL-based databases and flat files:

Figure 1

Each data connector has capabilities and limitations when connected to the Zoomdata Server. This quick reference article highlights these details, which include:

  • The data source version that is supported (and tested) by Zoomdata, if applicable
  • Whether the following functionalities are supported:
    • Distinct count
    • Real time streaming
    • Microquerying
    • SparkIt
    • Group-by time
    • Multi-group by visuals

The capabilities and limitations are broken out into the following tables below:

Table 1: List of Tables

Table Description
2 The data source version supported by Zoomdata
3
  • Distinct count support
  • Real time streaming support
  • MicroQuery
4
  • SparkIt
  • Group-by time
  • Multi-group by visuals

Table 2: Data Sources Supported Version

(Only data sources with a known supported version number are listed.)
Data Source Supported Version
Amazon Redshift 1.0+
Apache Phoenix Phoenix 4.5 (HBase 1.1)
Apache Solr 4.8.1, 5.2, 5.3
Cloudera Impala 1.4.2+
Cloudera Search Solr 4.8.1 + CDH 4.8
Elastic Search 1.0 - 1.6
Google Analytics Core Reporting API Version 3.0
HDFS CDH4+
Hive on EMR Hive 1.0.x - 1.2.1
Hive on Tez Hive 1.0.x - 1.2.1
Marketo REST API Version 1
MemSQL MemSQL 3.2
MongoDB 2.6.x, 3.x
MySQL 5.6.13+
Oracle 11g Release 2+
PostgreSQL 9.3.3+
Salesforce Metadata API Version 34
SendGrid REST API v3
Spark SQL 1.3.1-1.5.1
SQL Server 2012
Zendesk REST API v2

Table 3: Distinct Count, Real Time, MicroQuery Support

Data Source Supports Distinct Count? Supports Live Streaming? MicroQuery Capable?
Amazon Aurora Yes No No
Amazon Kinesis Yes Yes Yes
Amazon Redshift Yes No Yes
Apache Phoenix Yes No Yes
Apache Solr No No Yes
Cloudera Impala Yes* No Yes
Cloudera Search No No Yes
Elastic Search No No Yes
Flat Files
(CSV, JSON, TSV, XML) via Web Browser
Yes No Yes
Google Analytics Yes No No
HDFS Yes* No No
Hive on EMR Yes No No
Hive on Tez Yes No No
Marketo Yes No No
MemSQL Yes No No
MongoDB Yes No Yes
MySQL Yes** No No
Oracle Yes** No No***
PostgreSQL Yes** No No***
S3 Yes* No No
Salesforce Yes No No
SendGrid Yes No No
Spark SQL Yes** No No
SQL Server Yes** No No
Twitter Yes Yes Yes
Zendesk Yes No No
Upload API Yes Yes Yes
  • *For Impala , S3 , and HDFS , only one Distinct Count metric is supported per chart.
  • **When SparkIt is enabled for SQL connectors, only one Distinct Count metric is supported per chart.
  • ***MicroQuery can be toggled ON with feature toggle.

Table 4: SparkIt Capable, Group-by Time, Multi-Group-By Visual, Box Plot, and Histogram Support

Data Source SparkIt Capable? Supports Group-by Time? Supports Multi-Group By Charts? Supports Histogram Charts Supports Box Plot Charts
Amazon Aurora Yes Yes Yes Yes No
Amazon Kinesis No Yes Yes No No
Amazon Redshift No Yes Yes Yes Yes
Apache Phoenix No Yes Yes Yes No
Apache Solr No Partially** Partially** Paritally*** Partially***
Cloudera Impala Yes Yes Yes Yes No
Cloudera Search No Partially** No No No
Elastic Search No Yes Yes Yes Yes
Flat Files
(CSV, JSON, TSV, XML) via Web Browser
No Yes Yes No No
Google Analytics Yes, required Yes Yes Yes No
HDFS Yes, required Yes Yes Yes No
Hive on EMR Yes Yes Yes Yes No
Hive on Tez Yes Yes Yes Yes No
Marketo Yes, required Yes Yes Yes No
MemSQL Yes Yes Yes Yes No
MongoDB No Yes Yes No No
MySQL Yes, optional Yes Yes Yes No
Oracle Yes, optional Yes Yes Yes Yes
PostgresSQL Yes, optional Yes Yes Yes No
S3 Yes, required Yes Yes Yes No
Salesforce Yes, required Yes Yes Yes No
SendGrid Yes, required Yes Yes Yes No
Spark SQL No Yes Yes Yes Yes
SQL Server Yes, optional Yes Yes Yes Yes
Twitter No Yes Yes No No
Zendesk Yes, required Yes Yes Yes No
Upload API No Yes Yes No No
  • *Group-by Time is not supported for live/streaming sources.
  • **Solr v5.2 and later versions support Group-by Time; older versions do not.
  • ***Limitations apply.