Connecting to Amazon S3
Amazon Simple Storage Service (S3) provides a “web service interface that can be used to store and retrieve any amount of data, at any time, from anywhere on the web .” Zoomdata connects to S3 sources using the Apache Spark processing framework.
: Excerpted from AWS Documentation “ What is Amazon S3? ”
As a result, Amazon's S3 source utilizes Zoomdata's embedded Spark server. For information, access the article Configuring an Embedded Spark Server in Zoomdata .
CONFIGURING THE S3 CONNECTOR
After setting up Spark, follow the steps below to connect Zoomdata to your Amazon S3 source:
Click Next .
Specify the path to file. This is the path to remote file that you want to be uploaded into Zoomdata.
(you can use this publicly available dataset:
Select the Read Headers checkbox if you want to use the first row of your data source as column names.
Specify the Value Separator that is in your data source. Standard separators include commas (,) and semi-colons (;).
Toggle the caching setting (by default caching is enabled).
Click Next . On the Fields page you can create unique label names for the available fields in your data source. These labels will be displayed in the charts.
- If necessary, change the Type and Default options, select the checkboxes in the Distinct Count column. Configure Filter Display settings for the required fields.
- Click Next to continue.
- On the Refresh page, you can schedule asynchronous jobs to refresh fields in your data source. Refer to Using the Zoomdata Scheduler article for more information.
- On the Charts page you can enable the charts that will be available for the data source and edit the settings for your charts. That is, select the styles that will be available for the data source, change the global default settings, and more. Learn more about how to customize a chart . Click Finish to save your changes.