Manage the Hive Connector

The Composer Hive connector lets you access the data available in Hive storage using the Composer client. It can connect to both Hive on Tez and Hive on Tez with LLAP, depending on the JDBC URL you provide (see Connect to Hive below). The Composer Hive connector supports Hive versions 2.1 through 3.1.

Before you can establish a connection from Composer to Hive storage, a connector server needs to be installed and configured. See Manage Connectors and Connector Servers for general instructions and Connect to Hive for details specific to the Hive connector.

After the connector has been set up, you can create data source configurations that specify the necessary connection information and identify the data you want to use. See Manage Data Source Configurations for more information. After data sources are configured, they can be used to create dashboards and visuals from your data. See Create Dashboards.

Composer Feature Support

Hive connector support for specific Composer features is shown in the following table.

Key: P - Supported; O - Not Supported; N/A - Not Applicable

Feature Supported?
Admin-Defined Functions P
Box Plots P
Custom SQL Queries P
Derived Fields (Row-Level Expressions) P
Distinct Counts P
Fast Distinct Values N/A
Group By Multiple Fields P
Group By Time P
Group By UNIX Time P
Histogram Floating Point Values P
Histograms P
Kerberos Authentication P
Last Value P
Live Mode and Playback P
Multivalued Fields N/A
Nested Fields N/A
Partitions P
Pushdown Joins for Fusion Data Sources P
Schemas P
Text Search N/A
TLS P
User Delegation P
Wild Card Filters P
Wild Card Filters, Case-Insensitive Mode P
Wild Card Filters, Case-Sensitive Mode P

Connect to Hive

To establish a connection to Hive, you must specify a JDBC URL on the Connection page of your Composer data source definition for the Hive connection.

  • Specify the JDBC URL for Hive.
  • If authentication has been set up, provide the user name and password.
  • If required, specify the Hive/YARN queue name in the Queue Name box.
  • Specify the server timezone. If your Hive server runs in UTC, leave the Server Timezone box blank. Otherwise, enter the timezone abbreviation in all caps (for example, EST, EDT, or CST) so that time data is handled correctly.
  • Select Validate to test the connection.
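As an illustration of what goes in the JDBC URL box, the following is a minimal sketch that assembles a URL in the standard HiveServer2 form `jdbc:hive2://host:port/database;key=value`. The helper name `hive_jdbc_url`, the host `hive.example.com`, and the default port 10000 are assumptions for the example; use the values for your own Hive server.

```python
def hive_jdbc_url(host, port=10000, database="default", session_params=None):
    """Build a HiveServer2 JDBC URL (hypothetical helper, not part of Composer)."""
    url = f"jdbc:hive2://{host}:{port}/{database}"
    # Session parameters such as ssl=true follow the database name,
    # each one introduced by a semicolon.
    for key, value in (session_params or {}).items():
        url += f";{key}={value}"
    return url


# Example host name is a placeholder.
print(hive_jdbc_url("hive.example.com"))
print(hive_jdbc_url("hive.example.com", session_params={"ssl": "true"}))
```

Paste the resulting string into the JDBC URL box, then select Validate to confirm that the server is reachable.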

Connecting to Hive LLAP requires a different JDBC URL. If you use Hortonworks Data Platform (HDP), you can copy the URL from Ambari. See https://docs.cloudera.com/HDPDocuments/HDP3/HDP-3.1.4/performance-tuning/content/hive_connect_clients_to_llap.html.
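For orientation, an LLAP URL on HDP typically points at the ZooKeeper quorum and uses service discovery instead of naming a HiveServer2 host directly. The sketch below shows that shape. The helper name, the ZooKeeper host names, and the `hiveserver2-interactive` namespace are assumptions for illustration; the URL shown in Ambari for your cluster is the authoritative value.

```python
def hive_llap_jdbc_url(zk_hosts, namespace="hiveserver2-interactive", database=""):
    """Build a ZooKeeper-discovery JDBC URL for Hive LLAP (hypothetical helper)."""
    # The quorum is a comma-separated list of host:port pairs.
    quorum = ",".join(f"{host}:{port}" for host, port in zk_hosts)
    return (
        f"jdbc:hive2://{quorum}/{database}"
        f";serviceDiscoveryMode=zooKeeper;zooKeeperNamespace={namespace}"
    )


# Placeholder ZooKeeper hosts; replace with the quorum from your cluster.
print(hive_llap_jdbc_url([("zk1.example.com", 2181), ("zk2.example.com", 2181)]))
```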

See also Connect to Hive Sources on A Kerberized HDP Cluster.

Migrate Your Hive Connectors

In Zoomdata 3.7, the Hive on Tez connector was renamed Hive. If your installation used the Hive on Tez connector in releases prior to 3.7, you will have two connectors after the upgrade: Hive on Tez and Hive. The Hive on Tez connector is outdated and should no longer be used.

To migrate your existing data source configurations and connections to use the new Hive connector:

  1. Copy any configuration properties you had customized in the Hive on Tez connector's edc-tez.properties configuration file to the new Hive connector's edc-hive.properties configuration file. See Connector Properties and Property Files.
  2. Verify that the new Hive connector, with the zoomdata-edc-hive package name, is running and enabled. See Manage Connectors and Connector Servers.
  3. Log in to Composer as the supervisor.
  4. Open the supervisor menu and then select Connectors. The Manage Connector Services page appears.
  5. At the bottom of the Manage Connector Services page, in the Connectors table (not the Connector Servers table), locate and select the Hive on Tez connector. The Edit Hive on Tez Connector page appears.
  6. Select the Hive connector from the drop-down list in the Connector Server field.

    Changing the Connector Server field clears the User Attribute checkboxes in the Connector Parameters list on the Edit Hive on Tez Connector page. Before you make this change, note which connector parameters are marked as User Attributes.

  7. If any connector parameters listed on the Edit Hive on Tez Connector page had the User Attribute checkbox selected, select the checkbox again.
  8. Select Save.
  9. Disable the old Hive on Tez connector with the package name zoomdata-edc-tez. See Manage Connectors and Connector Servers.

Your existing data source configurations and connections will now work with the new Hive connector. The old Hive on Tez connector server can be deleted. See Manage Connectors and Connector Servers.
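Step 1 of the migration (copying customized properties from edc-tez.properties to edc-hive.properties) can be done by hand, but a short script may help when many properties were changed. The following is a minimal sketch, not a supported tool. It assumes both files use simple `key=value` lines and that property names are the same in both connectors, which you should confirm in Connector Properties and Property Files. The function names and the property keys used in the example are hypothetical.

```python
from pathlib import Path


def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and # or ! comments."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith(("#", "!")) or "=" not in line:
            continue
        key, value = line.split("=", 1)
        props[key.strip()] = value.strip()
    return props


def merge_properties(old_text, new_text):
    """Return new_text with keys that exist only in old_text appended."""
    existing = parse_properties(new_text)
    missing = {k: v for k, v in parse_properties(old_text).items()
               if k not in existing}
    if not missing:
        return new_text
    lines = [new_text.rstrip("\n")] if new_text.strip() else []
    lines.append("# Copied from edc-tez.properties")
    lines += [f"{k}={v}" for k, v in missing.items()]
    return "\n".join(lines) + "\n"


def migrate(old_path, new_path):
    """Append missing properties from old_path to the file at new_path."""
    old_file, new_file = Path(old_path), Path(new_path)
    new_file.write_text(
        merge_properties(old_file.read_text(), new_file.read_text()))


# Hypothetical property names, for illustration only.
old = "example.custom.timeout=30\nshared.key=1\n"
new = "shared.key=2\n"
print(merge_properties(old, new))
```

The sketch deliberately leaves keys that already exist in edc-hive.properties untouched, so review those by hand if you had changed their values in the old file. It also does not handle line continuations or `:` separators.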

Troubleshooting

If a warning message appears when you try to open a dashboard based on a Hive data source, see Resolve the Hive Timeout Warning Message.