Connecting to Apache HBase

Concept Information

ACCESSDATA command

Apache HBase is the Hadoop database that is a distributed, scalable, big data store. You can use the Apache HBase data connector to import your organization's HBase data.

Note

Analytics provides HBase as an optional connector and if it is not available in your Data Access window, it is likely that the connector was not selected during installation. For more information, see Install optional Analytics data connectors and Python engine.

Before you start

To connect to HBase, you must gather the following:

  • the database server's host name or IP address
  • the correct connection port
  • your username and password if using authentication

For help gathering the connection prerequisites, contact the HBase administrator in your organization. If your administrator cannot help you, you or your administrator should contact HBase Support.

Create an HBase connection

  1. From the Analytics main menu, select Import > Database and application.
  2. From the New Connections tab, in the ACL Connectors section, select HBase.

    Tip

    You can filter the list of available connectors by entering a search string in the Filter connections box. Connectors are listed alphabetically.

  3. In the Data Connection Settings panel, enter the connection settings and at the bottom of the panel, click Save and Connect.

    You can accept the default Connection Name, or enter a new one.

The connection for HBase is saved to the Existing Connections tab. In the future, you can reconnect to HBase from the saved connection.

Once the connection is established, the Data Access window opens to the Staging Area and you can begin importing data. For help importing data from HBase, see Working with the Data Access window.

Connection settings

Basic settings

Setting Description Example
Host The IP address or host name of the HBase server.  
Port The port for the Apache HBase server. 9090
Authentication Type

The authentication mechanism to use for the connection to the HBase server. The options available are:

  • No Authentication - The driver does not authenticate the connection.
  • Basic Authentication - The driver authenticates the connection using an HBase user name and password.
No Authentication
User User name to access the HBase instance.  
Password Password corresponding to the user name to access the HBase instance.  

Advanced settings

Setting Description Example
Maximum rows per fetch Maximum number of rows that a query can return per request. 4096
Schema definition row limit Number of rows that the driver samples when generating a schema. 1024