This documentation describes the integration of MindsDB with Apache Hive, a data warehouse software project built on top of Apache Hadoop for providing data query and analysis. Hive gives an SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop. The integration allows MindsDB to access data from Apache Hive and enhance Apache Hive with AI capabilities.
Before proceeding, ensure the following prerequisites are met:
Establish a connection to Apache Hive from MindsDB by executing the following SQL command and providing its handler name as an engine.
Required connection parameters include the following:
host
: The hostname, IP address, or URL of the Apache Hive server.database
: The name of the Apache Hive database to connect to.Optional connection parameters include the following:
username
: The username for the Apache Hive database.password
: The password for the Apache Hive database.port
: The port number for connecting to the Apache Hive server. Default is 10000
.auth
: The authentication mechanism to use. Default is CUSTOM
. Other options are NONE
, NOSASL
, KERBEROS
and LDAP
.Retrieve data from a specified table by providing the integration and table names:
Run HiveQL queries directly on the connected Apache Hive database:
The above examples utilize hive_datasource
as the datasource name, which is defined in the CREATE DATABASE
command.
Database Connection Error