当前位置:Gxlcms > mysql > 手动安装clouderacdh4.2hadoop+hbase+hive(三)

手动安装clouderacdh4.2hadoop+hbase+hive(三)

时间:2021-07-01 10:21:17 帮助过:47人阅读

本文主要记录手动安装cloudera Hive cdh4.2.0集群过程,环境设置及Hadoop、HBase安装过程见上篇文章。 安装hive hive安装在desktop1上,注意hive默认是使用derby数据库保存元数据,这里替换为postgresql,下面会提到postgresql的安装说明,并且需要拷贝postg

本文主要记录手动安装cloudera Hive cdh4.2.0集群过程,环境设置及Hadoop、HBase安装过程见上篇文章。

安装hive

hive安装在desktop1上,注意hive默认是使用derby数据库保存元数据,这里替换为postgresql,下面会提到postgresql的安装说明,并且需要拷贝postgres的jdbc jar文件导hive的lib目录下。

上传文件

上传hive-0.10.0-cdh4.2.0.tar到desktop1的/opt,并解压缩

安装postgres

  • 创建数据库

这里创建数据库metastore并创建hiveuser用户,其密码为redhat。

psql -U postgres

CREATE DATABASE metastore;
 \c metastore;
CREATE USER hiveuser WITH PASSWORD 'redhat';
GRANT ALL ON DATABASE metastore TO hiveuser;
\q

  • 初始化数据库

psql  -U hiveuser -d metastore
 \i /opt/hive-0.10.0-cdh4.2.0/scripts/metastore/upgrade/postgres/hive-schema-0.10.0.postgres.sql 

  • 编辑postgresql配置文件,修改访问权限

[root@desktop1 ~]# vi /opt/PostgreSQL/9.1/data/pg_hba.conf

# IPv4 local connections:
host    all             all             0.0.0.0/0            md5

[root@desktop1 ~]# vi postgresql.conf

standard_conforming_strings = off

  • 重起postgres

su -c '/opt/PostgreSQL/9.1/bin/pg_ctl -D /opt/PostgreSQL/9.1/data restart' postgres

  • 拷贝postgres 的jdbc驱动到/opt/hive-0.10.0-cdh4.2.0/lib

修改配置文件

  • hive-site.xml 注意修改下面配置文件中postgres数据库的密码,注意配置hive.aux.jars.path,在hive集成hbase时候需要从该路径家在hbase的一些jar文件。

[root@desktop1 ~]# cd /opt/hive-0.10.0-cdh4.2.0/conf/
[root@desktop1 conf]# cat hive-site.xml 




  javax.jdo.option.ConnectionURL
  jdbc:postgresql://127.0.0.1/metastore
  JDBC connect string for a JDBC metastore



  javax.jdo.option.ConnectionDriverName
  org.postgresql.Driver
  Driver class name for a JDBC metastore



  javax.jdo.option.ConnectionUserName
  hiveuser
  username to use against metastore database



  javax.jdo.option.ConnectionPassword
  redhat
  password to use against metastore database



 mapred.job.tracker
 desktop1:8031



 mapreduce.framework.name
 yarn



  hive.aux.jars.path
  file:///opt/hive-0.10.0-cdh4.2.0/lib/zookeeper-3.4.5-cdh4.2.0.jar,
    file:///opt/hive-0.10.0-cdh4.2.0/lib/hive-hbase-handler-0.10.0-cdh4.2.0.jar,
    file:///opt/hive-0.10.0-cdh4.2.0/lib/hbase-0.94.2-cdh4.2.0.jar,
    file:///opt/hive-0.10.0-cdh4.2.0/lib/guava-11.0.2.jar



  hive.metastore.warehouse.dir
  /opt/data/warehouse-${user.name}
  location of default database for the warehouse



  hive.exec.scratchdir
  /opt/data/hive-${user.name}
  Scratch space for Hive jobs



  hive.querylog.location
  /opt/data/querylog-${user.name}
  
    Location of Hive run time structured log file
  



  hive.support.concurrency
  Enable Hive's Table Lock Manager Service
  true



  hive.zookeeper.quorum
  Zookeeper quorum used by Hive's Table Lock Manager
  desktop3,desktop4,desktop6,desktop7,desktop8



  hive.hwi.listen.host
  desktop1
  This is the host address the Hive Web Interface will listen on



  hive.hwi.listen.port
  9999
  This is the port the Hive Web Interface will listen on



  hive.hwi.war.file
  lib/hive-hwi-0.10.0-cdh4.2.0.war
  This is the WAR file with the jsp content for Hive Web Interface


  • 环境变量

参考hadoop中环境变量的设置

  • 启动脚本

在启动完之后,执行一些sql语句可能会提示错误,如何解决错误可以参考Hive安装与配置。

[root@desktop1 ~] hive

  • hive与hbase集成 在hive-site.xml中配置hive.aux.jars.path,在环境变量中配置hadoop、mapreduce的环境变量

异常说明

  • FAILED: Error in metadata: MetaException(message:org.apache.hadoop.hbase.ZooKeeperConnectionException: An error is preventing HBase from connecting to ZooKeeper

原因:hadoop配置文件没有zk

  • FAILED: Error in metadata: MetaException(message:Got exception: org.apache.hadoop.hive.metastore.api.MetaException javax.jdo.JDODataStoreException: Error executing JDOQL query "SELECT "THIS"."TBL_NAME" AS NUCORDER0 FROM "TBLS" "THIS" LEFT OUTER JOIN "DBS" "THIS_DATABASE_NAME" ON "THIS"."DB_ID" = "THIS_DATABASE_NAME"."DB_ID" WHERE "THIS_DATABASE_NAME"."NAME" = ? AND (LOWER("THIS"."TBL_NAME") LIKE ? ESCAPE '\' ) ORDER BY NUCORDER0 " : ERROR: invalid escape string 建议:Escape string must be empty or one character..

参考:https://issues.apache.org/jira/browse/HIVE-3994

  • hive> select count(*) from hive_userinfo; 没反应

  • zookeeper.ClientCnxn (ClientCnxn.java:logStartConnect(966)) - Opening socket connection to server localhost/127.0.0.1:2181. Will not attempt to authenticate using SASL (无法定位登录配置)

原因:hive中没有设置zk

  • hbase 中提示:WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable

原因:cloudera hadoop lib中没有hadoop的native jar

  • Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/mapreduce/v2/app/MRAppMaster

原因:classpath没有配置正确,检查环境变量以及yarn的classpath

人气教程排行