hadoop2.7.7+habse2.0.5+zookeeper3.4.14+hive2.3.5单机安装


环境 腾讯云centos7 

1、hadoop下载

http://mirror.bit.edu.cn/apache/hadoop/common/hadoop-2.7.7/hadoop-2.7.7.tar.gz

2、解压

tar -xvf hadoop-2.7.7.tar.gz -C /usr/java

3、修改hadoop-2.7.7/etc/hadoop/hadoop-env.sh文件

将jdk环境添加进去:
# The java implementation to use.
export JAVA_HOME=/usr/java/jdk1.8

4、添加hadoop环境变量

    HADOOP_HOME=/usr/java/hadoop-2.7.7
    MAVEN_HOME=/usr/java/maven3.6
    RABBITMQ_HOME=/usr/java/rabbitmq_server
    TOMCAT_HOME=/usr/java/tomcat8.5
    JAVA_HOME=/usr/java/jdk1.8
    CLASSPATH=$JAVA_HOME/lib/
    PATH=$PATH:$JAVA_HOME/bin:$TOMCAT_HOME/bin:$RABBITMQ_HOME/sbin:$MAVEN_HOME/bin:$HADOOP_HOME/bin
    export PATH JAVA_HOME CLASSPATH TOMCAT_HOME RABBITMQ_HOME MAVEN_HOME HADOOP_HOME

   环境变量生效:source /etc/profile

5、修改hadoop-2.7.7/etc/hadoop/core-site.xml 

  
    
        fs.defaultFS
        hdfs://localhost:9000
    
    
    
        hadoop.tmp.dir
        /usr/java/hadoop-2.7.7/tmp
    

6、修改hadoop-2.7.7/etc/hadoop/hdfs-site.xml 

  
        
            dfs.name.dir
            /usr/java/hadoop-2.7.7/hdfs/name
            namenode上存储hdfs名字空间元数据 
        

        
            dfs.data.dir
            /usr/java/hadoop-2.7.7/hdfs/data
            datanode上数据块的物理存储位置
        
        
        
            dfs.replication
            1
        
    

7、免密登陆 

    ssh-keygen -t rsa
    cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys

8、hdfs启动与停止

    ./bin/hdfs namenode -format  #初始化,必须对namenode进行格式化
        出现:19/08/13 09:46:05 INFO common.Storage: Storage directory /usr/java/hadoop-2.7.7/hdfs/name has been successfully formatted。说明格式化成功!
        
      ./sbin/start-dfs.sh  #启动hadoop
        (base) [root@medecineit hadoop-2.7.7]# ./sbin/start-dfs.sh 
        Starting namenodes on [localhost]
        The authenticity of host 'localhost (127.0.0.1)' can't be established.
        ECDSA key fingerprint is SHA256:SLOXW/SMogWE3wmK/H310vL74h0dsYohaSF31oEsdBw.
        ECDSA key fingerprint is MD5:fe:a4:15:38:15:e7:32:c3:9f:c3:8e:43:c6:80:6b:ac.
        Are you sure you want to continue connecting (yes/no)? yes
        localhost: Warning: Permanently added 'localhost' (ECDSA) to the list of known hosts.
        localhost: starting namenode, logging to /usr/java/hadoop-2.7.7/logs/hadoop-root-namenode-medecineit.out
        localhost: starting datanode, logging to /usr/java/hadoop-2.7.7/logs/hadoop-root-datanode-medecineit.out
        Starting secondary namenodes [0.0.0.0]
        The authenticity of host '0.0.0.0 (0.0.0.0)' can't be established.
        ECDSA key fingerprint is SHA256:SLOXW/SMogWE3wmK/H310vL74h0dsYohaSF31oEsdBw.
        ECDSA key fingerprint is MD5:fe:a4:15:38:15:e7:32:c3:9f:c3:8e:43:c6:80:6b:ac.
        Are you sure you want to continue connecting (yes/no)? yes
        0.0.0.0: Warning: Permanently added '0.0.0.0' (ECDSA) to the list of known hosts.
        0.0.0.0: starting secondarynamenode, logging to /usr/java/hadoop-2.7.7/logs/hadoop-root-secondarynamenode-medecineit.out

      ./sbin/stop-dfs.sh   #停止hadoop

9、查看是否启动相应的节点

  jps命令查看
        (base) [root@medecineit hadoop-2.7.7]# jps
                    4416 NameNode
                    4916 Jps
                    4740 SecondaryNameNode
                    4553 DataNode
                    975 Bootstrap

    说明NameNode,SecondaryNameNode,DataNode启动成功。

10、web界面查看

http://ip:50070

11、配置yarn -->mapred-site.xml

        复制一份文件:cp mapred-site.xml.template mapred-site.xml
    
        
        
            mapreduce.framework.name
            yarn
            

12、配置yarn-site.xml文件

    
    
        yarn.nodemanager.aux-services
        mapreduce_shuffle
    

13、启动/停止yarn

        ./sbin/start-yarn.sh  #启动
            
            (base) [root@medecineit hadoop-2.7.7]# ./sbin/start-yarn.sh 
            starting yarn daemons
            starting resourcemanager, logging to /usr/java/hadoop-2.7.7/logs/yarn-root-resourcemanager-medecineit.out
            localhost: starting nodemanager, logging to /usr/java/hadoop-2.7.7/logs/yarn-root-nodemanager-medecineit.out
        
            (base) [root@medecineit hadoop-2.7.7]# jps
                8469 ResourceManager
                8585 NodeManager
                8812 Jps
                975 Bootstrap
                
        然后再启动hdfs : ./sbin/start-dfs.sh 

            (base) [root@medecineit hadoop-2.7.7]# jps
                8469 ResourceManager
                9208 DataNode

                9401 SecondaryNameNode
                9065 NameNode
                8585 NodeManager
                9550 Jps
                975 Bootstrap


        ./sbin/stop-yarn.sh    #停止

14、web界面查看yarn

http://ip:8088

单机hadoop和yarn的配置完毕!

########zookeeper安装###########

1、下载地址

https://mirrors.tuna.tsinghua.edu.cn/apache/zookeeper/zookeeper-3.4.14/zookeeper-3.4.14.tar.gz

2、解压

tar -xvf zookeeper-3.4.14.tar.gz -C /usr/java/

3、修改配置文件

    cp zoo_sample.cfg  zoo.cfg 
    将数据保存到zookeeper的data目录中
    dataDir=/usr/java/zookeeper-3.4.14/data

4、启动zookeeper

    ./bin/zkServer.sh start  #启动

    ./bin/zkServer.sh status #查看状态

zookeeper完毕!

#######hbase安装##########

1、下载地址

https://www.apache.org/dyn/closer.lua/hbase/2.0.5/hbase-2.0.5-bin.tar.gz

2、解压

tar -xvf hbase-2.0.5-bin.tar.gz -C /usr/java/

3、修改hbase-env.sh

export JAVA_HOME=/usr/java/jdk1.8/

4、修改hbase-site.xml


                
                  hbase.rootdir
                  hdfs://medecineit:9000/hbase
                
                
                  hbase.cluster.distributed
                  true
                
                
                  hbase.zookeeper.quorum
                  medecineit
                
                
                  dfs.replication
                  1
                

    hbase.master.dns.nameserver
    medecineit
    DNS
  

  
    hbase.regionserver.dns.nameserver
    medecineit
    DNS
  

         hbase.security.authentication
         simple
      
    
      hbase.security.authorization
      false
    

    hbase.regionserver.hostname
    medecineit

##注意,红色的部分一定要加,否则远程连接hbase报错!

5、修改 regionservers

改为主机名:medecineit

6、启动hbase

 ./bin/start-hbase.sh #启动
(base) [root@medecineit hbase-2.0.5]# jps 8469 ResourceManager 16902 Jps 16823 HRegionServer 9208 DataNode 16152 QuorumPeerMain 9401 SecondaryNameNode 9065 NameNode 16681 HMaster 8585 NodeManager 975 Bootstrap 表明已经启动了HRegionServer,HMaster。

7、web访问

http://ip:16010/master-status

8、启动hbase shell进行表的操作

./bin/hbase shell  #启动hbase shell

完毕!

 #####关闭顺序####

停止集群服务的顺序
停止spark集群
master>spark/sbin/stop-slaves.sh
master>spark/sbin/stop-master.sh
停止hbase集群
master>stop-hbase.sh
停止yarn集群
master>stop-yarn.sh
停止hadoop集群
master>stop-dfs.sh
停止zookeeper集群
master>runRemoteCmd.sh “zkServer.sh stop” zookeeper
停止集群服务完毕!

#####hive安装######

1、下载安装包

https://www-eu.apache.org/dist/hive/hive-2.3.5/apache-hive-2.3.5-bin.tar.gz

2、解压

tar -xzvf apache-hive-2.3.5-bin.tar.gz

3、配置hive-env.sh  

export HADOOP_HOME=/usr/java/hadoop-2.7.7
export HIVE_CONF_DIR=/usr/java/hive-2.3.5/conf
export HIVE_AUX_JARS_PATH=/usr/java/hive-2.3.5/lib

4、配置vim hive-site.xml文件

<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>

  
    javax.jdo.option.ConnectionURL
    jdbc:mysql://medecineit:3306/hive?createDatabaseIfNotExist=true
    JDBC connect string for a JDBC metastore
  
  
    javax.jdo.option.ConnectionDriverName
    com.mysql.jdbc.Driver
    Driver class name for a JDBC metastore
  
  
    javax.jdo.option.ConnectionUserName
    root
    username to use against metastore database
  
  
    javax.jdo.option.ConnectionPassword
    yang156122
    password to use against metastore database
  

5、添加配置文件

cp hive-exec-log4j2.properties.template hive-exec-log4j2.properties

cp hive-log4j2.properties.template hive-log4j2.properties

6、启动hive

./hive --service hiveserver2  #启动

./beeline -u jdbc:hive2://localhost:10000  #测试 -beeline工具测试使用jdbc方式连接

http://ip:10002/  #web界面

完毕!