Tag Archives: Hive

File could only be replicated to 0 nodes instead of minReplication (=1)

Working on a newly configured cluster. I can browse HDFS, but any write attempt produces the error from the title.

I can also create and view tables in Hive, but any insert attempt fails with the same error.

Hive2 metastore configuring

By default, Hive runs with an embedded Derby metastore, which allows only one connection at a time. This article shows how to run Hive with a Derby network server instead. It assumes Hive is installed in the /opt/hive folder.

  1. Download Derby from https://db.apache.org/derby/derby_downloads.html and unpack it to the /opt/derby folder
  2. Start the Derby server: nohup /opt/derby/bin/startNetworkServer &
  3. Edit /opt/hive/conf/hive-site.xml
  4. Start Derby
  5. Initialize the metastore
  6. Start hiveserver2 and beeline; both should work simultaneously

    Then check the HiveServer2 web page in a browser: http://localhost:10002
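The steps above could be scripted roughly as follows. This is only a sketch under assumptions that are not in the post: the function names are hypothetical, the database name metastore_db and the ports 1527 (Derby's default) and 10000 (HiveServer2's default) are assumed, and the 5-second pause is a guess at how long Derby needs to start.

```shell
#!/bin/sh
# Hypothetical helper: print the two metastore properties that point Hive
# at a Derby network server. Paste the output inside <configuration> in
# /opt/hive/conf/hive-site.xml. The database name is an assumed default.
derby_metastore_properties() {
  db_name="${1:-metastore_db}"
  cat <<EOF
<property>
  <name>javax.jdo.option.ConnectionURL</name>
  <value>jdbc:derby://localhost:1527/${db_name};create=true</value>
</property>
<property>
  <name>javax.jdo.option.ConnectionDriverName</name>
  <value>org.apache.derby.jdbc.ClientDriver</value>
</property>
EOF
}

# Hypothetical wrapper for the start/init steps; nothing runs until it is called.
run_hive_with_derby() {
  nohup /opt/derby/bin/startNetworkServer &            # start the Derby network server
  sleep 5                                              # assumed startup delay
  /opt/hive/bin/schematool -dbType derby -initSchema   # initialize the metastore schema
  nohup /opt/hive/bin/hiveserver2 &                    # start HiveServer2 in the background
  /opt/hive/bin/beeline -u jdbc:hive2://localhost:10000
}
```

Call derby_metastore_properties to get the fragment for hive-site.xml, then run_hive_with_derby once the file is saved. Hive most likely also needs the Derby client driver jar on its classpath for the ClientDriver class to be found.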

     

Hive2 installation

I recently installed Hive 2.1.1 on Ubuntu 16.04. The steps may not be optimal, but they worked for me.

Installation

1. Download Hive from https://hive.apache.org/downloads.html and unpack the archive to the /opt/hive folder

2. Copy /opt/hive/conf/hive-default.xml.template to hive-site.xml in the same folder

Edit hive-site.xml and replace all occurrences of ${system:java.io.tmpdir}/${system:user.name} with /tmp/hive
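The copy and the replacement can also be done from the command line. A minimal sketch, assuming the conf folder is /opt/hive/conf; the function name create_hive_site is hypothetical:

```shell
#!/bin/sh
# Hypothetical helper: create hive-site.xml from the template in the given
# conf folder, then point the temp-dir placeholders at /tmp/hive.
create_hive_site() {
  conf_dir="${1:-/opt/hive/conf}"
  cp "$conf_dir/hive-default.xml.template" "$conf_dir/hive-site.xml" || return 1
  # \$ matches a literal dollar sign; -i.bak edits in place and keeps a backup copy.
  sed -i.bak \
    's|\${system:java\.io\.tmpdir}/\${system:user\.name}|/tmp/hive|g' \
    "$conf_dir/hive-site.xml"
}
```

Run it as create_hive_site /opt/hive/conf. The original file is kept next to it as hive-site.xml.bak in case the edit needs to be reverted.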

Install Hive on Ubuntu

Configuration

My configuration is Apache Hive 0.13.0 on a machine with Ubuntu 14.04 and Apache Hadoop 2.2.0

(About Hadoop installation – http://dmitrypukhov.pro/install-hadoop-on-ubuntu)

Download and unpack Hive

Download the latest Hive release from the Apache web site: http://www.apache.org/dyn/closer.cgi/hive/

Unpack it to the /opt/hive folder. Change the owner to the Hadoop user and group (hduser and hadoop in my case):

$ sudo chown -R hduser:hadoop /opt/hive
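The unpack step might look like the sketch below. The function name unpack_hive is hypothetical, and it assumes the release archive is a .tar.gz with a single top-level directory, which --strip-components=1 drops so the files land directly in the target folder.

```shell
#!/bin/sh
# Hypothetical helper: unpack a Hive release tarball into a target folder.
unpack_hive() {
  tarball="$1"
  dest="${2:-/opt/hive}"
  mkdir -p "$dest" || return 1
  # Drop the archive's top-level directory so bin/, conf/, lib/ sit directly under $dest.
  tar -xzf "$tarball" -C "$dest" --strip-components=1
}
```

For example, unpack_hive apache-hive-0.13.0-bin.tar.gz /opt/hive, where the archive file name is an assumption. Writing under /opt normally requires root, so run it as root or unpack elsewhere and move the folder with sudo before the chown above.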
