Set Hive Table Replication

To set the replication factor of a table while loading it to HIVE you need to set the following property on the hive client.

SET dfs.replication=2;
LOAD DATA LOCAL ......;
 
60
Kudos
 
60
Kudos

Now read this

Basic Monitoring for Hadoop Data Nodes

Here’s a basic monitoring script to monitor the HDFS cluster disk space, Temp Dir space and number of data nodes up. This was plenty useful before we switched to Cloudera Manager. #!/usr/bin/env ruby # Checks Hadoop and alerts if there... Continue →