July 30, 2014

Best way to duplicate a partitioned table in Hive

A simple google search for the above will land you here:
http://grokbase.com/t/hive/user/097w0bsnne/best-way-to-duplicate-a-table

But, I believe a better way is:

Create the new target table with the schema from the old table
Use hadoop fs -cp to copy all the partitions from source to target table
Run MSCK REPAIR TABLE table_name; on the target table

520

Kudos

520

Kudos

It is funny how we have so much information available to us but nobody teaches us how to learn. In college, I struggled with processing vast amounts of information. I would read an article/paper/concept and comprehend only some part of... Continue →

Best way to duplicate a partitioned table in Hive

Now read this

Few Thoughts about Learning