Sunday, 11 December 2016

Find the space used by a Hive table

There are a few ways to identify the space used by a Hive table.

Unfortunately, my Hive installation wouldn't accept either of these commands (and I haven't worked out why):
  • hive > SHOW TBLPROPERTIES <TableName>("rawDataSize")
  • hive > DESCRIBE EXTENDED <TableName>
So I resorted to the Hadoop file system shell, pointing it at the table's storage location:
  • hadoop fs -du -h <URI>
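If you want the total as a number rather than reading it off the listing, the same command can be wrapped in a small script. The following is only a rough sketch, not something from my original setup: the function names are made up for illustration, and it assumes that `hadoop fs -du` (run without `-h`) prints one line per entry with the size in bytes as the first column and the path as the last column. Newer Hadoop releases add a second size column, which is why the path is taken from the end of the line. Paths containing spaces are not handled.

```python
import subprocess


def parse_du_output(text):
    """Parse `hadoop fs -du` output (no -h) into a {path: size_in_bytes} dict.

    Assumption: the first column is the size in bytes and the last column
    is the path. Lines that do not start with a number are skipped.
    """
    sizes = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 2 or not fields[0].isdigit():
            continue
        sizes[fields[-1]] = int(fields[0])
    return sizes


def table_size_bytes(table_dir):
    """Sum the sizes of the entries directly under table_dir.

    Runs the hadoop CLI, so it needs `hadoop` on the PATH.
    """
    result = subprocess.run(
        ["hadoop", "fs", "-du", table_dir],
        check=True,
        capture_output=True,
        text=True,
    )
    return sum(parse_du_output(result.stdout).values())
```

For example, `table_size_bytes("/user/hive/warehouse/my_table")` would return the total in bytes, where the path is a hypothetical table location; substitute whatever URI you would pass to `hadoop fs -du`.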