Partitions in Hive
Hello friends, today we will discuss partitioning in Hive and ways to use it. A major problem with Hive is that, on a table that is not partitioned, even a simple query with a WHERE clause reads the entire dataset. This hurts efficiency and becomes a bottleneck when we need to run queries on large tables, but the issue can be overcome by implementing partitions on Hive tables.

Partitions in Hive: Partitioning is a way to organize a table by dividing it into parts based on the values of partition keys such as date, city, and department. A table can have one or more partition keys. Partition keys are the basic elements that determine how the data is stored in the table: each distinct combination of key values gets its own subdirectory under the table's data directory. In the case of tables that are not partitioned, all the files in the table's data directory are read and the filters are then applied in a subsequent phase. This becomes a slow and expensive affair, especially in