Nov 19, 2022 · 2 min read · What is Hive ? Hive is built on top of Apache Hadoop, which is an open-source framework used to efficiently store and process large datasets. As a result, Hive is closely integrated with Hadoop, and is designed to work quickly on petabytes of data. W...
Join discussion
Oct 6, 2022 · 2 min read · Why Hive? In Big Data ecosystem we have 3 main components, Distributed File System - Storage Layer Map Reduce - Processing Engine YARN - Resource Manager Before Hive we(Data folks) used Java as a scripting tool to interact with these components. Bu...
Join discussion