samyak jain · dataskills.hashnode.dev · Oct 11, 2023
Apache Hadoop - Getting Started (Understanding the Basics)
Hadoop is an open-source software framework used for storing and processing Big Data in a distributed manner on large clusters of commodity hardware. Hadoop is licensed under the Apache v2 license. Hadoop was developed, based on the paper written by...
Big Data · hadoop

Anuj Kumar · vandata04.hashnode.dev · Jul 12, 2023
Hadoop Architecture (Part 2)
Hadoop Architecture (Part 1): Recap. In the world of big data processing, MapReduce is a powerful computing paradigm that enables distributed data processing. It consists of two phases - Map and Reduce - which provide parallelism and aggregation capab...
1 like · big data
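
The two phases named in the excerpt above can be illustrated with a minimal, single-process word-count sketch. This is not Hadoop's API and is not taken from the linked article: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, and the grouping step that a real cluster performs across machines is simulated here with an in-memory dictionary.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group emitted values by key (done by the framework between phases)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: aggregate all values collected for one key."""
    return key, sum(values)


def word_count(documents):
    """Run map over every document, group by word, then reduce each group."""
    pairs = []
    for doc in documents:
        # Each map call is independent, which is what allows parallelism.
        pairs.extend(map_phase(doc))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "Big cluster"]))
```

Because each `map_phase` call depends only on its own input record, the calls could run on different machines; the reduce step then handles the aggregation for each key.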