Pratyusha · technodiary.hashnode.dev · Dec 9, 2023
Unlocking the Basics: Key Concepts and Practical Applications - Part III
Week1 - Continuation. Article Outline: HDFS Architecture - Basic Overview; Week1 Summary. HDFS Architecture - Basic Overview: HDFS, short for Hadoop Distributed File System, caters to applications dealing with large datasets, ranging from gigabyte...
Series: Cloud Focused Big Data Series · Tag: hdfs commands
AATISH SINGH · aatishintodata.hashnode.dev · Aug 3, 2023
Hadoop Namenode Failure Management
Let's #hadoop 📌 Some insights on Namenode failure management in Hadoop 📢 ✔ Managing NameNode failures in Hadoop is crucial to ensure high availability and fault tolerance in the Hadoop Distributed File System (HDFS). ✔ The NameNode is a critical co...
Tag: hadoop
AATISH SINGH · aatishintodata.hashnode.dev · Aug 1, 2023
Hadoop #Split-Brain Scenario & #Fencing
Let's #hadoop 📌 What is #Split Brain Scenario and #fencing in Hadoop? ✔ In the context of distributed systems, including Hadoop clusters, a "#split_bra...
Tag: hadoop
padmanabha reddy · padmanabha.hashnode.dev · Mar 17, 2023
Hadoop Commands and Usage Log
Introduction: Hadoop is a popular open-source framework used for distributed storage and processing of large datasets. With its powerful capabilities, Hadoop is used by many organizations and data professionals for big data analytics and processing ...
Tag: hdfs
padmanabha reddy · padmanabha.hashnode.dev · Jan 6, 2023
HDFS Architecture and working
Hadoop Distributed File System (HDFS) is the world’s most reliable storage system. It is best known for its fault tolerance and high availability. What is Hadoop HDFS? HDFS stores very large files running on a cluster of commodity hardware. It works o...
Tag: hdfs
Yash Srivastava · blog.yashsrivastava.link · Dec 21, 2022
Basic HDFS Architecture
What is HDFS? HDFS (Hadoop Distributed File System) is one of the three Hadoop core components. As its name suggests, it's a distributed storage file system where large files are divided into multiple blocks of size 128 MB (64 MB in Hadoop 1) and stored acros...
Tag: hdfs
Aditya Gupta · itsadityagupta.hashnode.dev · Dec 17, 2022
Hadoop Distributed File System
Introduction: HDFS is a distributed file system designed to run on commodity hardware. It is highly fault-tolerant and is designed to be deployed on low-cost hardware, providing high-throughput access to application data, and is suitable for applications that h...
Tag: hadoop
KISHORE K REDDY · kishore1.hashnode.dev · Dec 13, 2022
Apache Pig: High-Level Data Flow Platform
Introduction: Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that ...
Tag: #pig
Snehal Hosmani · snehosmani.com · Oct 7, 2022
Paper Summary: The Google File System
The Google File System (GFS) is a distributed file system created by Google for large distributed data-intensive applications. Unlike HDFS (Hadoop Distributed File System), which is open source, GFS is a proprietary system. Both these and other distrib...
Tag: hdfs
Sudheer Paturi · sdcodes.hashnode.dev · Jul 24, 2022
Apache Sqoop References
🚨 I'm writing this blog as a reference to myself regarding all things related to databases. If you find the content a bit unstructured or if it doesn't make sense, that's the reason why. Export data from Hadoop to the MySQL database using Sqoop: sqoo...
Tag: hdfs