the-data-whisperer.hashnode.dev · Snowflake
Data Lake vs Data Warehouse
A Data Lake is a storage repository that holds raw, unprocessed data in its native format. This can include structured data, semi-structured data (like JSON or XML), and unstructured data (images, videos, logs). The data in a Data Lak...
Oct 4, 2025 · 4 min read

Apache Spark and Databricks
1. Distributed Computing Overview. Definition: A computing model where large tasks are divided and executed across multiple machines (nodes) in parallel. Key Features: Speed: Tasks run simultaneously on multiple nodes. Scalability: Add nodes to ha...
Oct 4, 2025 · 6 min read

Git Commands
1. Git Basics. Check Git version: git --version. Key Concepts: Repository: Folder where Git tracks project history. Clone: Copy a remote repo to your local machine. Stage: Select changes to include in the next commit. Commit: Snapshot of staged ...
Oct 4, 2025 · 4 min read

Python for Data Analytics
🗂️ Reading and Writing Files in Python. Python provides built-in functions to handle file operations such as creating, reading, writing, and appending. Files are handled using the open() function. 1. Syntax of open(): file = open(filename, mode) ...
Oct 4, 2025 · 9 min read

Python Basics
1. Introduction to Python. Python is an interpreted, dynamically typed, high-level programming language. It supports multiple paradigms: procedural, object-oriented, and functional programming. Code blocks are defined by indentation (not braces...
Oct 4, 2025 · 4 min read