© 2023 Hashnode
#data-engineering
This post is more of a personal guide for this project, showing the importance of knowledge acquired and my pain in the process of deploying the resources of this project. Check-out my project on GitH…
Welcome to yet another insightful blog on data engineering. In the previous blogs, I introduced and defined data engineering. Today I am going to walk you through a little bit of history on the evolut…
What are strings? In Python, anything that you enclose between single or double quotation marks is considered a string. A string is essentially a sequence or array of textual data. Strings are used wh…
In my last issue, I introduced you to the concept of data engineering, today I am going to be delving a bit deeper into the nuances of data engineering definition(s). What is "data engineering," reall…
Preface🚀 This blog post explores whether PySpark can incorporate SOLID principles for data engineering tasks. Here’s a series on SOLID principles with Python for data engineering tasks I’ve been work…
In my limited experience as a Data Engineer, I have often been asked the question, "Who is a data engineer? What do you even do?" And even funnier is explaining my career to my African parent, who see…
In Python, we can take user input directly by using the input() function. This input function gives a return value as a string/character hence we have to pass that into a variable. Syntax: variable=in…
HUDI stands for Hadoop Upserts Deletes Inserts. It is the framework designed by Uber to achieve ACID properties for distributed data in Hadoop or S3. When compared with Apache Iceberg or Apace DeltaLake, It provides more features like MOR(M…
The conversion of one data type to another data type is known as type casting. Python supports a wide variety of function or methods like int(), float(), str(), hex() etc Two types of typecasting - E…
Data engineering is a rapidly growing field that plays a crucial role in the effective management and utilization of data. As organizations increasingly rely on data-driven insights to make informed d…