Data Formats and Compression in Data Engineering: Best Practices for CSV, Excel, JSON, Parquet, and… (Published in Data Engineer Things, Feb 4)
Guide to PySpark: Transformations, Techniques, and Best Practices. PySpark, the Python API for Apache Spark, simplifies big data processing with distributed computing capabilities. It supports real-time… (Published in Data Engineer Things, Jan 24)
Deep Dive into Data Extraction: The First Step in Data Lifecycle. Data extraction is the process of retrieving data from various sources to be used for further analysis. The extracted data could be… (Published in Data Engineer Things, Jan 30)
How Mobility-as-a-Service (MaaS) Companies Can Orchestrate Data Governance and Analytics. The Mobility-as-a-Service (MaaS) industry thrives on data: handling millions of ride requests, tracking vehicle locations, optimizing… (Feb 19)
Data Governance: The Real-World Playbook for Quality, Compliance, and Security. Companies are drowning in data but starving for insights. Effective data governance ensures that organisations can trust, protect, and use… (Published in Data Engineer Things, Feb 17)
Navigating the Data Lakehouse Revolution with Apache Iceberg. Over the past few months, I’ve been diving deep into the world of data engineering and have discovered that Apache Iceberg is a game… (Published in Data Engineer Things, Feb 12)