<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Data Transformation on AI VOID</title><link>https://ai-blog.noorshomelab.dev/tags/data-transformation/</link><description>Recent content in Data Transformation on AI VOID</description><generator>Hugo</generator><language>en</language><lastBuildDate>Sat, 20 Dec 2025 00:00:00 +0000</lastBuildDate><atom:link href="https://ai-blog.noorshomelab.dev/tags/data-transformation/index.xml" rel="self" type="application/rss+xml"/><item><title>Real-time Supply Chain Delay Analytics (Gold Layer)</title><link>https://ai-blog.noorshomelab.dev/realtime-supply-chain-intelligence/05-dlt-gold-delay-analytics/</link><pubDate>Sat, 20 Dec 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/realtime-supply-chain-intelligence/05-dlt-gold-delay-analytics/</guid><description>&lt;h2 id="chapter-5-real-time-supply-chain-delay-analytics-gold-layer"&gt;Chapter 5: Real-time Supply Chain Delay Analytics (Gold Layer)&lt;/h2&gt;
&lt;h3 id="chapter-introduction"&gt;Chapter Introduction&lt;/h3&gt;
&lt;p&gt;Welcome to Chapter 5, where we elevate our supply chain data from the Silver layer to the Gold layer. In this crucial phase, we will build Databricks Delta Live Tables (DLT) pipelines to perform real-time aggregations and derive actionable insights for supply chain delay analytics. This involves taking the cleaned and enriched data from our Silver tables and transforming it into easily consumable metrics, such as average delay times, on-time delivery rates, and identifying critical delay incidents.&lt;/p&gt;</description></item><item><title>Data Transformation with PySpark DataFrames</title><link>https://ai-blog.noorshomelab.dev/databricks-mastery-2025/data-transformation-pyspark/</link><pubDate>Fri, 19 Dec 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/databricks-mastery-2025/data-transformation-pyspark/</guid><description>&lt;h2 id="introduction-to-data-transformation-with-pyspark-dataframes"&gt;Introduction to Data Transformation with PySpark DataFrames&lt;/h2&gt;
&lt;p&gt;Welcome back, data adventurers! In our previous chapters, we learned how to get around Databricks, set up our environment, and even load some data. But what good is raw data if we can&amp;rsquo;t make sense of it, clean it up, or reshape it to answer critical questions? This is where the magic of data transformation comes comes in, and PySpark DataFrames are our trusty wands!&lt;/p&gt;</description></item><item><title>Chapter 6: Scales and Transformations - Making Sense of Data</title><link>https://ai-blog.noorshomelab.dev/d3js-canvas-graphs-2025/chapter-6-scales-transformations/</link><pubDate>Thu, 04 Dec 2025 00:00:00 +0000</pubDate><guid>https://ai-blog.noorshomelab.dev/d3js-canvas-graphs-2025/chapter-6-scales-transformations/</guid><description>&lt;h2 id="chapter-6-scales-and-transformations---making-sense-of-data"&gt;Chapter 6: Scales and Transformations - Making Sense of Data&lt;/h2&gt;
&lt;p&gt;Welcome back, intrepid data explorer! In our previous chapters, you mastered the art of setting up your D3.js environment and drawing basic shapes directly onto the HTML Canvas. You&amp;rsquo;ve got the foundational drawing skills down, but now it&amp;rsquo;s time to bring your data to life in a meaningful way.&lt;/p&gt;
&lt;p&gt;This chapter is all about understanding how to translate raw data values into visual properties like positions, sizes, and colors. We&amp;rsquo;ll dive deep into D3.js &lt;strong&gt;Scales&lt;/strong&gt;, which are powerful functions that map your data&amp;rsquo;s domain to your visualization&amp;rsquo;s range. Then, we&amp;rsquo;ll explore fundamental &lt;strong&gt;Canvas Transformations&lt;/strong&gt; to precisely position and manipulate your drawn elements. By the end, you&amp;rsquo;ll be able to create data-driven visualizations that are not just pretty, but also accurate and informative!&lt;/p&gt;</description></item></channel></rss>