What is Databricks? Databricks, developed by the creators of Apache Spark, is a unified platform for data engineering, analytics, and machine learning. From storage to insights via...
A Delta Lake table is essentially a collection of Parquet files augmented with a robust versioning system. It uses transaction logs stored as JSON files to maintain a...
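The versioning idea can be sketched in plain Python. This is a toy model, not the real Delta Lake implementation: each commit appends a numbered JSON file of add/remove actions to a `_delta_log`-style directory, and any table version can be rebuilt by replaying the log up to that commit. All names here are illustrative.

```python
import json
import os
import tempfile

def commit(log_dir, version, actions):
    # Each commit is one JSON-lines file named after its version number,
    # mimicking how a Delta transaction log records actions.
    path = os.path.join(log_dir, f"{version:020d}.json")
    with open(path, "w") as f:
        for action in actions:
            f.write(json.dumps(action) + "\n")

def active_files(log_dir, as_of_version=None):
    """Replay the log to find which Parquet files make up a given version."""
    files = set()
    for name in sorted(os.listdir(log_dir)):
        version = int(name.split(".")[0])
        if as_of_version is not None and version > as_of_version:
            break
        with open(os.path.join(log_dir, name)) as f:
            for line in f:
                action = json.loads(line)
                if "add" in action:
                    files.add(action["add"]["path"])
                elif "remove" in action:
                    files.discard(action["remove"]["path"])
    return files

log_dir = tempfile.mkdtemp()
commit(log_dir, 0, [{"add": {"path": "part-000.parquet"}}])
commit(log_dir, 1, [{"remove": {"path": "part-000.parquet"}},
                    {"add": {"path": "part-001.parquet"}}])
print(active_files(log_dir, as_of_version=0))  # {'part-000.parquet'}
print(active_files(log_dir))                   # {'part-001.parquet'}
```

Because old commits are never rewritten, reading "as of" an earlier version (time travel) is just replaying fewer log files.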
1. Broadcast Join When joining a large DataFrame with a much smaller one in PySpark, the conventional shuffle-based Spark join operation...
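The mechanism behind a broadcast join can be sketched in plain Python (in PySpark itself you would apply the `broadcast()` hint from `pyspark.sql.functions`; this standalone sketch with made-up data just shows the idea): the small side is shipped to every worker as an in-memory hash map, and the large side is streamed past it without ever being shuffled across the network.

```python
def broadcast_hash_join(large_rows, small_rows, key):
    # "Broadcast" step: build a hash map of the small side, as each
    # executor would after receiving the broadcast copy.
    lookup = {}
    for row in small_rows:
        lookup.setdefault(row[key], []).append(row)
    # "Stream" step: probe the map once per large-side row; the large
    # side never moves across the network.
    for row in large_rows:
        for match in lookup.get(row[key], []):
            yield {**row, **{k: v for k, v in match.items() if k != key}}

orders = [{"user_id": 1, "amount": 30}, {"user_id": 2, "amount": 45}]
users = [{"user_id": 1, "country": "DE"}, {"user_id": 2, "country": "FR"}]
print(list(broadcast_hash_join(orders, users, "user_id")))
```

The trade-off is memory: the small table must fit comfortably on every executor, which is why Spark only broadcasts below a size threshold by default.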
Spark's execution plan is the series of steps by which SQL statements and DataFrame operations are translated into logical and then physical operations. In short, it...
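A drastically simplified toy model (an assumption for illustration, far simpler than Spark's Catalyst optimizer) makes the logical-to-physical translation concrete: a logical plan is a declarative list of steps, "optimization" reorders them (here, pushing a filter ahead of a projection, as Catalyst's predicate-pushdown rule would), and execution runs the optimized plan against the data.

```python
def optimize(logical_plan):
    # Toy optimizer rule: run Filter steps before Project steps,
    # so rows are discarded as early as possible.
    filters = [step for step in logical_plan if step[0] == "Filter"]
    others = [step for step in logical_plan if step[0] != "Filter"]
    return filters + others

def execute(plan, rows):
    # "Physical" execution: interpret each plan step over the rows.
    for op, arg in plan:
        if op == "Filter":
            rows = [r for r in rows if arg(r)]
        elif op == "Project":
            rows = [{k: r[k] for k in arg} for r in rows]
    return rows

# Written naively: project first, then filter on a column the
# projection would have dropped.
logical = [("Project", ["name"]), ("Filter", lambda r: r["age"] > 30)]
physical = optimize(logical)
rows = [{"name": "Ada", "age": 36}, {"name": "Bo", "age": 25}]
print(execute(physical, rows))  # [{'name': 'Ada'}]
```

In real Spark you would inspect this pipeline with `df.explain()`, which prints the parsed, optimized, and physical plans instead of executing a toy interpreter.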
Incremental data load refers to the process of integrating only new or updated data into an existing dataset or database, without reloading all the...
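A minimal upsert-style sketch of an incremental load, assuming each record carries a primary key and an `updated_at` timestamp (both names are illustrative, not tied to any specific library): new keys are inserted, and existing keys are overwritten only when the incoming row is newer.

```python
def incremental_load(existing, incoming, key="id", ts="updated_at"):
    # Index the existing dataset by primary key.
    merged = {row[key]: row for row in existing}
    for row in incoming:
        current = merged.get(row[key])
        # Insert unseen keys; overwrite only with fresher data.
        if current is None or row[ts] >= current[ts]:
            merged[row[key]] = row
    return list(merged.values())

existing = [{"id": 1, "val": "a", "updated_at": 1},
            {"id": 2, "val": "b", "updated_at": 1}]
incoming = [{"id": 2, "val": "b2", "updated_at": 2},  # updated record
            {"id": 3, "val": "c", "updated_at": 2}]   # new record
print(incremental_load(existing, incoming))
```

Only the `incoming` batch is processed, which is the whole point: the cost scales with the change set, not with the size of the full table.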
Apache Spark is an open-source distributed computing system that provides a fast, general-purpose framework for big data processing and analytics....