Below you will find pages that utilize the taxonomy term “Apache Spark”
Data+AI Summit 2021 is Coming
It’s been almost half a year since the last summit.
Data+AI Summit 2021 runs from Monday, May 24 through Friday, May 28. The training will be held …
Delta Lake essential Fundamentals: Part 4 - Practical Scenarios
🎉 Welcome to the 4th part of Delta Lake essential fundamentals: the practical scenarios! 🎉
There are many great features that you can leverage in …
Delta Lake essential Fundamentals: Part 3 - compaction and checkpoint
Let’s understand what Delta Lake compaction and checkpoints are and why they are important.
Checkpoint
There are two known checkpoint mechanisms in …
Delta Lake essential Fundamentals: Part 2 - The DeltaLog
In the previous part, you learned what ACID transactions are.
In this part, you will understand how the Delta transaction log, named DeltaLog, is …
Delta Lake essential Fundamentals: Part 1 - ACID
🎉 Welcome to the first part of Delta Lake essential fundamentals! 🎉
What is Delta Lake?
Delta Lake is an open-source storage layer that brings ACID …
Apache Spark Ecosystem, Jan 2021 Highlights
If you’ve been reading here for a while, you know that I’m a big fan of Apache Spark and have been using it for more than 8 years.
Apache …