Exam 203 back-end services Spark Delta Lake

Spark Delta Lake.

This is a layer on top of Spark that provides for relational databases.

By using Delta Lake, you can implement a data lakehouse architecture in Spark.

Delta Lake supports:

CRUD (create, read, update, and delete) operations
ACID atomicity (transactions complete as a single unit of work), consistency (transactions leave the database in a consistent state), isolation (in-process transactions can't interfere with one another), and durability (when a transaction completes, the changes it made are persisted
Data versioning and time travel
Streaming as well as Batch data. Spark Structured Streaming API
Underlying data is in Parquet format only, not CSV.
Can use the Serverless pool in Synapse Studio to query it.