Exam DP203 Storage 3 Azure Lake Database: Difference between revisions
No edit summary |
No edit summary |
||
Line 1: | Line 1: | ||
The third [[Exam_DP203_Storage|Storage]] type is: Azure Lake Database. | The third [[Exam_DP203_Storage|Storage]] type is: Azure Lake Database. | ||
The underlying data is in Parquet or CSV files only | The underlying data is in Parquet or CSV files only. | ||
Serverless SQL Pool, or Spark. | Serverless SQL Pool, or Spark. | ||
To create a Lake Database, go into Synapse Studio '''Data''' tab, and click the '''+ button''' and select Lake Database under "Workspace". This creates a new section "Lake Database" under which the database is created, under '''Data''' - '''Workspace'''. | To create a Lake Database, go into Synapse Studio '''Data''' tab, and click the '''+ button''' and select Lake Database under "Workspace". This creates a new section "Lake Database" under which the database is created, under '''Data''' - '''Workspace'''. | ||
On the right-hand side you choose the Linked Service from the pop-up menu. This needs to be an existing '''Linked Service''' to Azure Gen2 storage already configured under section '''Data''' - '''Linked''' - '''Azure Data Lake Storage Gen2'''. | |||
In the design section, you can choose a template from different business scenarios, or choose to create it as empty. Then after you can add tables to it, including key relationships between tables. | In the design section, you can choose a template from different business scenarios, or choose to create it as empty. Then after you can add tables to it, including key relationships between tables. |
Revision as of 21:21, 16 November 2024
The third Storage type is: Azure Lake Database.
The underlying data is in Parquet or CSV files only.
Serverless SQL Pool, or Spark.
To create a Lake Database, go into Synapse Studio Data tab, and click the + button and select Lake Database under "Workspace". This creates a new section "Lake Database" under which the database is created, under Data - Workspace.
On the right-hand side you choose the Linked Service from the pop-up menu. This needs to be an existing Linked Service to Azure Gen2 storage already configured under section Data - Linked - Azure Data Lake Storage Gen2.
In the design section, you can choose a template from different business scenarios, or choose to create it as empty. Then after you can add tables to it, including key relationships between tables.
You can run select statements on the tables either with the Serverless pool using standard SQL syntax like:
USE RetailDB; GO SELECT CustomerID, FirstName, LastName FROM Customer ORDER BY LastName;
or by running a Spark notebook and running:
%%sql INSERT INTO `RetailDB`.`Customer` VALUES (123, 'John', 'Yang') %%sql SELECT * FROM `RetailDB`.`Customer` WHERE CustomerID = 123