Exam DP203 Azure Stream Analytics

From MillerSql.com

Azure Stream Analytics.

Unbounded - no limit to the amount of data.

Temporal - each piece of data has a datetimestamp.

Aggregation over these temporal windows.

Real time or near-real time analysis.

Create a Stream Analytics job. Optionally create a Stream Analytics cluster.

Stream Analytics jobs are resources that should be created in resource groups.

reference inputs for static data

SELECT observation_time, weather_station, temperature
INTO cold-temps
FROM weather-events TIMESTAMP BY observation_time
WHERE temperature < 0

A field named EventQueuedUtcTime is automatically created to define the time when the event is received into the event queue.

EventProcessedUtcTime is the time it is processed.

Window functions

Window functions provide aggregation of streaming data over time periods. These are implemented by select statements on the source data with the following GROUP BY clauses (can also use HAVING statements):

Tumbling: Fixed time periods. GROUP BY TumblingWindow(minute, 1)

Hopping: Same as Tumbling, but where the time periods overlap. GROUP BY HoppingWindow(second, 60, 30)

Sliding: Periods start/end only when there is an event. There can be more than one event per period. GROUP BY SlidingWindow(minute, 1)

Session: Window opened by receipt of an event, and closes x seconds later, unless there is another event in this period, and so on until y seconds after the start. GROUP BY SessionWindow(second, 20, 60)

Snapshot: A period for every event. GROUP BY System.Timestamp()

Ingest streaming data

Create a Power BI Dashboard.

SELECT
    EventEnqueuedUtcTime AS ReadingTime,
    SensorID,
    ReadingValue
INTO
    [synapse-output]
FROM
    [streaming-input] TIMESTAMP BY EventEnqueuedUtcTime

Output to Power BI

There is a Power BI output type.

Use window functions to ensure data is sent to Power BI no more frequently than every second