Exam DP203 Azure Stream Analytics: Difference between revisions
No edit summary |
|||
(7 intermediate revisions by the same user not shown) | |||
Line 10: | Line 10: | ||
Create a Stream Analytics job. Optionally create a Stream Analytics cluster. | Create a Stream Analytics job. Optionally create a Stream Analytics cluster. | ||
Stream Analytics jobs are resources that should be created in resource groups. | |||
'''reference inputs''' for static data | '''reference inputs''' for static data | ||
Line 23: | Line 25: | ||
== Window functions == | == Window functions == | ||
Window functions provide aggregation of streaming data over time periods. | Window functions provide aggregation of streaming data over time periods. These are implemented by select statements on the source data with the following GROUP BY clauses (can also use HAVING statements): | ||
'''Tumbling''': Fixed time periods. GROUP BY TumblingWindow(minute, 1) | '''Tumbling''': Fixed time periods. GROUP BY TumblingWindow(minute, 1) | ||
Line 33: | Line 35: | ||
'''Session''': Window opened by receipt of an event, and closes x seconds later, unless there is another event in this period, and so on until y seconds after the start. GROUP BY SessionWindow(second, 20, 60) | '''Session''': Window opened by receipt of an event, and closes x seconds later, unless there is another event in this period, and so on until y seconds after the start. GROUP BY SessionWindow(second, 20, 60) | ||
'''Snapshot''': | '''Snapshot''': A period for every event. GROUP BY System.Timestamp() | ||
== Ingest streaming data == | |||
Create a Power BI Dashboard.<pre> | |||
SELECT | |||
EventEnqueuedUtcTime AS ReadingTime, | |||
SensorID, | |||
ReadingValue | |||
INTO | |||
[synapse-output] | |||
FROM | |||
[streaming-input] TIMESTAMP BY EventEnqueuedUtcTime | |||
</pre> | |||
== Output to Power BI == | |||
There is a Power BI output type. | |||
Use window functions to ensure data is sent to Power BI no more frequently than every second |
Latest revision as of 15:23, 21 November 2024
Azure Stream Analytics.
Unbounded - no limit to the amount of data.
Temporal - each piece of data has a datetimestamp.
Aggregation over these temporal windows.
Real time or near-real time analysis.
Create a Stream Analytics job. Optionally create a Stream Analytics cluster.
Stream Analytics jobs are resources that should be created in resource groups.
reference inputs for static data
SELECT observation_time, weather_station, temperature INTO cold-temps FROM weather-events TIMESTAMP BY observation_time WHERE temperature < 0
A field named EventQueuedUtcTime is automatically created to define the time when the event is received into the event queue.
EventProcessedUtcTime is the time it is processed.
Window functions
Window functions provide aggregation of streaming data over time periods. These are implemented by select statements on the source data with the following GROUP BY clauses (can also use HAVING statements):
Tumbling: Fixed time periods. GROUP BY TumblingWindow(minute, 1)
Hopping: Same as Tumbling, but where the time periods overlap. GROUP BY HoppingWindow(second, 60, 30)
Sliding: Periods start/end only when there is an event. There can be more than one event per period. GROUP BY SlidingWindow(minute, 1)
Session: Window opened by receipt of an event, and closes x seconds later, unless there is another event in this period, and so on until y seconds after the start. GROUP BY SessionWindow(second, 20, 60)
Snapshot: A period for every event. GROUP BY System.Timestamp()
Ingest streaming data
Create a Power BI Dashboard.
SELECT EventEnqueuedUtcTime AS ReadingTime, SensorID, ReadingValue INTO [synapse-output] FROM [streaming-input] TIMESTAMP BY EventEnqueuedUtcTime
Output to Power BI
There is a Power BI output type.
Use window functions to ensure data is sent to Power BI no more frequently than every second