sample-distinct operator

Learn how to use the sample-distinct operator to return a column that contains up to the specified number of distinct values of the requested columns.

Returns a single column that contains up to the specified number of distinct values of the requested column.

The operator tries to return an answer as quickly as possible rather than trying to make a fair sample.

Syntax

T | sample-distinct NumberOfValues of ColumnName

Parameters

NameTypeRequiredDescription
Tstring✔️The input tabular expression.
NumberOfValuesint, long, or real✔️The number distinct values of T to return. You can specify any numeric expression.
ColumnNamestring✔️The name of the column from which to sample.

Examples

The example in this section shows how to use the syntax to help you get started.

Get 10 distinct values from a population

StormEvents | sample-distinct 10 of EpisodeId

Output

EpisodeId
11074
11078
11749
12554
12561
13183
11780
11781
12826

Further compute the sample values

let sampleEpisodes = StormEvents | sample-distinct 10 of EpisodeId;
StormEvents 
| where EpisodeId in (sampleEpisodes) 
| summarize totalInjuries=sum(InjuriesDirect) by EpisodeId

Output

EpisodeIdtotalInjuries
110910
110740
110780
117490
125543
125610
131830
117800
117810
128260