sample operator

Learn how to use the sample operator to return up to the specified number of rows from the input table.

Returns up to the specified number of random rows from the input table.

Syntax

T | sample NumberOfRows

Parameters

NameTypeRequiredDescription
Tstring✔️The input tabular expression.
NumberOfRowsint, long, or real✔️The number of rows to return. You can specify any numeric expression.

Examples

The example in this section shows how to use the syntax to help you get started.

Generate a sample

This query creates a range of numbers, samples one value, and then duplicates that sample.

let _data = range x from 1 to 100 step 1;
let _sample = _data | sample 1;
union (_sample), (_sample)

Output

x
74
63

To ensure that in example above _sample is calculated once, one can use materialize() function:

let _data = range x from 1 to 100 step 1;
let _sample = materialize(_data | sample 1);
union (_sample), (_sample)

Output

x
24
24

Generate a sample of a certain percentage of data

To sample a certain percentage of your data (rather than a specified number of rows), you can use

StormEvents | where rand() < 0.1

Output

The table contains the first few rows of the output. Run the query to view the full result.

StartTimeEndTimeEpisodeIdEventIdStateEventType
2007-01-01T00:00:00Z2007-01-20T10:24:00Z240311914INDIANAFlood
2007-01-01T00:00:00Z2007-01-24T18:47:00Z240811930INDIANAFlood
2007-01-01T00:00:00Z2007-01-01T12:00:00Z197912631DELAWAREHeavy Rain
2007-01-01T00:00:00Z2007-01-01T00:00:00Z259213208NORTH CAROLINAThunderstorm Wind
2007-01-01T00:00:00Z2007-01-31T23:59:00Z14927069MINNESOTADrought
2007-01-01T00:00:00Z2007-01-31T23:59:00Z224010858TEXASDrought

Generate a sample of keys

To sample keys rather than rows (for example - sample 10 Ids and get all rows for these Ids), you can use sample-distinct in combination with the in operator.

let sampleEpisodes = StormEvents | sample-distinct 10 of EpisodeId;
StormEvents
| where EpisodeId in (sampleEpisodes)

Output

The table contains the first few rows of the output. Run the query to view the full result.

StartTimeEndTimeEpisodeIdEventIdStateEventType
2007-09-18T20:00:00Z2007-09-19T18:00:00Z1107460904FLORIDAHeavy Rain
2007-09-20T21:57:00Z2007-09-20T22:05:00Z1107860913FLORIDATornado
2007-09-29T08:11:00Z2007-09-29T08:11:00Z1109161032ATLANTIC SOUTHWaterspout
2007-12-07T14:00:00Z2007-12-08T04:00:00Z1318373241AMERICAN SAMOAFlash Flood
2007-12-11T21:45:00Z2007-12-12T16:45:00Z1282670787KANSASFlood
2007-12-13T09:02:00Z2007-12-13T10:30:00Z1178064725KENTUCKYFlood