1 - Null values

Learn how to use and understand null values.

All scalar data types in Kusto have a special value that represents a missing value. This value is called the null value, or null.

Null literals

The null value of a scalar type T is represented in the query language by the null literal T(null).

The following query returns a single row full of null values:

print bool(null), datetime(null), dynamic(null), guid(null), int(null), long(null), real(null), double(null), timespan(null)

Predicates on null values

The scalar function isnull() can be used to determine if a scalar value is the null value. The corresponding function isnotnull() can be used to determine if a scalar value isn’t the null value.

Equality and inequality of null values

  • Equality (==): Applying the equality operator to two null values yields bool(null). Applying the equality operator to a null value and a non-null value yields bool(false).
  • Inequality (!=): Applying the inequality operator to two null values yields bool(null). Applying the inequality operator to a null value and a non-null value yields bool(true).

For example:

datatable(val:int)[5, int(null)]
| extend IsBiggerThan3 = val > 3
| extend IsBiggerThan3OrNull = val > 3 or isnull(val)
| extend IsEqualToNull = val == int(null)
| extend IsNotEqualToNull = val != int(null)

Output

valIsBiggerThan3IsBiggerThan3OrNullIsEqualToNullIsNotEqualToNull
5truetruefalsetrue
nullnulltruenullnull

Null values and aggregation functions

When applying the following operators to entities that include null values, the null values are ignored and don’t factor into the calculation:

Null values and the where operator

The where operator use Boolean expressions to determine if to emit each input record to the output. This operator treats null values as if they’re bool(false). Records for which the predicate returns the null value are dropped and don’t appear in the output.

For example:

datatable(ival:int, sval:string)[5, "a", int(null), "b"]
| where ival != 5

Output

ivalsval
nullb

Null values and binary operators

Binary operators are scalar operators that accept two scalar values and produce a third value. For example, greater-than (>) and Boolean AND (&&) are binary operators.

For all binary operators, except as noted in Exceptions to this rule, the rule is as follows:

If one or both of the values input to the binary operator are null values, then the output of the binary operator is also the null value. In other words, the null value is “sticky”.

Exceptions to this rule

  • For the equality (==) and inequality (!=) operators, if one of the values is null and the other value isn’t null, then the result is either bool(false) or bool(true), respectively.
  • For the logical AND (&&) operator, if one of the values is bool(false), the result is also bool(false).
  • For the logical OR (||) operator, if one of the values is bool(true), the result is also bool(true).

For example:

datatable(val:int)[5, int(null)]
| extend Add = val + 10
| extend Multiply = val * 10

Output

valAddMultiply
51550
nullnullnull

Null values and the logical NOT (!) operator

The logical NOT operator not() yields the value bool(null) if the argument is the null value.

Null values and the in operator

  • The in operator behaves like a logical OR of equality comparisons.
  • The !in operator behaves like a logical AND of inequality comparisons.

Null values and data ingestion

For most data types, a missing value in the data source produces a null value in the corresponding table cell. However, columns of type string and CSV (or CSV-like) data formats are an exception to this rule, and a missing value produces an empty string.

For example:

.create table T(a:string, b:int)

.ingest inline into table T
[,]
[ , ]
[a,1]

T
| project a, b, isnull_a=isnull(a), isempty_a=isempty(a), stlen_a=strlen(a), isnull_b=isnull(b)

Output

abisnull_aisempty_astrlen_aisnull_b
  falsetrue0true
  falsefalse1true
a1falsefalse1false

2 - Scalar data types

This article describes Scalar data types.

Every data value, like the value of an expression or a function parameter, has a data type which is either a scalar data type or a user-defined record. A scalar data type is one of the built-in predefined types in Supported data types. A user-defined record is an ordered sequence of name and scalar-data-type pairs, like the data type of a row in a table.

As in most languages, the data type determines what calculations and manipulations can be run against a value. For example, if you have a value that is of type string, you won’t be able to perform arithmetic calculations against it.

Supported data types

In Kusto Query Language, most of the data types follow standard conventions and have names you’ve probably seen before. The following table shows the full list:

TypeDescription
bool (boolean)true (1) or false (0).
datetime (date)An instant in time, typically expressed as a date and time of day.
decimalA 128-bit wide, decimal number.
dynamicAn array, a property bag, or a value of any of the other scalar data types.
guid (uuid, uniqueid)A 128-bit globally unique value.
intA signed, 32-bit wide, integer.
longA signed, 64-bit wide, integer.
real (double)A 64-bit wide, double-precision, floating-point number.
stringA sequence of zero or more Unicode characters.
timespan (time)A time interval.

While most of the data types are standard, you might be less familiar with types like dynamic or timespan, and guid.

  • Dynamic has a structure similar to JSON, but with one key difference: It can store Kusto Query Language-specific data types that traditional JSON can’t, such as a nested dynamic value, or timespan.

  • Timespan is a data type that refers to a measure of time such as hours, days, or seconds. Don’t confuse timespan with datetime, which evaluates to an actual date and time, not a measure of time. The following table shows a list of timespan suffixes.

  • GUID is a datatype representing a 128-bit, globally unique identifier, which follows the standard format of [8]-[4]-[4]-[4]-[12], where each [number] represents the number of characters and each character can range from 0-9 or a-f.

Null values

All nonstring data types can be null. When a value is null, it indicates an absence or mismatch of data. For example, if you try to input the string abc into an integer column, it results in the null value. To check if an expression is null, use the isnull() function.

For more information, see Null values.

3 - The bool data type

This article describes the bool data type.

The bool data type can be: true (1), false (0), or null.

bool literals

To specify a bool literal, use one of the following syntax options:

SyntaxDescription
true or bool(true)Represents trueness.
false or bool(false)Represents falsehood.
bool(null)Represents the null value.

Boolean operators

The bool data type supports all of the logical operators: equality (==), inequality (!=), logical-and (and), and logical-or (or).

4 - The datetime data type

This article describes the datetime data type.

The datetime data type represents an instant in time, typically expressed as a date and time of day. Values range from 00:00:00 (midnight), January 1, 0001 Anno Domini (Common Era) through 11:59:59 P.M., December 31, 9999 A.D. (C.E.) in the Gregorian calendar.

Time values are measured in 100-nanosecond units called ticks, and a particular date is the number of ticks since 12:00 midnight, January 1, 0001 A.D. (C.E.) in the GregorianCalendar calendar (excluding ticks that would be added by leap seconds). For example, a ticks value of 31241376000000000 represents the date, Friday, January 01, 0100 12:00:00 midnight. This is sometimes called “a moment in linear time”.

datetime literals

To specify a datetime literal, use one of the following syntax options:

SyntaxDescriptionExample
datetime(year.month.day hour:minute:second.milliseconds)A date and time in UTC format.datetime(2015-12-31 23:59:59.9)
datetime(year.month.day)A date in UTC format.datetime(2015-12-31)
datetime()Returns the current time.
datetime(null)Represents the null value.

The now() and ago() special functions

Kusto provides two special functions, now() and ago(), to allow queries to reference the time at which the query starts execution.

Supported formats

There are several formats for datetime that are supported as datetime() literals and the todatetime() function.

ISO 8601

FormatExample
%Y-%m-%dT%H:%M:%s%z2014-05-25T08:20:03.123456Z
%Y-%m-%dT%H:%M:%s2014-05-25T08:20:03.123456
%Y-%m-%dT%H:%M2014-05-25T08:20
%Y-%m-%d %H:%M:%s%z2014-11-08 15:55:55.123456Z
%Y-%m-%d %H:%M:%s2014-11-08 15:55:55
%Y-%m-%d %H:%M2014-11-08 15:55
%Y-%m-%d2014-11-08

RFC 822

FormatExample
%w, %e %b %r %H:%M:%s %ZSat, 8 Nov 14 15:05:02 GMT
%w, %e %b %r %H:%M:%sSat, 8 Nov 14 15:05:02
%w, %e %b %r %H:%MSat, 8 Nov 14 15:05
%w, %e %b %r %H:%M %ZSat, 8 Nov 14 15:05 GMT
%e %b %r %H:%M:%s %Z8 Nov 14 15:05:02 GMT
%e %b %r %H:%M:%s8 Nov 14 15:05:02
%e %b %r %H:%M8 Nov 14 15:05
%e %b %r %H:%M %Z8 Nov 14 15:05 GMT

RFC 850

FormatExample
%w, %e-%b-%r %H:%M:%s %ZSaturday, 08-Nov-14 15:05:02 GMT
%w, %e-%b-%r %H:%M:%sSaturday, 08-Nov-14 15:05:02
%w, %e-%b-%r %H:%M %ZSaturday, 08-Nov-14 15:05 GMT
%w, %e-%b-%r %H:%MSaturday, 08-Nov-14 15:05
%e-%b-%r %H:%M:%s %Z08-Nov-14 15:05:02 GMT
%e-%b-%r %H:%M:%s08-Nov-14 15:05:02
%e-%b-%r %H:%M %Z08-Nov-14 15:05 GMT
%e-%b-%r %H:%M08-Nov-14 15:05

Sortable

FormatExample
%Y-%n-%e %H:%M:%s2014-11-08 15:05:25
%Y-%n-%e %H:%M:%s %Z2014-11-08 15:05:25 GMT
%Y-%n-%e %H:%M2014-11-08 15:05
%Y-%n-%e %H:%M %Z2014-11-08 15:05 GMT
%Y-%n-%eT%H:%M:%s2014-11-08T15:05:25
%Y-%n-%eT%H:%M:%s %Z2014-11-08T15:05:25 GMT
%Y-%n-%eT%H:%M2014-11-08T15:05
%Y-%n-%eT%H:%M %Z2014-11-08T15:05 GMT

5 - The decimal data type

This article describes the decimal data type.

The decimal data type represents a 128-bit wide, decimal number.

decimal literals

To specify a decimal literal, use one of the following syntax options:

|Syntax|Description|Example| |–|–| |decimal(number)|A decimal number represented by one or more digits, followed by a decimal point, and then one or more digits.|decimal(1.0)| |decimal(numbereexponent)|A decimal number represented by scientific notation.|decimal(1e5) is equivalent to 100,000| |decimal(null)|Represents the null value.||

6 - The dynamic data type

This article describes The dynamic data type.

The dynamic scalar data type can be any of the following values:

  • An array of dynamic values, holding zero or more values with zero-based indexing.
  • A property bag that maps unique string values to dynamic values. The property bag has zero or more such mappings (called “slots”), indexed by the unique string values. The slots are unordered.
  • A value of any of the primitive scalar data types: bool, datetime, guid, int, long, real, string, and timespan.
  • Null. For more information, see Null values.

Dynamic literals

To specify a dynamic literal, use one of the following syntax options:

SyntaxDescriptionExample
dynamic([value [, …]])An array of dynamic or other scalar literals.dynamic([1, 2, "hello"])
dynamic({key = value [, …]})A property bag, or object. The value for a key can be a nested property bag.dynamic({"a":1, "b":{"a":2}})
dynamic(value)A dynamic value holding the value of the inner scalar data type.dynamic(4)
dynamic(null)Represents the null value.

Dynamic object accessors

To subscript a dictionary, use either the dot notation (dict.key) or the brackets notation (dict["key"]). When the subscript is a string constant, both options are equivalent.

In the examples below dict and arr are columns of dynamic type:

ExpressionAccessor expression typeMeaningComments
dict[col]Entity name (column)Subscripts a dictionary using the values of the column col as the keyColumn must be of type string
arr[index]Entity index (column)Subscripts an array using the values of the column index as the indexColumn must be of type integer or boolean
arr[-index]Entity index (column)Retrieves the ‘index’-th value from the end of the arrayColumn must be of type integer or boolean
arr[(-1)]Entity indexRetrieves the last value in the array
arr[toint(indexAsString)]Function callCasts the values of column indexAsString to int and use them to subscript an array
dict[[‘where’]]Keyword used as entity name (column)Subscripts a dictionary using the values of column where as the keyEntity names that are identical to some query language keywords must be quoted
dict.[‘where’] or dict[‘where’]ConstantSubscripts a dictionary using where string as the key

Accessing a sub-object of a dynamic value yields another dynamic value, even if the sub-object has a different underlying type. Use the gettype function to discover the actual underlying type of the value, and any of the cast function listed below to cast it to the actual type.

Casting dynamic objects

After subscripting a dynamic object, you must cast the value to a simple type.

ExpressionValueType
Xparse_json(’[100,101,102]’)array
X[0]parse_json(‘100’)dynamic
toint(X[1])101int
Yparse_json(’{“a1”:100, “a b c”:“2015-01-01”}’)dictionary
Y.a1parse_json(‘100’)dynamic
Y[“a b c”]parse_json(“2015-01-01”)dynamic
todate(Y[“a b c”])datetime(2015-01-01)datetime

Cast functions are:

  • tolong()
  • todouble()
  • todatetime()
  • totimespan()
  • tostring()
  • toguid()
  • parse_json()

Building dynamic objects

Several functions enable you to create new dynamic objects:

  • bag_pack() creates a property bag from name/value pairs.
  • pack_array() creates an array from list of values (can be list of columns, for each row it will create an array from the specified columns).
  • range() creates an array with an arithmetic series of numbers.
  • zip() pairs “parallel” values from two arrays into a single array.
  • repeat() creates an array with a repeated value.

Additionally, there are several aggregate functions which create dynamic arrays to hold aggregated values:

  • buildschema() returns the aggregate schema of multiple dynamic values.
  • make_bag() returns a property bag of dynamic values within the group.
  • make_bag_if() returns a property bag of dynamic values within the group (with a predicate).
  • make_list() returns an array holding all values, in sequence.
  • make_list_if() returns an array holding all values, in sequence (with a predicate).
  • make_list_with_nulls() returns an array holding all values, in sequence, including null values.
  • make_set() returns an array holding all unique values.
  • make_set_if() returns an array holding all unique values (with a predicate).

Operators and functions over dynamic types

For a complete list of scalar dynamic/array functions, see dynamic/array functions.

Operator or functionUsage with dynamic data types
value in arrayTrue if there’s an element of array that == value
where City in ('London', 'Paris', 'Rome')
value !in arrayTrue if there’s no element of array that == value
array_length(array)Null if it isn’t an array
bag_has_key(bag,key)Checks whether a dynamic bag column contains a given key.
bag_keys(bag)Enumerates all the root keys in a dynamic property-bag object.
bag_merge(bag1,…,bagN)Merges dynamic property-bags into a dynamic property-bag with all properties merged.
bag_set_key(bag,key,value)Sets a given key to a given value in a dynamic property-bag.
extract_json(path,object), extract_json(path,object)Use path to navigate into object.
parse_json(source)Turns a JSON string into a dynamic object.
range(from,to,step)An array of values.
mv-expand listColumnReplicates a row for each value in a list in a specified cell.
summarize buildschema(column)Infers the type schema from column content.
summarize make_bag(column)Merges the property bag (dictionary) values in the column into one property bag, without key duplication.
summarize make_bag_if(column,predicate)Merges the property bag (dictionary) values in the column into one property bag, without key duplication (with predicate).
summarize make_list(column)Flattens groups of rows and puts the values of the column in an array.
summarize make_list_if(column,predicate)Flattens groups of rows and puts the values of the column in an array (with predicate).
summarize make_list_with_nulls(column)Flattens groups of rows and puts the values of the column in an array, including null values.
summarize make_set(column)Flattens groups of rows and puts the values of the column in an array, without duplication.

Indexing for dynamic data

Every field is indexed during data ingestion. The scope of the index is a single data shard.

To index dynamic columns, the ingestion process enumerates all “atomic” elements within the dynamic value (property names, values, array elements) and forwards them to the index builder. Otherwise, dynamic fields have the same inverted term index as string fields.

Examples

Dynamic property bag

The following query creates a dynamic property bag.

print o=dynamic({"a":123, "b":"hello", "c":[1,2,3], "d":{}})
| extend a=o.a, b=o.b, c=o.c, d=o.d

For convenience, dynamic literals that appear in the query text itself may also include other Kusto literals with types: datetime, timespan, real, long, guid, bool, and dynamic. This extension over JSON isn’t available when parsing strings (such as when using the parse_json function or when ingesting data), but it enables you to do the following:

print d=dynamic({"a": datetime(1970-05-11)})

To parse a string value that follows the JSON encoding rules into a dynamic value, use the parse_json function. For example:

  • parse_json('[43, 21, 65]') - an array of numbers
  • parse_json('{"name":"Alan", "age":21, "address":{"street":432,"postcode":"JLK32P"}}') - a dictionary
  • parse_json('21') - a single value of dynamic type containing a number
  • parse_json('"21"') - a single value of dynamic type containing a string
  • parse_json('{"a":123, "b":"hello", "c":[1,2,3], "d":{}}') - gives the same value as o in the example above.

Ingest data into dynamic columns

The following example shows how you can define a table that holds a dynamic column (as well as a datetime column) and then ingest single record into it. It also demonstrates how you can encode JSON strings in CSV files.

// dynamic is just like any other type:
.create table Logs (Timestamp:datetime, Trace:dynamic)

// Everything between the "[" and "]" is parsed as a CSV line would be:
// 1. Since the JSON string includes double-quotes and commas (two characters
//    that have a special meaning in CSV), we must CSV-quote the entire second field.
// 2. CSV-quoting means adding double-quotes (") at the immediate beginning and end
//    of the field (no spaces allowed before the first double-quote or after the second
//    double-quote!)
// 3. CSV-quoting also means doubling-up every instance of a double-quotes within
//    the contents.

.ingest inline into table Logs
  [2015-01-01,"{""EventType"":""Demo"", ""EventValue"":""Double-quote love!""}"]

Output

TimestampTrace
2015-01-01 00:00:00.0000000{“EventType”:“Demo”,“EventValue”:“Double-quote love!”}

7 - The guid data type

This article describes The guid data type.

The guid data type represents a 128-bit globally unique value.

guid literals

To specify a guid literal, use one of the following syntax options:

SyntaxDescriptionExample
guid(id)A guid ID string.guid(74be27de-1e4e-49d9-b579-fe0b331d3642)
guid(null)Represents the null value.

8 - The int data type

This article describes the int data type.

The int data type represents a signed, 32-bit wide, integer.

int literals

To specify an int literal, use one of the following syntax options:

|Syntax|Description|Example| |–|–| |int(number)|A positive integer.|int(2)| |int(-number)|A negative integer.|int(-2)| |int(null)|Represents the null value.||

9 - The long data type

This article describes the long data type.

The long data type represents a signed, 64-bit wide, integer.

By default, integers and integers represented with hexadecimal syntax are of type long.

long literals

To specify a long literal, use one of the following syntax options:

|Syntax|Description|Example| |–|–| |number|An integer. You don’t need to wrap the integer with long() because integers are by default of type long.|12| |0xhex|An integer represented with hexadecimal syntax.|0xf is equivalent to 15| |long(-number)|A negative integer.|long(-1)| |long(null)|Represents the null value.||

10 - The real data type

This article describes the real data type.

The real data type represents a 64-bit wide, double-precision, floating-point number.

By default, decimal numbers and numbers represented with scientific notation are of type real.

real literals

To specify a real literal, use one of the following syntax options:

SyntaxDescriptionExample
numberA real number represented by one or more digits, followed by a decimal point, and then one or more digits.1.0
numbereexponentA real number represented by scientific notation.1e5
real(null)Represents the null value.
real(nan)Not-a-number (NaN), such as when dividing a 0.0 by another 0.0.
real(+inf)Positive infinity, such as when dividing 1.0 by 0.0.
real(-inf)Negative infinity, such as when dividing -1.0 by 0.0.

11 - The string data type

Learn about the string data type.

The string data type represents a sequence of zero or more Unicode characters.

For information on string query operators, see String operators.

string literals

A string literal is a string enclosed in quotes. You can use double quotes or single quotes to encode string literals in query text. With double quotes, you must escape nested double quote characters with a backslash (\). With single quotes, you must escape nested single quote characters, and you don’t need to escape double quotes.

Use the backslash character to escape the enclosing quote characters, tab characters (\t), newline characters (\n), and the backslash itself (\\).

Verbatim string literals

Verbatim string literals are string literals prepended with the @ character, which serves as a verbatim identifier. In this form, the backslash character (\) stands for itself and isn’t an escape character. In verbatim string literals, double quotes are escaped with double quotes and single quotes are escaped with single quotes.

For an example, see Verbatim string.

Multi-line string literals

Indicate a multi-line string literals by a “triple-backtick chord” (```) at the beginning and end of the literal.

For an example, see Multi-line string literal.

Concatenation of separated string literals

In a Kusto query, when two or more adjacent string literals have no separation between them, they’re automatically combined to form a new string literal. Similarly, if the string literals are separated only by whitespace or comments, they’re also combined to form a new string literal.

For an example, see Concatenated string literals.

Obfuscated string literals

Queries are stored for telemetry and analysis. To safeguard sensitive information like passwords and secrets, you can mark a string as an obfuscated string literal. These marked strings are logged in obfuscated form replaced with asterisks (*) in the query text.

An obfuscated string literal is created by prepending an h or an H character in front of a standard or verbatim string literal.

For an example, see Obfuscated string literal.

Examples

String literal with quotes

The following example demonstrates how to use quotes within string literals encompassed by single quotes and double quotes. For more information, see String literals.

print
    s1 = 'string with "double quotes"',
    s2 = "string with 'single quotes'"

Output

s1s2
string with “double quotes”string with ‘single quotes’

String literal with backslash escaping

The following example creates a regular expression pattern using backslashes to escape special characters. For more information, see String literals.

print pattern = '\\n.*(>|\'|=|\")[a-zA-Z0-9/+]{86}=='

Output

pattern
\n.*(>|’|=|")[a-zA-Z0-9/+]{86}==

String literal with Unicode

The following example shows that a backslash is needed to include a Unicode character in a string literal.

print space = "Hello\u00A0World"

Output

space
Hello World

Verbatim string literal

The following example creates a path in which the backslashes are part of the path instead of escape characters. To do this, the string @ sign is prepended to the string, creating a verbatim string literal.

print myPath = @'C:\Folder\filename.txt'

Output

myPath
C:\Folder\filename.txt

Multi-line string literal

The following example shows the syntax for a multi-line string literal, which uses newlines and tabs to style a code block. For more information, see Multi-line string literals.

print program = ```
  public class Program {
    public static void Main() {
      System.Console.WriteLine("Hello!");
    }
  }```

Output

program
public class Program { public static void Main() { System.Console.WriteLine(“Hello!”); } }

Concatenated string literals

The following expressions all yield a string of length 13. For more information, see Concatenation of separated string literals.

print 
    none = strlen("Hello"', '@"world!"),
    whitespace = strlen("Hello" ', ' @"world!"),
    whitespaceAndComment = strlen("Hello" 
        // Comment
        ', '@"world!"
    );

Output

nonewhitespacewhitespaceAndComment
131313

Obfuscated string literal

In the following query output, the h string is visible in your results. However, in tracing or telemetry, the h string is stored in an obfuscated form and substituted with asterisks in the log. For more information, see Obfuscated string literals.

print blob="https://contoso.blob.core.windows.net/container/blob.txt?"
    h'sv=2012-02-12&se=2013-04-13T0...'

Output

blob
https://contoso.blob.core.windows.net/container/blob.txt?sv=2012-02-12&se=2013-04-13T0

12 - The timespan data type

This article describes The timespan data type.

The timespan data type represents a time interval.

timespan literals

To specify a timespan literal, use one of the following syntax options:

SyntaxDescriptionExampleLength of time
ndA time interval represented by one or more digits followed by d for days.2d2 days
nhA time interval represented by one or more digits followed by h for hours.1.5h1.5 hours
nmA time interval represented by one or more digits followed by m for minutes.30m30 minutes
nsA time interval represented by one or more digits followed by s for seconds.10s10 seconds
nmsA time interval represented by one or more digits followed by ms for milliseconds.100ms100 milliseconds
nmicrosecondA time interval represented by one or more digits followed by microsecond.10microsecond10 microseconds
ntickA time interval represented by one or more digits followed by tick to indicate nanoseconds.1tick100 ns
timespan(n seconds)A time interval in seconds.timespan(15 seconds)15 seconds
timespan(n)A time interval in days.timespan(2)2 days
timespan(days.hours:minutes:seconds.milliseconds)A time interval in days, hours, minutes, and seconds passed.timespan(0.12:34:56.7)0d+12h+34m+56.7s
timespan(null)Represents the null value.

timespan operators

Two values of type timespan may be added, subtracted, and divided. The last operation returns a value of type real representing the fractional number of times one value can fit the other.

Examples

The following example calculates how many seconds are in a day in several ways:

print
    result1 = 1d / 1s,
    result2 = time(1d) / time(1s),
    result3 = 24 * 60 * time(00:01:00) / time(1s)

This example converts the number of seconds in a day (represented by an integer value) to a timespan unit:

print 
    seconds = 86400
| extend t = seconds * 1s