Data formats supported for ingestion
Data ingestion is the process by which data is added to a table and made available for query. For all ingestion methods other than ingest-from-query, the data must be in one of the supported formats. The following table lists and describes the formats that are supported for data ingestion.
For more information about why ingestion might fail, see Ingestion failures and Ingestion error codes in Azure Data Explorer.
Format | Extension | Description |
---|---|---|
ApacheAvro | .avro | An AVRO format with support for logical types. The following compression codecs are supported: null, deflate, and snappy. Reader implementation of the apacheavro format is based on the official Apache Avro library. For information about ingesting Event Hub Capture Avro files, see Ingesting Event Hub Capture Avro files. |
Avro | .avro | A legacy implementation of the AVRO format based on a .NET library. The following compression codecs are supported: null and deflate. To use snappy, use the ApacheAvro data format. |
CSV | .csv | A text file with comma-separated values (,). See RFC 4180: Common Format and MIME Type for Comma-Separated Values (CSV) Files. |
JSON | .json | A text file with JSON objects delimited by \n or \r\n. See JSON Lines (JSONL). |
MultiJSON | .multijson | A text file with a JSON array of property bags (each representing a record), or any number of property bags delimited by whitespace, \n, or \r\n. Each property bag can span multiple lines. |
ORC | .orc | An ORC file. |
Parquet | .parquet | A Parquet file. |
PSV | .psv | A text file with pipe-separated values (|). |
RAW | .raw | A text file whose entire contents is a single string value. |
SCsv | .scsv | A text file with semicolon-separated values (;). |
SOHsv | .sohsv | A text file with SOH-separated values. (SOH is ASCII codepoint 1; this format is used by Hive on HDInsight.) |
TSV | .tsv | A text file with tab-separated values (\t). |
TSVE | .tsv | A text file with tab-separated values (\t). A backslash character (\) is used for escaping. |
TXT | .txt | A text file with lines delimited by \n. Empty lines are skipped. |
W3CLOGFILE | .log | Web log file format standardized by the W3C. |
For more information about ingesting data using the json or multijson formats, see ingest json formats.
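The difference between the two JSON shapes described in the table can be illustrated with a short Python sketch. It is not the service's parser; the function names `parse_json_lines` and `parse_multijson` are hypothetical, and skipping blank lines in the line-delimited case is an assumption made for this example.

```python
import json


def parse_json_lines(text):
    """Parse text that holds one JSON object per line (\\n or \\r\\n delimited).

    Assumption for this sketch: blank lines are ignored.
    """
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def parse_multijson(text):
    """Parse a JSON array of property bags, or property bags separated by
    whitespace or newlines, where each bag may span several lines."""
    stripped = text.strip()
    if stripped.startswith("["):
        # Whole payload is a single JSON array of records.
        return json.loads(stripped)

    decoder = json.JSONDecoder()
    records, pos = [], 0
    while pos < len(stripped):
        # raw_decode returns the parsed value and the index where it ended.
        record, pos = decoder.raw_decode(stripped, pos)
        records.append(record)
        # Skip the whitespace that separates one property bag from the next.
        while pos < len(stripped) and stripped[pos].isspace():
            pos += 1
    return records
```

In this sketch, a record that is pretty-printed over several lines parses with `parse_multijson` but fails with `parse_json_lines`, which mirrors why such data would be declared as multijson rather than json.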
Supported data compression formats
Blobs and files can be compressed with any of the following compression algorithms:
Compression | Extension |
---|---|
gzip | .gz |
zip | .zip |
Indicate compression by appending the extension to the name of the blob or file.
For example:
- MyData.csv.zip indicates a blob or a file formatted as CSV, compressed with zip (archive or a single file).
- MyData.json.gz indicates a blob or a file formatted as JSON, compressed with gzip.
Blob or file names that include only the compression extension and not the format extension (for example, MyData.zip) are also supported. In this case, the file format must be specified as an ingestion property because it can't be inferred.
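The naming convention above can be sketched as a small Python helper that splits a name into its format extension and compression. This is an illustration only, not how the service resolves names; `describe_name` is a hypothetical function, and treating extensions as case-insensitive is an assumption. The helper returns the extension rather than a format name because some extensions in the table (.avro, .tsv) map to more than one format.

```python
import os

# Compression extensions from the table above.
COMPRESSION_EXTENSIONS = {".gz": "gzip", ".zip": "zip"}

# Format extensions from the supported formats table.
FORMAT_EXTENSIONS = {
    ".avro", ".csv", ".json", ".multijson", ".orc", ".parquet", ".psv",
    ".raw", ".scsv", ".sohsv", ".tsv", ".txt", ".log",
}


def describe_name(name):
    """Return (format_extension, compression) inferred from a blob or file name.

    Either element is None when it can't be inferred from the name alone.
    Assumption for this sketch: extensions are matched case-insensitively.
    """
    stem, ext = os.path.splitext(name.lower())
    compression = COMPRESSION_EXTENSIONS.get(ext)
    if compression is not None:
        # The last extension was compression; look one level further in.
        stem, ext = os.path.splitext(stem)
    format_ext = ext if ext in FORMAT_EXTENSIONS else None
    return format_ext, compression


for example in ("MyData.csv.zip", "MyData.json.gz", "MyData.zip"):
    print(example, describe_name(example))
```

For MyData.zip the helper returns no format extension, which corresponds to the case where the format has to be supplied as an ingestion property.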
Related content
- Learn more about supported data formats
- Learn more about Data ingestion properties
- Learn more about data ingestion