format

Arrow specification documents

Currently, the Arrow specification consists of these pieces:

Metadata specification (see Metadata.md)
Physical memory layout specification (see Layout.md)
Logical Types, Schemas, and Record Batch Metadata (see Schema.fbs)
Encapsulated Messages (see Message.fbs)
Mechanics of messaging between Arrow systems (IPC, RPC, etc.) (see IPC.md)
Tensor (Multi-dimensional array) Metadata (see Tensor.fbs)

The metadata currently uses Google's flatbuffers library for serializing a couple related pieces of information:

Schemas for tables or record (row) batches. This contains the logical types, field names, and other metadata. Schemas do not contain any information about actual data.
Data headers for record (row) batches. These must correspond to a known schema, and enable a system to send and receive Arrow row batches in a form that can be precisely disassembled or reconstructed.

Arrow Format Maturity and Stability

We have made significant progress hardening the Arrow in-memory format and Flatbuffer metadata since the project started in February 2016. We have integration tests which verify binary compatibility between the Java and C++ implementations, for example.

Major versions may still include breaking changes to the memory format or metadata, so it is recommended to use the same released version of all libraries in your applications for maximum compatibility. Data stored in the Arrow IPC formats should not be used for long term storage.

Name		Name	Last commit message	Last commit date
parent directory ..
Arrow.graffle		Arrow.graffle
Arrow.png		Arrow.png
File.fbs		File.fbs
Flight.proto		Flight.proto
Guidelines.md		Guidelines.md
IPC.md		IPC.md
Layout.md		Layout.md
Message.fbs		Message.fbs
Metadata.md		Metadata.md
README.md		README.md
Schema.fbs		Schema.fbs
Tensor.fbs		Tensor.fbs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

format

format

README.md

Arrow specification documents

Arrow Format Maturity and Stability

Files

format

Directory actions

More options

Directory actions

More options

Latest commit

History

format

Folders and files

parent directory

README.md

Arrow specification documents

Arrow Format Maturity and Stability