Data Model and Encoding

Data Model

Source files: common/src/types

RisingWave adapts a relational data model with extensive support for semi-structured data. Relational tables, including tables and materialized views, consist of a list of named, strong-typed columns.

Tables created by users have an implicit, auto-generated row-id column as their primary key; while for materialized views, the primary key is derived from queries. For example, the primary key of an aggregation (group-by) materialized view is the specified group keys.

NULL values mean missing or unknown fields. Currently, all columns are implicitly nullable.

Primitive data types:

  • Booleans: BOOLEAN
  • Integers: SMALLINT (16-bit), INT (32-bit), BIGINT (64-bit)
  • Decimals: NUMERIC
  • Floating-point numbers: REAL, DOUBLE
  • Strings: VARCHAR
  • Temporals: DATE, TIMESTAMP, TIMESTAMP WITH TIME ZONE, TIME, INTERVAL

Composite data types:

  • Struct: A structure with a list of named, strong-typed fields.
  • List: A variable-length list of values with same data type.

In-Memory Encoding

Source files: common/src/array

In-memory data is encoded in arrays for vectorized execution. For variable-length data like strings, generally we use another offset array to mark the start of encoded values in a byte buffer.

A Data Chunk consists of multiple columns and a visibility array, as is shown in the left subgraph below. The visibility array marks each row as visible or not. This helps filtering some rows while keeping other data arrays unchanged.

A Stream Chunk consists of columns, visibility array and an additional ops column, as is shown in the right subgraph below. The ops column marks the operation of row, which can be one of Delete, Insert, UpdateDelete and UpdateInsert.

chunk

On-Disk Encoding

Source files: utils/memcomparable, utils/value-encoding

RisingWave stores user data in shared key-value storage called ‘Hummock’. Tables, materialized views and checkpoints of internal streaming operators are encoded into key-value entries. Every field of a row, a.k.a. cell, is encoded as a key-value entry, except that NULL values are omitted.

row-format

Considering that ordering matters in some cases, e.g. result set of an order-by query, fields of keys must preserve the order of original values after being encoded into bytes. This is what memcomparable is used for. For example, integers must be encoded in big-endian and the sign bit must be flipped to preserve order. In contrast, the encoding of values does not need to preserve order.