This page describes the proposed Wire Protocol for the Streaming Service. See Pravega Concepts for more information.
Data is sent over the wire in self-contained "messages" that are either "requests" (messages sent from the client to the server) or "replies" (responses sent from the server to the client).
All the requests and replies have 8 byte headers with two fields (all data is written in BigEndian format).
- Message Type: An Integer (4 bytes) identifies the message type and determines the subsequent fields. (Note that the protocol can be extended by adding new types.)
- Length: An Integer (4 bytes) (Messages should be less than 224, but the upper bits remain zero). Payload size of the message (possibly zero, indicating there is no data). The remainder of the fields are specific to the type of the message. A few important messages are listed below.
Segment
: String (2 bytes) followed by that many bytes of Java's Modified UTF-8. This Segment indicates the Stream Segment that was read.Offset
: Long (8 bytes). TheOffset
in the Stream Segment to read from.suggestedLength
of Reply: Integer (4 bytes). The clients can request for the required length to the server (but the server may allot a different number of bytes.delegationToken
: String (2 byte) followed by that many bytes of Java's Modified UTF-8. This was added to perform auth. It is an opaque-to-the-client token provided by the Controller that says it's allowed to make this call.
Segment
: String (2 bytes) followed by that many bytes of Java's Modified UTF-8. This Segment indicates the Stream Segment that was read.Offset
: Long (8 bytes). TheOffset
in the Stream Segment to read from.Tail
: Boolean (1 bit). If the read was performed at the tail of the Stream Segment.EndOfSegment
: Boolean (1 bit). If the read was performed at the end of the Stream Segment.Data
: Binary (remaining length in the message).
The client requests to read from a particular Segment at a particular Offset
. It then receives one or more replies in the form of SegmentRead
messages. These contain the data they requested (assuming it exists). The server may decide transferring to the client more or less data than it was asked for, splitting that data in a suitable number of reply messages.
RequestId
: Long (8 bytes). This field contains the client-generated ID that has been propagated to identify a client request.writerId
: UUID (16 bytes). It identifies the requesting appender.Segment
: String (2 bytes) followed by that many bytes of Java's Modified UTF-8. This Segment indicates the Stream Segment that was read.delegationToken
: String (2 byte) followed by that many bytes of Java's Modified UTF-8. This was added to perform auth. It is an opaque-to-the-client token provided by the Controller that says it's allowed to make this call.
RequestId
: Long (8 bytes). This field contains the client-generated ID that has been propagated to identify a client request.Segment
: String (2 bytes) followed by that many bytes of Java's Modified UTF-8. This Segment indicates the Stream Segment to append.writerId
: UUID (16 bytes). It identifies the requesting appender.lastEventNumber
: Long (8 bytes). It specifies the last event number in the Stream.
writerId
: UUID (16 bytes). It identifies the requesting appender.Data
: This holds the contents of the block.
writerId
: UUID (16 bytes). It identifies the requesting appender.sizeOfWholeEvents
: Integer (4 bytes). It is the total number of bytes in this block (starting from the beginning) that is composed of whole (meaning non-partial) events.Data
: This holds the contents of the block.numEvents
: Integer (4 bytes). It specifies the current number of events.lastEventNumber
: Long (8 bytes). It specifies the value of last event number in the Stream.
The ApppendBlockEnd
has a sizeOfWholeEvents
to allow the append block to be less than full. This allows the client to begin writing a block before it has a large number of events. This avoids the need to buffer up events in the client and allows for lower latency.
- Data: A Partial Event is an Event at the end of an Append block that did not fully fit in the Append block. The remainder of the Event will be available in the
AppendBlockEnd
.
Data
: It contains the Event's data (only valid inside the block).
writerId
: UUID (16 bytes).It identifies the requesting appender.eventNumber
: Long (8 bytes). This matches thelastEventNumber
in the append block.previousEventNumber
: Long (8 bytes). This is the previous value ofeventNumber
that was returned in the lastDataAppeneded
.
When appending a client:
- Establishes a connection to the host chosen by it.
- Sends a "Setup Append" request.
- Waits for the "Append Setup" reply.
After receiving the "Append Setup" reply, it performs the following:
- Send a
AppendBlock
request. - Send as many Events that can fit in the block.
- Send an
AppendBlockEnd
request.
While this is happening, the server will be periodically sending it DataAppended
replies acking messages. Note that there can be multiple "Appends Setup" for a given TCP connection. This allows a client to share a connection when producing to multiple Segments.
A client can optimize its appending by specifying a large value in it's AppendBlock
message, as the events inside of the block do not need to be processed individually.