In order to send and receive messages from a node, we need to create case classes that contain the fields which we need to send or receive.
The case classes dont need to contain standard fields defined in maelstrom protocol like src, dest, type, msg_id, in_reply_to, etc. Those are automatically added and parsed by the runtime.
Keep in mind that if you use ask api, framework adds a msg_id to the message. If you use reply api, framework adds a in_reply_to to the message.
Caution
If you try to use reply api for a message that does not have a msg_id (i.e. sent using send api), it will throw an error at runtime.
Caution
If you use ask api, the called will wait for the reply with a timeout. If the reply is not received within the timeout, it will return Timeout error.
The idea behind framework design is that when writing solutions to problems, should should not have to think about msg_id and other things. Much like when we write an HTTP/GRPC client or server.
receive api takes a handler function I => ZIO[MaelstromRuntime & R, Nothing, Unit]
Note
I needs have a zio.json.JsonDecoder instance
R can be anything. You will need to provide R & MaelstromRuntime when you run the ZIO effect
Here's an example
Receive
caseclassGossip(numbers:Seq[Int])derivesJsonCodecvalmessageHandler=receive[Gossip](msg=>for{src<-MaelstromRuntime.srcme<-MaelstromRuntime.meothers<-MaelstromRuntime.others_<-ZIO.logDebug(s"received $msg from $src")_<-ZIO.logDebug(s"my node id is $me")_<-ZIO.logDebug(s"other node ids are $others")}yield())
ask api is a combination of send and receive. It sends a message to a remote node and waits for a reply. It expects a zio.json.JsonDecoder instance for the reply & a zio.json.JsonEncoder instance for the request message.
You can either use the default timeout (configured in Settings) or provide a custom timeout for the operation.
Uses the default timeout configured in Settings (100ms by default)
Ask with default timeout
caseclassPing(text:String)derivesJsonCodeccaseclassPong(text:String)derivesJsonCodec// Uses default timeout configured in Settings (100ms by default)valpingResult:ZIO[MaelstromRuntime,AskError,Pong]=NodeId("n2").ask[Pong](Ping("Hello"))
Override the default timeout for this specific operation
Ask with custom timeout
caseclassPing(text:String)derivesJsonCodeccaseclassPong(text:String)derivesJsonCodec// Custom timeout overrides the default timeoutvalpingResult:ZIO[MaelstromRuntime,AskError,Pong]=NodeId("n2").ask[Pong](Ping("Hello"),5.seconds)
The ask api can return either a successful response or an AskError
AskError
typeAskError=Error|DecodingFailure|Timeout
Ask error can be one of the following:
Timeout if the reply was not received within given duration
DecodingFailure if the reply could not be decoded into the given type
Error if the sender sends an error message instead of the reply message.
Sender can send an error message if it encounters an error while processing the request message or when request is invalid. You can read more about error messages in the error messages section and error handling section
sealedtraitErrorCode(privatevalcode1:Int,valdefinite:Boolean){defcode:Int=code1overridedeftoString:String=s"error: ${this.getClass.getSimpleName.replace("$","")}, code: $code"}objectErrorCode:/** * Indicates that the requested operation could not be completed within a timeout. */objectTimeoutextendsErrorCode(0,false)/** * Thrown when a client sends an RPC request to a node which does not exist. */objectNodeNotFoundextendsErrorCode(1,true)/** * Use this error to indicate that a requested operation is not supported by the current implementation. Helpful for stubbing out APIs during development. */objectNotSupportedextendsErrorCode(10,true)/** * Indicates that the operation definitely cannot be performed at this time--perhaps because the server is in a read-only state, has not yet been initialized, believes its peers to be down, and so on. Do not use this error for indeterminate cases, when the operation may actually have taken place. */objectTemporarilyUnavailableextendsErrorCode(11,true)/** * The client's request did not conform to the server's expectations, and could not possibly have been processed. */objectMalformedRequestextendsErrorCode(12,true)/** * Indicates that some kind of general, indefinite error occurred. Use this as a catch-all for errors you can't otherwise categorize, or as a starting point for your error handler: it's safe to return internal-error for every problem by default, then add special cases for more specific errors later. */objectCrashextendsErrorCode(13,false)/** * Indicates that some kind of general, definite error occurred. Use this as a catch-all for errors you can't otherwise categorize, when you specifically know that the requested operation has not taken place. For instance, you might encounter an indefinite failure during the prepare phase of a transaction: since you haven't started the commit process yet, the transaction can't have taken place. It's therefore safe to return a definite abort to the client. */objectAbortextendsErrorCode(14,true)/** * The client requested an operation on a key which does not exist (assuming the operation should not automatically create missing keys). */objectKeyDoesNotExistextendsErrorCode(20,true)/** * The client requested the creation of a key which already exists, and the server will not overwrite it. */objectKeyAlreadyExistsextendsErrorCode(21,true)/** * The requested operation expected some conditions to hold, and those conditions were not met. For instance, a compare-and-set operation might assert that the value of a key is currently 5; if the value is 3, the server would return precondition-failed. */objectPreconditionFailedextendsErrorCode(22,true)/** * The requested transaction has been aborted because of a conflict with another transaction. Servers need not return this error on every conflict: they may choose to retry automatically instead. */objectTxnConflictextendsErrorCode(30,true)/** * Custom error code * * @param code the error code */caseclassCustom(overridevalcode:Int)extendsErrorCode(code,false)
You can send an error message to any node id as a reply to another message. Here's an example
Send standard error
caseclassInMessage()derivesJsonCodecvalprogram=receive[InMessage](_=>replyError(ErrorCode.PreconditionFailed,"some text message"))
Send custom error
caseclassInMessage()derivesJsonCodecvalprogram=receive[InMessage](_=>replyError(ErrorCode.Custom(1005),"some text message"))
There is an api called replyError that can be used to return an instance of Error to the sender.
Reply Error
caseclassQuery(id:Int)derivesJsonCodecvalprogram=receive[Query](_=>replyError(ErrorCode.PreconditionFailed,"some text message"))
Alternatively, you can fail the ZIO effect with an Error type and the framework will automatically return that error to the sender if there is a msg_id in the request message.
Info
One key difference between replyError and ZIO.fail is that replyError allows you to continue the handler execution after returning the error. While ZIO.fail will immediately stop handler execution for the current message.
Failed ZIO Effect
caseclassQuery(id:Int)derivesJsonCodecvalprogram=receive[Query](_=>ZIO.fail(Error(ErrorCode.PreconditionFailed,"some text message")))
There is a very easy way to convert any AskError to an Error response to the sender.
ZIO-Maelstrom provides LinkKv, LwwKv, SeqKv & LinTso clients to interact with these services. SeqKv, LwwKv & LinKv are all key value stores. They have the same api but different consistency guarantees.
Native apis are provided by the maelstrom services
All KV operations support timeout configuration. You can either use the default timeout (configured in Settings) or provide a custom timeout for specific operations.
read
Takes a key and returns the value of the key. If the value does not exist, it returns KeyDoesNotExist error code.
Uses the default timeout configured in Settings (100ms by default)
CAS stands for compare-and-swap. It takes a key, a value and an expected value. It writes the value against the key only if the expected value matches the current value of the key. If the value is different, then it returns PreconditionFailed error code. If the key does not exist, it returns KeyDoesNotExist error code. If you set createIfNotExists to true, it will create the key if it does not exist.
Above example will write 3 to counter only if the current value of counter is 1. If the current value is different, it will return PreconditionFailed error code.
Takes a key and a value and writes the value against the key only if the key does not exist. If the key already exists, it returns PreconditionFailed error code.
This is a high level api built on top of other apis. It takes a key, a function that takes the current value and returns a new value. It reads the current value of the key, applies the function and writes the new value against the key. If the value has changed in the meantime, it applies the function again and keeps trying until the value does not change. This is useful for implementing atomic operations like incrementing a value.
The timeout value does not apply to entire operation but to each individual read, cas and write operation. So the total time taken by the operation can be more than the timeout value. Retries are only done when the value has changed in the meantime. And other error is returned immediately. This also applies to updateZIO api.
updateZIO
This is a high level api built on top of other apis. It takes a key, a function that takes the current value and returns a ZIO that returns a new value. It reads the current value of the key, applies the ZIO and writes the new value against the key. If the value has changed in the meantime, it applies the function again and keeps trying until the value does not change. This is very similar to update but the function can be a ZIO which can do some async operations.
When retries happen, the ZIO is retried as well, so side effects should be avoided in this function.
Important
Because all these apis are built on top of ask api, they can return AskError which you may need to handle. According to maelstrom documentation, they can return KeyDoesNotExist or PreconditionFailed error codes.
In case of network partition or delay, all of the above apis can return Timeout error code.
When incorrect types are used to decode the response, they can return DecodingFailure error code.
Tip
key and value of the key value store can be any type that has a zio.json.JsonCodec instance
Below are the settings that can be configured for a node
Log Level
The default log level is LogLevel.Info.
If you want more detailed logs, you can set it to LogLevel.Debug.
If you want to disable logs, you can set it to LogLevel.None
Log Format
Log format can be either Plain or Colored. Default is colored.
Concurrency
This is the concurrency level for processing messages. Default is 1024.
This means 1024 request messages(receive api) + 1024 response messages (ask api) = 2048 messages can be processed in parallel.
Default Timeout
The default timeout for ask operations and all KV store operations. Default is 100 milliseconds.
This timeout is used when no explicit timeout is provided to ask() or KV operations like read(), write(), cas().
You can override this globally for all operations, or provide operation-specific timeouts when needed.
You can log at different levels using ZIO's logging APIs - ZIO.logDebug, ZIO.logInfo, etc.
All these APIs log to STDERR because STDOUT is used for sending messages.
You can configure the log level using settings API.
By default, log statements are colored. You can change it to plain using settings API
Logging
objectMainApplicationextendsMaelstromNode{overridevalconfigure=NodeConfig.withLogLevelDebugdefprogram=for_<-ZIO.logDebug("Starting node")_<-ZIO.logInfo("Received message")_<-ZIO.logWarning("Something is wrong")_<-ZIO.logError("Something is really wrong")yield()}
Above program, when initialized, will output the following:
When developing a solution, you sometimes want to test it without maelstrom. While you can use stdIn to enter the input, you can also hardcode the input messages in the program itself.
This will run the entire program with the input from the file. With file input you also get to simulate delay in inputs using sleep statements as shown above.