I’ve been playing around with using an in-process redis using miniredis as the backing store for a service that relies on go-workers2 for background processing. You can find the code in my example-miniredis project on GitHub.
While miniredis was created as something to be only used in unit tests, this may be useful in running a service that normally requires a redis in a totally standalone mode. I view such a standalone mode as critical for a good development experience in creating integrations against a service, since you can run the service locally without any of its downstream dependencies and still expect to have it respond sensibly.
This standalone mode is inspired by the -dev
option of HashiCorp’s Vault. Being asked to run a universe of dependencies locally via docker-compose (imagine needing to integrate against several services and their docker-compose configurations … holy crap) or only being able to talk to a running instance, either a test instance in production or an instance in a non-production environment, is antithetical to a good local development experience.
I’d also like to share some thoughts on go-workers2 that I’ve compiled during the creation of this example codebase. You may find it valuable to follow the links below so that you may see what I am talking about:
-
main.go#49: The
Add
value for a parameter namedclass
feels a little strange to me. Changing it to an arbitrary value seems to work just as well. My suspicion is that it relates to the Sidekiq ruby implementation needing the name of the worker class that it should invoke when the enqueued message gets picked up for processing. I guess we have it available in case we are mixing it up with Sidekiq’s ruby workers and want them to be able to pick up any jobs that we enqueue from a go producer. -
main.go#119:
ProcessID
is supposed to uniquely identify this instance. In an implementation that uses a real redis, and in a multi-node environment like k8s where these things can go up & down, how do we set that up?Maybe we can’t and therefore we will lose all the in-progress jobs during a restart, redeploy or a pod move. It may mean that we should keep track of the progress of the jobs ourselves, for example with checkpoints or status codes in a database, and a periodic reconciliation to make sure any abandoned jobs are restarted.
We can also decide that we will not be running more than one worker node at a time, and then we don’t need to worry about the ProcessID. We may still need to ensure that there is truly only one node. For example when an old one is being shut down, the new one should not be trying to process anything. This could be done with leadership election or a lock, and that is something we can also use redis for.
One point to note here is that all things eventually fail, and redis may fail in a spectacular way to the point we cannot recover any in-progress jobs. Therefore keeping track of them ourselves and being able to recover any incomplete ones is something we should do irrespective of whether we we run multiple nodes or a single one.
Hope you find this useful.