Re: process supervisor - considerations for docker

From: John Regan <john_at_jrjrtech.com>
Date: Wed, 25 Feb 2015 21:27:30 -0600

On Thu, Feb 26, 2015 at 01:38:14AM +0000, Dreamcat4 wrote:
> Really overwhelmed by all this. It is a much more positive response
> than expected. And many good things. I am very grateful. There are
> some points I'd still like us to continue discussing.
>
> But Gorka - I must say that your new ubuntu base image really seems *a
> lot* better than the phusion/baseimage one. It is fantastic and an
> excellent job you have done there, and you continue to update it with new
> versions of s6, etc. Can't really say thank you enough for that.

Just to toss an idea out there - the phusion baseimage, my baseimage,
and Gorka's are all based off Ubuntu. Our images, while smaller than
phusion's, are still fairly large downloads.

Have any of you guys looked at Alpine linux? I think the guys at
Alpine may have accidentally created the perfect distro for
Docker. It's a *great* balance between an Ubuntu-based image and a
Busybox/buildroot-based image:

* The base image is something like 10MB, unlike Ubuntu
* It still has a package manager, unlike busybox
* Their packages tend to bring in less additional crap
* You still have that composability that Ubuntu/CentOS/etc images have

I actually just submitted s6 packages to Alpine for testing earlier
today, so hopefully it'll be in their repos at some point.

The fundamental problem with a buildroot/busybox-based image is that
it's really hard to build new images. You can't just say "FROM this;
RUN package-manager install that"

I think if anybody were to start up a new baseimage project, Alpine is
the way to go, hands-down. Tiny, efficient images.
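
To illustrate, a base image could be as simple as the following. This is
purely a sketch - it assumes the s6 package actually makes it into
Alpine's repos, and the /etc/s6 scan directory is just an illustrative
convention:

```dockerfile
# Sketch only: assumes an "s6" apk package exists and that
# /etc/s6 holds the service directories (illustrative path).
FROM alpine:3.1
RUN apk add --update s6 && rm -rf /var/cache/apk/*
ENTRYPOINT ["s6-svscan", "/etc/s6"]
```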
>
> Anyway, back to the discussion:
>
> On Wed, Feb 25, 2015 at 3:57 PM, John Regan <john_at_jrjrtech.com> wrote:
> > On Wed, Feb 25, 2015 at 03:58:07PM +0100, Gorka Lertxundi wrote:
> >> Hello,
> >>
> >> After that great post of John's, I tried to solve exactly the same
> >> problems you describe. I created my own base image, based primarily on
> >> John's and Phusion's base images.
> >
> > That's awesome - I get so excited when I hear somebody's actually
> > read, digested, and taken action based on something I wrote. So cool!
> > :)
> >
> >>
> >> See my thoughts below.
> >>
> >> 2015-02-25 12:30 GMT+01:00 Laurent Bercot <ska-skaware_at_skarnet.org>:
> >>
> >> >
> >> > (Moving the discussion to the supervision_at_list.skarnet.org list.
> >> > The original message is quoted below.)
> >> >
> >> > Hi Dreamcat4,
> >> >
> >> > Thanks for your detailed message. I'm very happy that s6 found an
> >> > application in docker, and that there's such an interest for it!
> >> > skaware_at_list.skarnet.org is indeed the right place to reach me and
> >> > discuss the software I write, but for s6 in particular and process
> >> > supervisors in general, supervision_at_list.skarnet.org is the better
> >> > place - it's full of people with process supervision experience.
> >> >
> >> > Your message gives a lot of food for thought, and I don't have time
> >> > right now to give it all the attention it deserves. Tonight or
> >> > tomorrow, though, I will; and other people on the supervision list
> >> > will certainly have good insights.
> >> >
> >> > Cheers!
> >> >
> >> > -- Laurent
> >> >
> >> >
> >> > On 25/02/2015 11:55, Dreamcat4 wrote:
> >> >
> >> >> Hello,
> >> >> Now there is someone (John Regan) who has made s6 images for docker.
> >> >> And written a blog post about it. Which is a great effort - and the
> >> >> reason I've come here. But it gives me a taste of wanting more.
> >> >> Something a bit more foolproof, and simpler, to work specifically
> >> >> inside of docker.
> >> >>
> >> >> From that blog post I get a general impression that s6 has many
> >> >> advantages. And it may be a good candidate for docker. But I would be
> >> >> remiss not to ask the developers of s6 themselves to take a
> >> >> personal interest in considering how s6 might best
> >> >> work inside of docker specifically. I hope that this is the right
> >> >> mailing list to reach s6 developers / discuss such matters. Is this
> >> >> the correct mailing list for s6 dev discussions?
> >> >>
> >> >> I've read and read around the subject of process supervision inside
> >> >> docker. Various people explain how or why they use various different
> >> >> process supervisors in docker (not just s6). None of them really quite
> >> >> seem ideal. I would like to be wrong about that but nothing has fully
> >> >> convinced me so far. Perhaps it is a fair criticism to say that I
> >> >> still have a lot more to learn in regards to process supervisors. But
> >> >> I have no interest in getting bogged down by that. To me, I already
> >> >> know more-or-less enough about how docker manages (or rather
> >> >> mis-manages!) its container processes to have an opinion about what
> >> >> is needed, from a docker-sided perspective. And I know enough to say
> >> >> that the docker project itself won't fix these issues: for one thing
> >> >> because it doesn't own what runs on the inside of containers, and
> >> >> also because of its single-process viewpoint. Anyway,
> >> >> that kind of political nonsense doesn't matter for our discussion. I
> >> >> just want to have a technical discussion about what is needed, and how
> >> >> might be the best way to solve the problem!
> >> >>
> >> >>
> >> >> MY CONCERNS ABOUT USING S6 INSIDE OF DOCKER
> >> >>
> >> >> In regards to s6 only, these are my currently perceived
> >> >> shortcomings when using it in docker:
> >> >>
> >> >> * it's not clear how to pass in program arguments via CMD and
> >> >> ENTRYPOINT in docker
> >> >
> >> > - in fact I have not seen ANY docker process supervisor solutions
> >> >> show how to do this (except perhaps phusion base image)
> >> >>
> >> >> * it is not clear if ENV vars are preserved. That is also something
> >> >> essential for docker.
> >> >
> >> >
> >> >> * s6 has many utilities s6-*
> >> >> - not clear which ones are actually required for making a docker
> >> >> process supervisor
> >> >>
> >> >>
> >> > * s6 not available yet as .deb or .rpm package
> >> >> - official packages are helpful because on different distros:
> >> >> + standard locations where to put config files and so on may
> >> >> differ.
> >> >> + to install man pages too, in the right place
> >> >>
> >> >>
> >> > * s6 is not available as official single pre-compiled binary file for
> >> >> download via wget or curl
> >> >> - which would be the most ideal way to install it into a docker
> >> >> container
> >> >>
> >> >
> >> >> ^^ Some of these perceived shortcomings are more important /
> >> >> significant than others! Some are not in the remit of s6 development
> >> >> to be concerned about. Some are mild nit-picking, or the ignorance of
> >> >> not knowing, having not actually tried out s6 before.
> >> >>
> >> >> But my general point is that it is not clear-enough to me (from my
> >> >> perspective) whether s6 can actually satisfy all of the significant
> >> >> docker-specific considerations. Which I have not properly stated yet.
> >> >> So here they are listed below…
> >> >>
> >> >>
> >> >> DOCKER-SPECIFIC CONSIDERATIONS FOR A PROCESS SUPERVISOR
> >> >>
> >> >> A good process supervisor for docker should ideally:
> >> >>
> >> >> * be a single pre-compiled binary program file. That can be downloaded
> >> >> by curl/wget (or can be installed from .deb or .rpm).
> >> >>
> >> >
> >> I always include s6, execline, s6-portable-utils in my base images, they
> >> are pretty
> >> small and manageable. You could find/build them using this builder
> >> container:
> >>
> >> https://github.com/glerchundi/container-s6-builder
> >>
> >> I try to keep this repo as updated as possible so that when Laurent
> >> releases a
> >> version I publish a new release in github:
> >>
> >> https://github.com/glerchundi/container-s6-builder/releases
> >>
> >
> > That's genius, I never thought of using GitHub's release feature like
> > that.
>
> Great work Gorka for providing these linux x86_64 binaries on Github releases.
> This was exactly the kind of thing I was hoping for / looking for in
> regards to that aspect.
>
> > Expect a pull request soon for the other s6 packages.
> >
> >> >
> >> >> * can take directly command and arguments. With argv[] like this:
> >> >> "process_supervisor" "my_program_or_script" "my program or script
> >> >> arguments…"
> >> >
> >> >
> >> What do you want to achieve with that? Get into the container to debug? If
>
> Right, so I was half-expecting this kind of response (and from John
> Regan too). In my initial post I could not think of a concise-enough
> way to demonstrate and explain my reasoning behind that specific
> request. At least not without entering into a whole other big long
> discussion that would have detracted / derailed from some of the other
> important considerations and discussion points in respect to docker.
>
> Basically, without that capability (which I am aware goes against
> convention for process supervisors that occupy pid 1), you are
> forcing docker users to choose an XOR (exclusive-OR) between either
> using s6 process supervision or the ability to specify command line
> arguments to their docker containers (via ENTRYPOINT and/or CMD).
> Which essentially is a breakage of those ENTRYPOINT and CMD
> features of docker. At least that is my understanding of how pretty
> much all of these process supervisors behave. And it is not any
> criticism levelled at s6 alone, since you would not typically expect
> this feature anyway (before we had containerisation etc.). It is very
> docker-specific.
>
> Both of you seem to have stated effectively that you don't really see
> such a pressing reason why it is needed.
>
> So then it's another thing entirely for me to explain why, and to
> convince you guys that being able to continue to use CMD and
> ENTRYPOINT for specifying command line arguments remains important
> after adding a process supervisor. There are actually many different
> reasons why that is desirable (that I can think of right now). But
> that's another discussion and case for me to make to you.
>
> I would be happy to go into that aspect further. Perhaps off the
> mailing list is a better idea, and then to come back here again with a
> short summary when that discussion is over and concluded. But I don't
> want to waste anyone's time, so please reply and indicate if you would
> really like for me to go into more depth with better justifications
> for why we need that particular feature.

Hm. I think it would be helpful if you gave a couple of examples.

When I build images, I approach it in a task-oriented manner, not a
process-oriented manner. The image doesn't run ejabberd - it runs a
chat service.

So that said, the user shouldn't arbitrarily change the task at
runtime. The task of the image is to provide a chat service, so I'm not
sure what the need is to change that at runtime by changing the
ENTRYPOINT and CMD arguments. Those are things you set up at
build-time, and at run-time you supply parameters needed for that
task, like "look up users from this database" or something.

Now, occasionally an image will have some sort of "side task", like
"export the chat logs," in which case I will write a script to serve
as an alternative ENTRYPOINT for runtime (with the --entrypoint
parameter). But I reserve that for one-and-done tasks; you wouldn't
regularly start a container with --entrypoint specified.

Like I said, I'm not sure why you'd build an image that accomplishes
some task, then have the user arbitrarily change what that task is.
But I'm always open to hearing new use-cases.
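
To make the distinction concrete, here's roughly how I use an image
day-to-day - the image name, env var, and script path below are all
hypothetical:

```shell
# Normal runs: the task is fixed at build time; only run-time
# parameters vary (hypothetical image and env var):
docker run -e DB_HOST=db.example.com example/chat-service

# One-and-done side task, via an alternative entrypoint script:
docker run --entrypoint /usr/local/bin/export-chat-logs example/chat-service
```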
>
> >> so,
> >> just using CMD with [ "/init" ] is enough and allows anyone to get into the
> >> container
> >> without running any init process / supervisor.
> >>
> >> I cannot imagine why this would be useful, but if it was required,
> >> registering a
> >> new service into s6 at runtime could be a possible solution. This service
> >> would
> >> include your custom program or script.
>
> Would appreciate coming back to how we can do this later on. After I
> have made a more convincing case for why it's actually needed. My
> naive assumption, not knowing any of s6 yet: it should be that simply
> passing on an argv[] array ought to be possible. And perhaps without
> too many extra hassles or hoops to jump through.
>
> >> >> * will pass on all ENV vars to "my_program_or_script" faithfully
> >> >>
> >> >>
> >> I take care of ENV's in my init process.
> >>
> >> Concretely, as phusion does, I dump all ENV's into
> >> /etc/container_environment:
> >>
> >> https://github.com/glerchundi/container-base/blob/master/rootfs/init#L19-L30
> >>
> >> And then make use of them in this script which internally uses s6-envdir to
> >> publish
> >> environment variables in the spawned process.
>
> This is great. Thank you!
>
> >> https://raw.githubusercontent.com/glerchundi/container-base/master/rootfs/usr/bin/with-contenv
> >>
> >> So all container init.d, service run and service finish scripts will always
> >> have
> >> environment variables accessible.
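
The core of that env-capture step can be sketched in a few lines of
portable shell. This is a simplified illustration, not Gorka's actual
script - in particular it does not handle multi-line values:

```shell
#!/bin/sh
# Simplified sketch of the phusion-style env capture.
# Limitation vs the real init: multi-line values are not handled.
dump_env() {
  dir="$1"
  mkdir -p "$dir"
  # one file per variable; file contents = value (s6-envdir format)
  env | while IFS='=' read -r name value; do
    printf '%s' "$value" > "$dir/$name"
  done
}
# a real init would call: dump_env /etc/container_environment
```

The with-contenv side is then essentially a wrapper that runs
`s6-envdir /etc/container_environment my_program ...` to re-publish
those variables into the spawned process.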
> >
> > I just wanted to let you know this image is pretty sweet and makes
> > mine look like garbage :)
> >
> >>
> >> * will run as PID 1 inside the linux namespace
> >> >>
> >> >>
> >> OK. s6 can run as pid 1, so no problem. The init script I mentioned
> >> delegates
> >> responsibility to s6 so it runs as an init/pid1/supervisor perfectly.
>
> I like your init script a lot, because it is easy to understand and
> well commented. And although written in bash, your init script is
> short enough to be re-written in the future in any other language, for
> example as a pre-compiled 'C' program etc. (should such a thing ever
> be needed / worthwhile).
>
> >> > * where my_program_or_script may spawn BOTH child AND non-child
> >> >> (orphaned) processes
> >> >>
> >> >
> >> Laurent correct me if I'm wrong but, this is an intrinsic feature of s6. As
> >> it can run
> >> as pid 1, it inherits all orphaned processes.
> >>
> >>
> >> >
> >> >> * when "process_supervisor" (e.g. s6 or whatever) receives a TERM signal
> >> >> * it faithfully passes that signal to "my_program_or_script"
> >> >> * it also passes that signal to any orphaned non-child processes too
> >> >>
> >> >>
> >> yes, it can:
> >>
> >> https://github.com/glerchundi/container-base/blob/master/rootfs/etc/s6/.s6-svscan/finish#L23-L31
>
> That is awesome ^^. Like the White Whale of Moby Dick. It really felt
> good to see this here in your base image. Really appreciate that.
>
> > I agree, this seems like the right way to properly stop a Docker
> > container. Like I said in the other email, I think it's up to the
> > image-creator to do all this - not the responsibility of s6 directly.
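
For anyone who hasn't clicked through, the shutdown sweep in that kind
of .s6-svscan/finish script boils down to something like this - a
sketch of the idea, not Gorka's exact script:

```shell
#!/bin/sh
# Sketch of an .s6-svscan/finish shutdown sweep. Running as pid 1,
# "kill -1" signals every other process in the container's
# namespace, so orphans get cleaned up before the container exits.
kill -TERM -1 2>/dev/null   # ask remaining processes to exit
sleep 2                     # give them a moment to clean up
kill -KILL -1 2>/dev/null   # then force anything still alive
```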
> >
> >>
> >>
> >> > * when my_program_or_script dies, or exits
> >> >> * clean up ALL remaining non-child orphaned processes afterwards
> >> >> * which share the same linux namespace
> >> >> - this is VERY important for docker, as docker does not do this
> >> >>
> >> >>
> >> this should be your concrete service concern. For example, imagine you have
> >> a load
> >> balancer with two supervised processes, nginx (main) & confd. Probably you
> >> want to
>
> Nope. At least for the initial guidelines I gave - the container has
> only 1 supervised process (not 2), as per the official docker
> guidelines. Having a 2nd or 3rd etc. supervised process means optional
> behaviour...
>
So this is something I actually disagree with - it seems like
everybody gets hung up on how many processes the container is running.
Like I mentioned in a previous email, people start talking about how
"the Docker way" means "one container, one process."

I think the correct approach is one *task* per container. The
container should do one thing - serve web pages, be a chat service,
run a webapp, and so on. Who cares if you need one process or many
processes to get the task done?

A pretty good example is running something like GitLab. You wind up
running a web server and an SSH daemon in the same container, but
they form a single, logical service.

To clarify, I'm against running ssh as a "let the user log in and poke
around" service like Phusion does. But in GitLab, it makes sense - the
users add their SSH keys via the web interface, and only use it for
cloning/pushing/pulling/etc, not interactively.

There is a limit to what kinds of processes should be in a container.
Again, using GitLab - you need to run a MySQL or PostgreSQL database
for GitLab, but that's really a whole different task and should be in
a different container.

I *do* agree with implementing optional features, though. In my
images, if you link a container to a logstash container, then the
logstash-forwarder will kick in and collect logs. If you don't, it
doesn't start the logstash-forwarder process.

> >> have confd always running, no matter why this process could die, but you
> >> want it up.
> >> Instead, if your main process dies, you want your container to die too. So
> >> that should
> >> be implemented in your specific use case:
> >>
> >> For example:
> >>
> >> https://github.com/glerchundi/container-mariadb/blob/master/rootfs/etc/s6/mysql/finish#L8-L13
> >
> > This is really similar to calling `s6-svscanctl -t /path/to/whatever`
>
> Sorry, I am not familiar enough with s6 and these things yet to
> properly understand that example ^^. But I am guessing that the
> general essence of what you were saying is something like this:
>
> "the supervised process should try to shut down gracefully when it
> voluntarily exits or receives a TERM signal". Else: "you should be
> responsible for writing a finish script for that supervised process".
>
> Which is already a given. I am much less concerned if a process in a
> docker container fails to handle its own shutdown properly, when your
> general .s6-svscan finish script will kill the orphans. That ensures
> the docker daemon isn't getting messed up by an inability to stop
> containers ('error: no such process'). Which was my primary and most
> major concern.
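
For reference, the "container dies when the main service dies" pattern
from Gorka's mariadb example is basically a per-service finish script
along these lines - the /etc/s6 path is illustrative, and this is not
his exact script:

```shell
#!/bin/sh
# Sketch of a finish script for the *main* service: when it exits
# for good, tell s6-svscan to bring down the whole supervision
# tree, so pid 1 exits and the container stops with it.
exec s6-svscanctl -t /etc/s6
```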
>
> > I'm not sure if s6 will do anything subtly differently, I'm guessing
> > it treats receiving a TERM signal in exactly the same way. Just wanted
> > to raise that and double-check there's no differences.
>
> ^^
>
> >> > So to ENSURE these things:
> >> >>
> >> >> * to pass full command line arguments and ENV vars to
> >> >> "my_program_or_script"
> >> >>
> >> >> * ensure that when "my_program_or_script" exits, (crashes or normal exit)
> >> >> * no processes are left running in that linux namespace
> >> >>
> >> >> * ensure that when the service receives a TERM, (docker stop)
> >> >> * no processes are left running in that linux namespace
> >> >>
> >> >> * ensure that when the service receives a KILL, (docker stop)
> >> >> * no processes are left running in that linux namespace
> >> >>
> >> >>
> >> >> BUT in addition, any extra configurability should be entirely optional:
> >> >>
> >> >> * to add supplemental run scripts (for support programs like cron and
> >> >> rsyslog etc)
> >> >> * which is what most general process supervisors consider as their
> >> >> main mechanism for starting services
> >> >>
> >> >> SO
> >> >> * if "my_program_or_script" is supplied as an argument, THAT is the
> >> >> main process running inside the docker container.
> >> >> * if no "my_program_or_script" argument is supplied, then use
> >> >> whatever other conventional ways to determine the main process from
> >> >> the directories of run scripts.
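
That dispatch rule is simple enough to sketch - illustrative shell
only, and a real init would exec into the process rather than echo:

```shell
#!/bin/sh
# Sketch of the dispatch rule above (illustrative; a real init
# would exec into the process instead of echoing).
main_process() {
  if [ "$#" -gt 0 ]; then
    # a command was supplied: it is the main supervised process
    echo "main: $*"            # real init: supervise "$@" under s6
  else
    # no command: fall back to the image's run-script directories
    echo "main: run scripts"   # real init: exec s6-svscan /etc/s6
  fi
}
```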
> >> >>
> >> >>
> >> >>
> >> >> SUMMARY
> >> >>
> >> >> Current solutions for docker seem to be "too complex" for various
> >> >> reasons. Such as:
> >> >>
> >> >> * no mandatory ssh server (phusion baseimage fails)
> >> >> * no python dependency (supervisor fails, and phusion baseimage fails)
> >> >> * no mandatory cron or rsyslog (current s6 base image fails, and
> >> >> phusion base image fails)
> >> >> * no "hundreds of cli tools" - most of which may never be used
> >> >> (current s6 base image fails)
> >> >> * no awkward intermediate bootstrap script required to write ENV vars
> >> >> and cmdline args to a file (runit fails)
> >> >>
> >> >> So for whichever of those reasons, it feels like the problem of
> >> >> docker remains unsolved, and not fully addressed by any individual
> >> >> solution. What's the best course of action?
>
> Here is an idea how we might put s6 to the test:
>
> Let's make a minimal busybox base image with s6, plus the other
> necessary mechanisms from Gorka's ubuntu image - so cutting out all
> that apt configuration stuff. I believe this is a good way of
> demonstrating s6 inside of docker.
>
> (Busybox does not have bash though.)
>
> Then we might do a couple of useful things with resulting image:
>
> * Compare file size against the original busybox image (with and without s6).
> * Run unix `time` command (on `docker run <our busybox image>`) to
> compare the startup initialisation times with and without s6.
>
> It's just an idea at this point.
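
Concretely, the comparison could be as simple as this (the image names
are hypothetical):

```shell
# Compare on-disk image sizes (names hypothetical):
docker images | grep -E 'busybox|busybox-s6'

# Compare startup overhead; run each several times and average:
time docker run --rm busybox true
time docker run --rm busybox-s6 true
```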
>
> >> >> How can these problems be solved into one single tool?
> >> >>
> >> >> I am hoping that the person who wrote this page:
> >> >>
> >> >> http://skarnet.org/software/s6/why.html
> >> >>
> >> >> Might be able to comment on some of these above ^^ docker-specific
> >> >> considerations? If so, it would really be appreciated.
> >> >>
> >> >> Also cc'ing the author of the docker s6 base images. Who perhaps can
> >> >> comment about some of the problems he has encountered. Many thanks for
> >> >> any comments again (but please reply on the mailing list).
> >> >>
> >> >>
> >> >> Kind Regards
> >> >> dreamcat4
> >> >>
> >> >
> >> >
Received on Thu Feb 26 2015 - 03:27:30 UTC
