This is generally good practice if the caches won't be reused (though arguably pedantic for the `stage1-toolchain` stage). `docker history` on comparable images showed that this saves a few hundred MB on stage1, and ~60MB on the `apt-get` layer of `ci-container-agent`.