Combining Dynamic Software Updating (DSU) with Multi-Version Execution (MVE) in order to avoid live update delays in services.

Overview

Dynamic Software Updating (DSU) is a technique for patching stateful software without shutting it down, which enables both timely updates and non-stop service. Unfortunately, bugs in the update itself—whether in the changed code or in the way the change is introduced dynamically—may cause the updated software to crash or misbehave. Furthermore, the time taken to dynamically apply the update may be unacceptable if it introduces a long delay in service.

This paper makes the key observation that both problems can be addressed by employing Multi-Version Execution (MVE). To avoid delay in service, the update is applied to a forked copy while the original system continues to operate. Once the update completes, the MVE system monitors that the responses of both versions agree for the same inputs. Expected divergences are specified by the programmer using an MVE-specific DSL. Unexpected divergences signal possible errors and roll back the update, which simply means terminating the updated version and reverting to the original version. This is safe because the MVE system keeps the state of both versions in sync. If the new version shows no problems after a warmup period, operators can make it permanent and discard the original version.

We have implemented this approach, which we call MVEDSUa (MVE + DSU is pronounced Medusa) by extending the Kitsune DSU framework with Varan, a state-of-the-art MVE system. We have used MVEDSUa to update several high-performance servers: Redis, Memcached, and vsftpd. Our results show that MVEDSUa significantly reduces the update-time delay, imposes little overhead in steady state, and easily recovers from a variety of update-related errors.

Download

The source code of MVEDSUa’s rules are available here.

Publications

  • Mvedsua: Higher Availability Dynamic Software Updates via Multi-Version Execution

    Luís Pina, Anastasios Andronidis, Michael Hicks, Cristian Cadar

    2019 Architectural Support for Programming Languages and Operating Systems (ASPLOS 19)