,
Luigi De Simone
,
Raffaele Della Corte
,
Stefano Toscano
Creative Commons Attribution 4.0 International license
ccebaefeabb4c2885da1f723557e4dec
(Get MD5 Sum)
The orchestration of application instances is critical for the efficient management of cloud computing platforms. Specifically, the serverless paradigm automates container spawning and de-spawning based on actual load, mitigating inefficiencies, such as over- and under-provisioning, that might compromise Service Level Objectives (SLOs). This dynamic behavior introduces significant challenges concerning initialization and termination latencies, which are exacerbated when enforcing real-time requirements in mixed-criticality systems. The existing literature already addresses key issues, such as reducing cold-start times and assuring real-time performance to deployed instances. However, container orchestration times remain an overlooked factor that can severely affect instance startup times, especially when the orchestrator is subject to intense workloads. In this paper, we present PREEMPT-FaaS, an orchestration controller that, unlike commonly adopted controllers, adopts a fixed-priority preemptive scheduling of requests to guarantee reduced orchestration times to high-priority and highly critical instances. We implemented PREEMPT-FaaS as a Rust custom controller for Kubernetes (K8s), along with a patch for Knative, a popular serverless platform built upon K8s. We perform an extensive experimental campaign of PREEMPT-FaaS, including the serving of AI workloads, such as, recurring neural networks and video analytics, showing up to ∼6× reduction of orchestration times under high load and improving end-to-end cold-start times of critical instances, with a consequent reduction of service-level laencies (up to ∼2 s reduction under stress at the 95th percentile).
@Article{cinque_et_al:DARTS.12.2.1,
author = {Cinque, Marcello and De Simone, Luigi and Della Corte, Raffaele and Toscano, Stefano},
title = {{PREEMPT-FaaS: Taming Orchestration Times in Latency-Sensitive Serverless Environments (Artifact)}},
pages = {1:1--1:21},
journal = {Dagstuhl Artifacts Series},
ISSN = {2509-8195},
year = {2026},
volume = {12},
number = {2},
editor = {Cinque, Marcello and De Simone, Luigi and Della Corte, Raffaele and Toscano, Stefano},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/entities/document/10.4230/DARTS.12.2.1},
URN = {urn:nbn:de:0030-drops-266185},
doi = {10.4230/DARTS.12.2.1},
annote = {Keywords: Edge-Cloud, Orchestration, Containers, Mixed-Criticality, Kubernetes}
}