Search results for `serverless`

[USENIX '24] Harmonizing Efficiency and Practicability: Optimizing Resource Utilization in Serverless Computing with Jiagu

  dblp   doi

Abstract

Current serverless platforms struggle to optimize resource utilization due to their dynamic and fine-grained nature. Conventional techniques like overcommitment and autoscaling fall short, often sacrificing utilization for practicability or incurring performance trade-offs. Overcommitment requires predicting performance to prevent QoS violations, introducing a trade-off between prediction accuracy and overhead. Autoscaling must scale instances quickly in response to load fluctuations to reduce resource wastage, but more frequent scaling also incurs more cold start overhead. This paper introduces Jiagu to harmonize efficiency with practicability through two novel techniques. First, pre-decision scheduling achieves accurate prediction while eliminating overhead by decoupling prediction and scheduling. Second, dual-staged scaling achieves frequent adjustment of instances with minimal overhead. We have implemented a prototype and evaluated it using real-world applications and traces from the public cloud platform. Our evaluation shows a 54.8% improvement in deployment density over commercial clouds (with Kubernetes) while maintaining QoS, 81.0%–93.7% lower scheduling costs, and a 57.4%–69.3% reduction in cold start latency compared to existing QoS-aware schedulers.

[USENIX '24] ALPS: An Adaptive Learning, Priority OS Scheduler for Serverless Functions

  dblp   doi

Abstract

FaaS (Function-as-a-Service) workloads feature unique patterns. Serverless functions are ephemeral, highly concurrent, and bursty, with an execution duration ranging from a few milliseconds to a few seconds. The workload behaviors pose new challenges to kernel scheduling. Linux CFS (Completely Fair Scheduler) is workload-oblivious and optimizes long-term fairness via proportional sharing. CFS neglects the short-term demands of CPU time from short-lived serverless functions, severely impacting the performance of short functions. Preemptive shortest job first—shortest remaining process time (SRPT)—prioritizes shorter functions in order to satisfy their short-term demands of CPU time and, therefore, serves as a best-case baseline for optimizing the turnaround time of short functions. A significant downside of approximating SRPT, however, is that longer functions might be starved. In this paper, we propose a novel application-aware kernel scheduler, ALPS (Adaptive Learning, Priority Scheduler), based on two key insights. First, approximating SRPT can largely benefit short functions but may inevitably penalize long functions. Second, CFS provides necessary infrastructure support to implement user-defined priority scheduling. To this end, we design ALPS to have a novel, decoupled scheduler frontend and backend architecture, which unifies approximate SRPT and proportional-share scheduling. ALPS’ frontend sits in the user space and approximates SRPT-inspired priority scheduling by adaptively learning from an SRPT simulation on a recent past workload. ALPS’ backend uses eBPF functions hooked to CFS to carry out the continuously learned policies sent from the frontend to inform scheduling decisions in the kernel. This design adds workload intelligence to workload-oblivious OS scheduling while retaining the desirable properties of OS schedulers. We evaluate ALPS extensively using two production FaaS workloads (Huawei and Azure), and results show that ALPS achieves a reduction of 57.2% in average function execution duration compared to CFS.
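
To make the frontend idea concrete, below is a minimal sketch (not ALPS's implementation) of learning priorities from an SRPT simulation: replay a recent trace of invocations under single-CPU SRPT, measure each function's mean turnaround, and map shorter-turnaround functions to higher priority levels that a backend could then enforce. The trace format and the number of priority levels are assumptions for illustration.

```python
import heapq
from collections import defaultdict

def srpt_simulate(trace):
    """Single-CPU SRPT simulation over jobs given as (arrival_s, duration_s, fn).
    Returns each function's mean turnaround time under SRPT."""
    trace = sorted(trace)                      # by arrival time
    ready, turnaround = [], defaultdict(list)  # ready: heap of (remaining, arrival, fn)
    t, i = 0.0, 0
    while i < len(trace) or ready:
        if not ready:                          # CPU idle: jump to the next arrival
            t = max(t, trace[i][0])
        while i < len(trace) and trace[i][0] <= t:
            arr, dur, fn = trace[i]
            heapq.heappush(ready, (dur, arr, fn))
            i += 1
        rem, arr, fn = heapq.heappop(ready)
        next_arr = trace[i][0] if i < len(trace) else float("inf")
        run = min(rem, next_arr - t)           # run until done or preempted
        t += run
        if rem - run > 1e-12:
            heapq.heappush(ready, (rem - run, arr, fn))
        else:
            turnaround[fn].append(t - arr)
    return {fn: sum(v) / len(v) for fn, v in turnaround.items()}

def derive_priorities(trace, levels=10):
    """Shorter simulated turnaround -> lower (better) priority level."""
    mean_tt = srpt_simulate(trace)
    ranked = sorted(mean_tt, key=mean_tt.get)
    return {fn: idx * levels // len(ranked) for idx, fn in enumerate(ranked)}

trace = [(0.0, 0.005, "thumbnail"), (0.001, 2.0, "video-encode"), (0.002, 0.004, "auth")]
print(derive_priorities(trace, levels=4))      # shorter functions get lower levels
```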

[USENIX '24] StreamBox: A Lightweight GPU SandBox for Serverless Inference Workflow

  dblp   doi

Abstract

The dynamic workload and latency sensitivity of DNN inference drive a trend toward exploiting serverless computing for scalable DNN inference serving. Usually, GPUs are spatially partitioned to serve multiple co-located functions. However, existing serverless inference systems isolate functions in separate monolithic GPU runtimes (e.g., CUDA contexts), which are too heavy for short-lived and fine-grained functions, leading to high startup latency, a large memory footprint, and expensive inter-function communication. In this paper, we present StreamBox, a new lightweight GPU sandbox for serverless inference workflows. StreamBox unleashes the potential of GPU streams and efficiently realizes them for serverless inference by implementing fine-grained, auto-scaling memory management, allowing transparent and efficient intra-GPU communication across functions, and enabling PCIe bandwidth sharing among concurrent streams. Our evaluations over real-world workloads show that StreamBox reduces the GPU memory footprint by up to 82% and improves throughput by 6.7× compared to state-of-the-art serverless inference systems.

[USENIX '24] A Secure, Fast, and Resource-Efficient Serverless Platform with Function REWIND

  dblp   doi

Abstract

Serverless computing often utilizes the warm container technique to improve response times. However, this method, which allows function containers to be reused across different requests of the same function type, creates persistent vulnerabilities in memory and file systems. These vulnerabilities can lead to security breaches such as data leaks. Traditional approaches to these issues often suffer from performance drawbacks and high memory requirements due to extensive use of user-level snapshots and complex restoration processes. This paper introduces REWIND, an innovative and efficient serverless function execution platform designed to address these security and efficiency concerns. REWIND ensures that after each function request, the container is reset to an initial state free from any sensitive data, including a thorough restoration of the file system to prevent data leakage. It incorporates a kernel-level memory snapshot management system, which significantly lowers memory usage and accelerates the rewind process. Additionally, REWIND optimizes runtime by reusing memory regions and leveraging the temporal locality of function executions, enhancing performance while maintaining strict data isolation between requests. The REWIND prototype is implemented on OpenWhisk and Linux and evaluated with serverless benchmark workloads. The evaluation results demonstrate that REWIND delivers substantial memory savings while maintaining high function execution performance. In particular, the low memory usage allows more warm containers to be kept alive, improving both the throughput and the latency of function execution while preserving isolation between function requests.

[SIGCOMM '24] YuanRong: A Production General-purpose Serverless System for Distributed Applications in the Cloud

  dblp   doi

Abstract

We design, implement, and evaluate YuanRong, the first production general-purpose serverless platform with a unified programming interface, multi-language runtime, and a distributed computing kernel for cloud-based applications. YuanRong addresses many limitations of existing Function-as-a-Service (FaaS) systems, particularly their limited performance and missing features. First, our fast function system supports sub-millisecond function invocation and locality-aware hierarchical scheduling. Second, our multi-semantic built-in data system achieves object exchange latency of 200 microseconds, enabling end-to-end latency of 2 milliseconds for streaming elements at 5 Gbps throughput. Third, the extensible, portable Service Bridge bridges stateless and stateful operations, allowing connection reuse and distributed transactions, and offers a unified backend abstraction for multi-cloud portability. YuanRong has been deployed for over 3 years at Huawei across nearly 20 datacenter regions, processing up to 30 billion requests per day on more than 100,000 CPU cores, with a daily average CPU usage of 53%. It serves various serverless workloads, including enhanced FaaS services, microservices, data analytics, deep model training and serving, and some HPC workloads. Our experience shows that Spring-based microservices can be migrated to YuanRong within one day, reducing resource costs by 90%, demonstrating its generality and efficiency in supporting a broad spectrum of applications.

[OSDI '24] Sabre: Hardware-Accelerated Snapshot Compression for Serverless MicroVMs

  dblp   doi

Abstract

MicroVM snapshotting significantly reduces cold start overheads in serverless applications. Snapshotting enables storing part of the physical memory of a microVM guest into a file and later restoring from it to avoid long cold start-up times. Prefetching memory pages from snapshots can further improve the effectiveness of snapshotting. However, the efficacy of prefetching depends on the size of the memory that needs to be restored. Lossless page compression is therefore a great way to improve the coverage of the memory footprint that snapshotting with prefetching achieves. Unfortunately, the high overhead and high CPU cost of software-based (de)compression make this impractical. We introduce Sabre, a novel approach to snapshot page prefetching based on hardware-accelerated (de)compression. Sabre leverages an increasingly pervasive near-memory analytics accelerator available in modern datacenter processors. We show that by appropriately leveraging such accelerators, microVM snapshots of serverless applications can be compressed by up to a factor of 4.5×, with nearly negligible decompression costs. We use this insight to build an efficient page prefetching library capable of speeding up memory restoration from snapshots by up to 55%. We integrate the library with production-grade Firecracker microVMs and evaluate its end-to-end performance on a wide set of serverless applications.

[OSDI '24] ServerlessLLM: Low-Latency Serverless Inference for Large Language Models

  dblp   doi

Abstract

This paper presents ServerlessLLM, a distributed system designed to support low-latency serverless inference for Large Language Models (LLMs). By harnessing the substantial near-GPU storage and memory capacities of inference servers, ServerlessLLM achieves effective local checkpoint storage, minimizing the need for remote checkpoint downloads and ensuring efficient checkpoint loading. The design of ServerlessLLM features three core contributions: (i) fast multi-tier checkpoint loading, featuring a new loading-optimized checkpoint format and a multi-tier loading system, fully utilizing the bandwidth of complex storage hierarchies on GPU servers; (ii) efficient live migration of LLM inference, which enables newly initiated inferences to capitalize on local checkpoint storage while ensuring minimal user interruption; and (iii) startup-time-optimized model scheduling, which assesses the locality statuses of checkpoints on each server and schedules the model onto servers that minimize the time to start the inference. Comprehensive evaluations, including microbenchmarks and real-world scenarios, demonstrate that ServerlessLLM dramatically outperforms state-of-the-art serverless systems, reducing latency by 10-200× across various LLM inference workloads.
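
As a concrete illustration of the startup-time-optimized scheduling idea, here is a minimal sketch under assumed per-tier load bandwidths and GPU queueing times (all numbers are hypothetical, not measurements from the paper): estimate, per server, how long until the checkpoint is resident and a GPU is free, and place the model on the minimizer.

```python
# Hypothetical load bandwidths per storage tier (GB/s); measured per server in reality.
TIER_BANDWIDTH_GBPS = {"dram": 20.0, "ssd": 3.0, "remote": 0.8}

def estimated_startup_s(server, model_size_gb):
    """Estimated time until the model is ready to serve on `server`.
    server = {"tier": tier holding the checkpoint ('gpu' if already loaded),
              "queue_s": wait until a GPU on this server frees up}"""
    load_s = 0.0 if server["tier"] == "gpu" else model_size_gb / TIER_BANDWIDTH_GBPS[server["tier"]]
    return server["queue_s"] + load_s

def place(servers, model_size_gb):
    """Pick the server minimizing estimated time-to-first-inference."""
    return min(servers, key=lambda name: estimated_startup_s(servers[name], model_size_gb))

servers = {
    "A": {"tier": "remote", "queue_s": 0.0},   # would need a 14 GB download
    "B": {"tier": "dram",   "queue_s": 0.3},   # checkpoint cached in host memory
    "C": {"tier": "ssd",    "queue_s": 0.0},
}
print(place(servers, model_size_gb=14.0))      # -> "B": 0.3 + 14/20 = 1.0 s
```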

[NSDI '24] Jolteon: Unleashing the Promise of Serverless for Serverless Workflows

  dblp   doi

Abstract

Serverless computing promises automatic resource provisioning to relieve the burden of developers. Yet, developers still have to manually configure resources on current serverless platforms to satisfy application-level requirements. This is because cloud applications are orchestrated as serverless workflows with multiple stages, exhibiting a complex relationship between resource configuration and application requirements. We propose Jolteon, an orchestrator to unleash the promise of automatic resource provisioning for serverless workflows. At the core of Jolteon is a stochastic performance model that combines the benefits of whitebox modeling to capture the execution characteristics of serverless computing and blackbox modeling to accommodate the inherent performance variability. We formulate a chance constrained optimization problem based on the performance model, and exploit sampling and convexity to find optimal resource configurations that satisfy user-defined cost or latency bounds. We implement a system prototype of Jolteon and evaluate it on AWS Lambda with a variety of serverless workflows. The experimental results show that Jolteon outperforms the state-of-the-art solution, Orion, by up to 2.3× on cost and 2.1× on latency.
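
A minimal sketch of the sampling side of chance-constrained configuration search, with a made-up per-stage latency model standing in for Jolteon's whitebox/blackbox model: accept a candidate configuration only if the empirical probability of meeting the latency bound reaches the target, then take the cheapest accepted one.

```python
import random

def stage_latency(mem_gb):
    """Stand-in performance model: work shrinks with resources, plus random
    variability (all numbers hypothetical)."""
    return 2.0 / mem_gb + random.gauss(0.05, 0.02)

def meets_chance_constraint(config, latency_bound_s, target_prob=0.95, samples=1000):
    """Empirically check P(sum of stage latencies <= bound) >= target_prob."""
    hits = 0
    for _ in range(samples):
        total = sum(stage_latency(m) for m in config)   # sequential stages
        hits += total <= latency_bound_s
    return hits / samples >= target_prob

def cheapest_config(candidates, latency_bound_s):
    """Among feasible configurations, minimize a simple memory*time cost proxy."""
    feasible = [c for c in candidates if meets_chance_constraint(c, latency_bound_s)]
    return min(feasible, key=lambda c: sum(m * (2.0 / m + 0.05) for m in c), default=None)

# Three-stage workflow; per-stage memory choices in GB.
candidates = [(1, 1, 1), (2, 2, 2), (4, 2, 1), (4, 4, 4)]
print(cheapest_config(candidates, latency_bound_s=4.0))   # -> (2, 2, 2) with these numbers
```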

[ISCA '24] EcoFaaS: Rethinking the Design of Serverless Environments for Energy Efficiency

  dblp   doi

Abstract

While serverless computing is increasingly popular, its energy and power consumption behavior is hardly explored. In this work, we perform a thorough characterization of the serverless environment and observe that it poses a set of challenges not effectively handled by existing energy-management schemes. Short serverless functions execute in opaque virtualized sandboxes, are idle for a large fraction of their invocation time, context switch frequently, and are co-located in a highly dynamic manner with many other functions of diverse properties. These features are a radical shift from more traditional application environments and require a new approach to managing energy and power. Driven by these insights, we design EcoFaaS, the first energy management framework for serverless environments. EcoFaaS takes a user-provided end-to-end application Service Level Objective (SLO). It then splits the SLO into per-function deadlines that minimize the total energy consumption. Based on the computed deadlines, EcoFaaS sets the optimal per-invocation core frequency using a prediction algorithm. The algorithm performs a fine-grained analysis of the execution time of each invocation, while taking into account the specific invocation inputs. To maximize efficiency, EcoFaaS splits the cores in a server into multiple Core Pools, where all the cores in a pool run at the same frequency and are controlled by a single scheduler. EcoFaaS dynamically changes the sizes and frequencies of the pools based on the current system state. We implement EcoFaaS on two open-source serverless platforms (OpenWhisk and Knative) and evaluate it using diverse serverless applications. Compared to state-of-the-art energy-management systems, EcoFaaS reduces the total energy consumption of serverless clusters by 42% while simultaneously reducing the tail latency by 34.8%.
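
To illustrate the SLO-splitting step, the following sketch uses hypothetical predictions and a simple proportional policy (not EcoFaaS's energy-optimal split): divide an end-to-end SLO across a function chain in proportion to predicted execution times, then pick the lowest core frequency that still meets each deadline, assuming runtime scales inversely with frequency.

```python
def split_slo(slo_ms, predicted_ms):
    """Split an end-to-end SLO into per-function deadlines proportional to each
    function's predicted execution time (a simple proportional heuristic)."""
    total = sum(predicted_ms.values())
    return {fn: slo_ms * t / total for fn, t in predicted_ms.items()}

def pick_frequency(deadline_ms, predicted_ms_at_fmax, freqs_ghz, fmax_ghz=3.0):
    """Lowest frequency whose scaled execution time still meets the deadline,
    assuming (hypothetically) runtime scales inversely with frequency."""
    for f in sorted(freqs_ghz):
        if predicted_ms_at_fmax * (fmax_ghz / f) <= deadline_ms:
            return f
    return fmax_ghz

predicted = {"resize": 40.0, "detect": 120.0, "notify": 20.0}
deadlines = split_slo(slo_ms=450.0, predicted_ms=predicted)
print(deadlines)                      # "detect" gets 300 ms of the 450 ms SLO
print(pick_frequency(deadlines["detect"], predicted["detect"], [1.2, 1.8, 2.4, 3.0]))  # 1.2
```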

[INFOCOM '24] Demeter: Fine-grained Function Orchestration for Geo-distributed Serverless Analytics

  dblp   doi

Abstract

In the era of global services, low-latency analytics on large-volume geo-distributed data has become a regular demand for application decision-making. Serverless computing facilitates fast function start-up and deployment, making it an attractive basis for geo-distributed analytics. We argue that the serverless paradigm holds the potential to breach current performance bottlenecks via fine-grained function orchestration. However, how to configure it for geo-distributed analytics remains ambiguous. To fill this gap, we present Demeter, a scalable fine-grained function orchestrator for geo-distributed serverless analytics systems. Demeter aims to minimize the composite cost of co-existing jobs while meeting user-specified Service Level Objectives (SLOs). To handle volatile environments and learn diverse function demands, a Multi-Agent Reinforcement Learning (MARL) solution is used to co-optimize per-function placement and resource allocation. The MARL solution extracts holistic and compact states via hierarchical graph neural networks and uses a novel actor network to shrink the huge decision space and model complexity. Finally, we implement Demeter and evaluate it using realistic workloads. The experimental results reveal that Demeter significantly saves costs by 23.3%∼32.7%, while reducing SLO violations by over 27.4%, surpassing state-of-the-art solutions.

[FAST '24] MinFlow: High-performance and Cost-efficient Data Passing for I/O-intensive Stateful Serverless Analytics

  dblp   doi

Abstract

Serverless computing has revolutionized application deployment, obviating traditional infrastructure management and dynamically allocating resources on demand. A significant use case is I/O-intensive applications like data analytics, which widely employ the pivotal "shuffle" operation. Unfortunately, the shuffle operation poses severe challenges due to the massive PUT/GET requests to remote storage, especially in high-parallelism scenarios, leading to severe performance degradation and high storage cost. Existing designs optimize data passing performance from multiple angles, but they operate in isolation, still introducing unforeseen performance bottlenecks and leaving optimization opportunities untapped. In this paper, we develop MinFlow, a holistic data passing framework for I/O-intensive serverless analytics jobs. MinFlow first rapidly generates numerous feasible multi-level data passing topologies with far fewer PUT/GET operations, then leverages an interleaved partitioning strategy to divide the topology DAG into small bipartite sub-graphs to optimize function scheduling, further reducing over half of the data transmitted to remote storage. Moreover, MinFlow also develops a precise model to determine the optimal configuration, thus minimizing data passing time under practical function deployments. We implement a prototype of MinFlow, and extensive experiments show that MinFlow significantly outperforms state-of-the-art systems, FaaSFlow and Lambada, in both job completion time and storage cost.

[EUROSYS '24] Serialization/Deserialization-free State Transfer in Serverless Workflows

  dblp   doi

Abstract

Serialization and deserialization play a dominant role in the state transfer time of serverless workflows, leading to substantial performance penalties during workflow execution. We identify the key reason as a lack of ability to efficiently access the (remote) memory of another function. We propose RMMap, an OS primitive for remote memory map. It allows a serverless function to directly access the memory of another function, even if it is located remotely. RMMap is the first to completely eliminate serialization and deserialization when transferring states between any pair of functions in (unmodified) serverless workflows. To make remote memory map efficient and feasible, we co-design it with fast networking (RDMA), the OS, the language runtime, and the serverless platform. Evaluations using real-world serverless workloads show that integrating RMMap with Knative reduces serverless workflow execution time by up to 2.6× and improves resource utilization by 86.3%.

[EUROSYS '24] Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts

  dblp   doi

Abstract

Serverless computing allows developers to easily deploy and scale stateless functions in ephemeral workers. As a result, serverless computing has been widely used for many applications, such as computer vision, video processing, and HTML generation. However, we find that the stateless nature of serverless computing wastes many of the important benefits modern language runtimes have to offer. A notable example is the extensive profiling and Just-in-Time (JIT) compilation effort that runtimes implement to achieve acceptable performance of popular high-level languages, such as Java, JavaScript, and Python. Unfortunately, when modern language runtimes are naively adopted in serverless computing, all of these efforts are lost upon worker eviction. Checkpoint-restore methods alleviate the problem by resuming workers from snapshots taken after initialization. However, production-grade language runtimes can take up to thousands of invocations to fully optimize a single function, thus rendering naive checkpoint-restore policies ineffective. This paper proposes Pronghorn, a serverless snapshot orchestrator that automatically monitors function performance and decides (1) when it is the right moment to take a snapshot and (2) which snapshot to use for new workers. Pronghorn is agnostic to the underlying platform and JIT runtime, thus easing its integration into existing runtimes and worker deployment environments (container, virtual machine, etc.). On a set of representative serverless benchmarks, Pronghorn provides end-to-end median latency improvements of 37.2% across 9 out of 13 benchmarks (20-58% latency reduction) when compared to state-of-the-art checkpointing policies.
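
A minimal sketch of a monitoring-driven policy in the spirit of the paper's two decisions, with hypothetical thresholds rather than Pronghorn's actual policy: keep timing invocations of a worker, take a new snapshot whenever recent latency has improved markedly over the latency captured by the last snapshot, and start new workers from the snapshot with the best recorded latency.

```python
from statistics import median

class SnapshotOrchestrator:
    """Toy policy: (1) snapshot when the JIT has made the worker noticeably
    faster than the last snapshot; (2) restore new workers from the fastest
    snapshot seen so far. Thresholds are illustrative."""

    def __init__(self, window=50, improvement=0.10):
        self.window = window            # invocations per measurement window
        self.improvement = improvement  # required relative speedup to re-snapshot
        self.latencies = []
        self.snapshots = []             # list of (median_latency_ms, snapshot_id)

    def record(self, latency_ms, take_snapshot):
        """Feed one invocation latency; call `take_snapshot()` when warranted."""
        self.latencies.append(latency_ms)
        if len(self.latencies) < self.window:
            return
        current = median(self.latencies[-self.window:])
        best = self.snapshots[-1][0] if self.snapshots else float("inf")
        if current < best * (1 - self.improvement):
            self.snapshots.append((current, take_snapshot()))
        self.latencies = self.latencies[-self.window:]

    def snapshot_for_new_worker(self):
        """Which snapshot should a new worker restore from?"""
        return min(self.snapshots)[1] if self.snapshots else None

orch = SnapshotOrchestrator()
for i, latency in enumerate([100.0] * 50 + [60.0] * 50):   # JIT warms up halfway
    orch.record(latency, take_snapshot=lambda: f"snap-{i}")
print(orch.snapshot_for_new_worker())   # the snapshot taken after the speedup
```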

[EUROSYS '24] Optimus: Warming Serverless ML Inference via Inter-Function Model Transformation

  dblp   doi

Abstract

Serverless ML inference is an emerging cloud computing paradigm for low-cost, easy-to-manage inference services. In serverless ML inference, each call is executed in a container; however, the cold start of containers results in long inference delays. Unfortunately, most existing works do not work well because they still need to load models into containers from scratch, which is the bottleneck based on our observations. Therefore, this paper proposes a low-latency serverless ML inference system called Optimus via a new container management scheme. Our key insight is that the loading of a new model can be significantly accelerated when using an existing model with a similar structure in a warm but idle container. We thus develop a novel idea of inter-function model transformation for serverless ML inference, which delves into models within containers at a finer granularity of operations, designs a set of in-container meta-operators for both CNN and transformer model transformation, and develops an efficient scheduling algorithm with linear complexity for a low-cost transformation strategy. Our evaluations on thousands of models show that Optimus reduces inference latency by 24.00% ~ 47.56% in both simulated and real-world workloads compared to state-of-the-art work.

[ASPLOS '24] CodeCrunch: Improving Serverless Performance via Function Compression and Cost-Aware Warmup Location Optimization

  dblp   doi

Abstract

Serverless computing has a critical problem of function cold starts. To minimize cold starts, state-of-the-art techniques predict function invocation times to warm them up. Warmed-up functions occupy space in memory and incur a keep-alive cost, which can become exceedingly prohibitive under bursty load. To address this issue, we design CodeCrunch, which introduces the concept of serverless function compression and exploits server heterogeneity to make serverless computing more efficient, especially under high memory pressure.

[ASPLOS '24] RainbowCake: Mitigating Cold-starts in Serverless with Layer-wise Container Caching and Sharing

  dblp   doi

Abstract

Serverless computing has grown rapidly as a new cloud computing paradigm that promises ease of management, cost efficiency, and auto-scaling by shipping functions via self-contained virtualized containers. Unfortunately, serverless computing suffers from severe cold-start problems: starting containers incurs non-trivial latency. Full container caching is widely applied to mitigate cold-starts, yet it has recently been outperformed by two lines of research: partial container caching and container sharing. However, both techniques exhibit drawbacks. Partial container caching deals effectively with burstiness while leaving cold-start mitigation halfway; container sharing reduces cold-starts by enabling containers to serve multiple functions while suffering from excessive memory waste due to over-packed containers. This paper proposes RainbowCake, a layer-wise container pre-warming and keep-alive technique that effectively mitigates cold-starts with sharing awareness at minimal memory waste. With structured container layers and sharing-aware modeling, RainbowCake is robust and tolerant to invocation bursts. We seize the opportunity for container sharing hidden in the startup process of standard container techniques. RainbowCake breaks the startup process of a container into three stages and manages different container layers individually. We develop a sharing-aware algorithm that makes event-driven, layer-wise caching decisions in real time. Experiments on OpenWhisk clusters with real-world workloads show that RainbowCake reduces function startup latency by 68% and memory waste by 77% compared to state-of-the-art solutions.
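
A minimal sketch of the layer-wise keep-alive intuition, assuming a three-layer split (user code on top of a language runtime on top of a bare container) and illustrative TTLs rather than RainbowCake's actual stages or algorithm: expiry peels one layer at a time, so the runtime and bare layers stay warm and can be shared by other functions.

```python
import time

LAYERS = ["user", "lang", "bare"]          # most specific to most shareable
TTL_S = {"user": 60, "lang": 300, "bare": 900}   # illustrative keep-alive times

class LayerCache:
    """Toy layer-wise keep-alive: expired layers are peeled off one at a time
    instead of killing the whole container, so a 'lang' or 'bare' layer can
    still serve other functions."""

    def __init__(self):
        self.containers = []               # {"fn", "runtime", "layer", "last_used"}

    def reap(self, now=None):
        now = now or time.time()
        for c in self.containers:
            while c["layer"] != "bare" and now - c["last_used"] > TTL_S[c["layer"]]:
                c["layer"] = LAYERS[LAYERS.index(c["layer"]) + 1]   # peel one layer
        self.containers = [c for c in self.containers
                           if now - c["last_used"] <= TTL_S[c["layer"]]]

    def acquire(self, fn, runtime, now=None):
        """Warm-start preference: same-function 'user' container, then a 'lang'
        container with the same runtime (only user code must be loaded), then
        any 'bare' container; otherwise a full cold start."""
        now = now or time.time()
        for want in LAYERS:
            for c in self.containers:
                fn_ok = (want != "user") or (c["fn"] == fn)
                rt_ok = (want == "bare") or (c["runtime"] == runtime)
                if c["layer"] == want and fn_ok and rt_ok:
                    c.update(fn=fn, runtime=runtime, layer="user", last_used=now)
                    return f"{want}-start"
        self.containers.append({"fn": fn, "runtime": runtime,
                                "layer": "user", "last_used": now})
        return "cold-start"
```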

[ASPLOS '24] FaaSGraph: Enabling Scalable, Efficient, and Cost-Effective Graph Processing with Serverless Computing

  dblp   doi

Abstract

Graph processing is widely used in cloud services; however, current frameworks face challenges in efficiency and cost-effectiveness when deployed under the Infrastructure-as-a-Service (IaaS) model due to its limited elasticity. In this paper, we present FaaSGraph, a serverless-native graph computing scheme that enables efficient and economical graph processing through the co-design of graph processing frameworks and serverless computing systems. Specifically, we design a data-centric serverless execution model to efficiently power heavy computing tasks. Furthermore, we carefully design a graph processing paradigm to seamlessly cooperate with the data-centric model. Our experiments show that FaaSGraph improves end-to-end performance by up to 8.3× and reduces memory usage by up to 52.4% compared to state-of-the-art IaaS-based methods. Moreover, FaaSGraph delivers steady 99th-percentile performance under highly fluctuating workloads and reduces monetary cost by 85.7%.

[ASPLOS '24] In-Storage Domain-Specific Acceleration for Serverless Computing

  dblp   doi

Abstract

While (I) serverless computing is emerging as a popular form of cloud execution, datacenters are going through major changes: (II) storage disaggregation at the system infrastructure level and (III) integration of domain-specific accelerators at the hardware level. Each of these three trends individually provides significant benefits; however, when combined the benefits diminish. On the convergence of these trends, the paper makes the observation that for serverless functions, the overhead of accessing disaggregated storage overshadows the gains from accelerators. Therefore, to benefit from all these trends in conjunction, we propose In-Storage Domain-Specific Acceleration for Serverless Computing (dubbed DSCS-Serverless). The idea contributes a serverless model that utilizes a programmable accelerator embedded within computational storage to unlock the potential of acceleration in disaggregated datacenters. Our results with eight applications show that integrating a comparatively small accelerator within the storage (DSCS-Serverless) that fits within the storage's power constraints (25 Watts) significantly outperforms a traditional disaggregated system that utilizes an NVIDIA RTX 2080 Ti GPU (250 Watts). Further, the work highlights that disaggregation, the serverless model, and the limited power budget for computation in storage devices require a different design than the conventional practices of integrating microprocessors and FPGAs. This insight is in contrast with current practices of designing computational storage devices that are yet to address the challenges associated with the shifts in datacenters. In comparison with two such conventional designs that use ARM cores or a Xilinx FPGA, DSCS-Serverless provides 3.7× and 1.7× end-to-end application speedup, 4.3× and 1.9× energy reduction, and 3.2× and 2.3× better cost efficiency, respectively.

[ASPLOS '24] FaaSMem: Improving Memory Efficiency of Serverless Computing with Memory Pool Architecture

  dblp   doi

Abstract

In serverless computing, an idle container is not recycled immediately, in order to mitigate time-consuming cold container startups. These idle containers still occupy memory, exacerbating the memory shortage of today's data centers. Offloading their cold memory to a remote memory pool could potentially resolve this problem. However, existing offloading policies either hurt Quality of Service (QoS) or are too coarse-grained for serverless computing scenarios. We therefore propose FaaSMem, a dedicated memory offloading mechanism tailored for serverless computing with a memory pool architecture. FaaSMem is based on our finding that the memory a serverless container allocates in different stages has different usage patterns. Specifically, FaaSMem proposes Page Buckets (Puckets) to segregate memory pages into different segments and applies segment-wise offloading policies to them. FaaSMem also proposes a semi-warm period during the keep-alive stage, to seek a sweet spot between offloading effort and remote access penalty. Experimental results show that FaaSMem reduces the average local memory footprint by 9.9% - 79.8% and improves container deployment density to 108% - 218%, with a negligible 95th-percentile latency increase.

[ASPLOS '24] FUYAO: DPU-enabled Direct Data Transfer for Serverless Computing

  dblp   doi

Abstract

Serverless computing typically relies on the third-party forwarding method to transmit data between functions. This method couples control flow and data flow together, resulting in significantly slow data transmission speeds. This challenge makes it difficult for the serverless computing paradigm to meet the low-latency requirements of web services. To solve this problem, we propose decoupling the control flow from the data flow, enabling direct data transfer between functions. We introduce Fuyao, the first intermediate data transfer solution capable of reducing data transfer latency to the sub-millisecond level. Fuyao provides four different data transfer methods to cater to diverse data transfer requirements within or between nodes. For function pairs that communicate frequently, Fuyao builds a stateful direct connection between them, enabling rapid inter-function data exchange. We evaluate Fuyao using real-world representative benchmarks. Experimental results show that Fuyao outperforms state-of-the-art systems by up to 57× on latency.

[USENIX '23] Sponge: Fast Reactive Scaling for Stream Processing with Serverless Frameworks

  dblp   doi

Abstract

Streaming workloads deal with data that is generated in real-time. This data is often unpredictable and changes rapidly in volume. To deal with these fluctuations, current systems aim to dynamically scale in and out, redistribute, and migrate computing tasks across a cluster of machines. While many prior works have focused on reducing the overhead of system reconfiguration and state migration on pre-allocated cluster resources, these approaches still face significant challenges in meeting latency SLOs at low operational costs, especially upon facing unpredictable bursty loads. In this paper, we propose Sponge, a new stream processing system that enables fast reactive scaling of long-running stream queries by leveraging serverless framework (SF) instances. Sponge absorbs sudden, unpredictable increases in input loads from existing VMs with low latency and cost by taking advantage of the fact that SF instances can be initiated quickly, in just a few hundred milliseconds. Sponge efficiently tracks a small number of metrics to quickly detect bursty loads and make fast scaling decisions based on these metrics. Moreover, by incorporating optimization logic at compile-time and triggering fast data redirection and partial-state merging mechanisms at runtime, Sponge avoids optimization and state migration overheads during runtime while efficiently offloading bursty loads from existing VMs to new SF instances. Our evaluation on AWS EC2 and Lambda using the NEXMark benchmark shows that Sponge promptly reacts to bursty input loads, reducing 99th-percentile tail latencies by 88% on average compared to other stream query scaling methods on VMs. Sponge also reduces cost by 83% compared to methods that over-provision VMs to handle unpredictable bursty loads.
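
A minimal sketch of a metric-driven scaling trigger in this spirit (thresholds, rates, and the per-instance capacity are all hypothetical): when the backlog implied by the input/processing rate gap would breach the latency SLO before new VMs could boot, redirect the excess to serverless-framework instances instead.

```python
def scaling_decision(input_rate, processing_rate, backlog, slo_s,
                     vm_boot_s=120.0, per_sf_rate=50.0):
    """Decide where to absorb a burst (rates in events/s; numbers illustrative).
    Returns ("none" | "vms" | "serverless", number_of_sf_instances)."""
    excess = input_rate - processing_rate
    if excess <= 0:
        return ("none", 0)
    # Backlog breaches the SLO once it exceeds slo_s * processing_rate; it grows
    # at `excess` events/s, so compute the time left before that happens.
    time_to_violation = max(slo_s * processing_rate - backlog, 0.0) / excess
    if time_to_violation > vm_boot_s:
        return ("vms", 0)                      # slow-but-cheap VM scaling suffices
    return ("serverless", int(excess / per_sf_rate) + 1)   # fast SF instances

print(scaling_decision(input_rate=900, processing_rate=600, backlog=1000, slo_s=5))
# -> ('serverless', 7): the SLO would be breached long before a VM finishes booting
```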

[USENIX '23] Decentralized and Stateful Serverless Computing on the Internet Computer Blockchain

  dblp   doi

Abstract

The Internet Computer (IC) is a fast and efficient decentralized blockchain-based platform for the execution of general-purpose applications in the form of smart contracts. In many ways, the IC service is the antithesis of current serverless computing. Instead of ephemeral, stateless functions operated by a single entity, the IC offers decentralized stateful serverless computation over untrusted, independent datacenters. Developers deploy stateful canisters that serve calls either to end-users or to other canisters. The IC programming model is similar to that of serverless clouds, with applications written in modern languages such as Rust or Python, yet simpler: state is maintained automatically, without developer intervention. In this paper, we identify and address significant systems challenges to enable efficient decentralized stateful serverless computation: scalability, stateful execution through orthogonal persistence, and deterministic scheduling. We describe the design of the IC and characterize its performance and the operational data gathered over the past 1.5 years.

[USENIX '23] UnFaaSener: Latency and Cost Aware Offloading of Functions from Serverless Platforms

  dblp   doi

Abstract

We present UnFaaSener, a lightweight framework that enables serverless users to reduce their bills by harvesting non-serverless compute resources such as their VMs, on-premise servers, or personal computers. UnFaaSener is not a new serverless platform, nor does it require any support from today's production serverless platforms. It uses existing pub/sub services as the glue between the serverless application and offloading hosts. UnFaaSener's asynchronous scheduler takes into consideration the projected resource availability of the offloading hosts, the latency and cost components of serverless versus offloaded execution, the structure of the serverless application, and the developer's QoS expectations to find optimal offloading decisions. These decisions are then stored to be retrieved and propagated through the execution flow of the serverless application. The system supports partial offloading at the resolution of individual functions and utilizes several design choices to establish confidence and adaptiveness. We evaluate the effectiveness of UnFaaSener for serverless applications with various structures. UnFaaSener delivered cost savings of up to 89.8%, depending on the invocation pattern and the structure of the application, when we limited the offloading cap to 90% in our experiments.
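
A minimal sketch of the per-function offloading decision, with invented cost and latency fields rather than UnFaaSener's actual model: offload only when the host is projected to be available, the pub/sub hop plus host execution still meets the developer's latency expectation, and the offloaded execution is cheaper.

```python
def should_offload(fn, host, qos_latency_ms):
    """Offload iff the host has spare capacity, the end-to-end offloaded latency
    (pub/sub hop + execution on the host) stays within QoS, and it saves money.

    fn   = {"serverless_ms", "serverless_cost", "host_ms"}   # illustrative fields
    host = {"available", "pubsub_ms", "cost_per_ms"}
    """
    if not host["available"]:
        return False
    offloaded_ms = host["pubsub_ms"] + fn["host_ms"]
    offloaded_cost = fn["host_ms"] * host["cost_per_ms"]
    return offloaded_ms <= qos_latency_ms and offloaded_cost < fn["serverless_cost"]

fn = {"serverless_ms": 180, "serverless_cost": 1.5e-4, "host_ms": 240}
host = {"available": True, "pubsub_ms": 40, "cost_per_ms": 2e-7}
print(should_offload(fn, host, qos_latency_ms=400))   # True: 280 ms and ~3x cheaper
```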

[SOSP '23] XFaaS: Hyperscale and Low Cost Serverless Functions at Meta

  dblp   doi

Abstract

Function-as-a-Service (FaaS) has become a popular programming paradigm in serverless computing. As the responsibility for resource provisioning shifts from users to cloud providers, the ease of use of FaaS for users may come at the expense of extra hardware costs for cloud providers. Currently, there is no report on how FaaS platforms address this challenge or the level of hardware utilization they achieve. This paper presents the FaaS platform called XFaaS in Meta's hyperscale private cloud. XFaaS currently processes trillions of function calls per day on more than 100,000 servers. We describe a set of optimizations that help XFaaS achieve a daily average CPU utilization of 66%. Based on our anecdotal knowledge, this level of utilization might be several times higher than that of typical FaaS platforms. Specifically, to eliminate the cold start time of functions, XFaaS strives to approximate the effect that every worker can execute every function immediately. To handle load spikes without over-provisioning resources, XFaaS defers the execution of delay-tolerant functions to off-peak hours and globally dispatches function calls across datacenter regions. To prevent functions from overloading downstream services, XFaaS uses a TCP-like congestion-control mechanism to pace the execution of functions.
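
A minimal sketch of what a TCP-like pacing mechanism can look like (an AIMD window per downstream service; constants are illustrative, not Meta's): grow the allowed number of in-flight calls additively on success, and cut the window multiplicatively when the downstream signals overload.

```python
class DownstreamPacer:
    """Additive-increase / multiplicative-decrease window on in-flight calls
    toward one downstream service."""

    def __init__(self, window=1.0, max_window=1000.0):
        self.window = window
        self.max_window = max_window
        self.in_flight = 0

    def may_dispatch(self):
        """Can another function call to this downstream be started now?"""
        return self.in_flight < int(self.window)

    def on_dispatch(self):
        self.in_flight += 1

    def on_response(self, overloaded):
        """overloaded=True for e.g. an HTTP 429/503 or a timeout from the downstream."""
        self.in_flight -= 1
        if overloaded:
            self.window = max(1.0, self.window / 2)                              # MD
        else:
            self.window = min(self.max_window, self.window + 1.0 / self.window)  # AI
```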

[SOSP '23] Halfmoon: Log-Optimal Fault-Tolerant Stateful Serverless Computing

  dblp   doi

Abstract

Serverless computing separates function execution from state management. Simple retry-based fault tolerance might corrupt the shared state with duplicate updates. Existing solutions employ log-based fault tolerance to achieve exactly-once semantics, where every single read or write to the external state is associated with a log entry for deterministic replay. However, logging is not a free lunch; it introduces considerable overhead to stateful serverless applications. We present Halfmoon, a serverless runtime system for fault-tolerant stateful serverless computing. Our key insight is that it is unnecessary to symmetrically log both reads and writes. Instead, it suffices to log either reads or writes, i.e., asymmetrically. We design two logging protocols that enforce exactly-once semantics while providing log-free reads or writes, which are suitable for read- and write-intensive workloads, respectively. We theoretically prove that the two protocols are log-optimal, i.e., no other protocols can achieve lower logging overhead than ours. We provide a criterion for choosing the right protocol for a given workload, and a pauseless switching mechanism to switch protocols for dynamic workloads. We implement a prototype of Halfmoon. Experiments show that Halfmoon achieves 20%–40% lower latency and 1.5–4.0× lower logging overhead than the state-of-the-art solution Boki.
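
The selection criterion can be illustrated with a tiny sketch (a simple per-operation cost proxy, not Halfmoon's actual criterion): since each protocol logs only one side, the expected logging overhead tracks the count of the side that is logged, so a read-intensive workload should pick the write-logging protocol and vice versa.

```python
def choose_protocol(read_count, write_count, log_cost_per_op=1.0):
    """Pick which side to log: the write-logging protocol gives log-free reads
    (good for read-intensive workloads) and vice versa."""
    log_reads_cost = read_count * log_cost_per_op    # overhead if reads are logged
    log_writes_cost = write_count * log_cost_per_op  # overhead if writes are logged
    return "log-writes" if log_writes_cost <= log_reads_cost else "log-reads"

print(choose_protocol(read_count=900, write_count=100))   # -> "log-writes"
print(choose_protocol(read_count=80,  write_count=700))   # -> "log-reads"
```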

[SIGCOMM '23] Ditto: Efficient Serverless Analytics with Elastic Parallelism

  dblp   doi

Abstract

Serverless computing provides fine-grained resource elasticity for data analytics: a job can flexibly scale its resources for each stage, instead of sticking to a fixed pool of resources throughout its lifetime. Due to different data dependencies and different shuffling overheads caused by intra- and inter-server communication, the best degree of parallelism (DoP) for each stage varies with runtime conditions. We present Ditto, a job scheduler for serverless analytics that leverages fine-grained resource elasticity to optimize for job completion time (JCT) and cost. The key idea of Ditto is to use a new scheduling granularity, the stage group, to decouple parallelism configuration from function placement. Ditto bundles stages into stage groups based on their data dependencies and I/O characteristics. It exploits the parallelized time characteristics of the stages to determine the parallelism configuration, and prioritizes the placement of stage groups with large shuffling traffic, so that the stages in these groups can leverage zero-copy intra-server communication for efficient shuffling. We build a system prototype of Ditto and evaluate it with a variety of benchmarking workloads. Experimental results show that Ditto outperforms existing solutions by up to 2.5× on JCT and up to 1.8× on cost.

[SC '23] Rethinking Deployment for Serverless Functions: A Performance-First Perspective

  dblp   doi

Abstract

Serverless computing commonly adopts strong isolation mechanisms for deploying functions, which can bring significant performance overhead because each function needs to run in a completely new environment (i.e., the "one-to-one" model). To accelerate function computation, prior work has proposed sandbox sharing to reduce this overhead, i.e., the "many-to-one" model. Nonetheless, either process-based true parallelism or thread-based pseudo-parallelism still causes high latency, preventing its adoption for latency-sensitive web services. To achieve optimal performance and resource efficiency for serverless workflows, we argue for an "m-to-n" deployment model that manipulates multiple granularities of computing abstractions, such as processes, threads, and sandboxes, to amortize overhead. We propose wrap, a new deployment abstraction that balances the trade-offs between interaction overhead, startup overhead, and function execution. We further design Chiron, a wrap-based deployment manager that can automatically orchestrate multiple computing abstractions based on performance prioritization. Our comprehensive evaluation indicates that Chiron outperforms state-of-the-art systems by 1.3×-21.8× on system throughput.

[OSDI '23] No Provisioned Concurrency: Fast RDMA-codesigned Remote Fork for Serverless Computing

  dblp   doi

Abstract

Serverless platforms essentially face a tradeoff between container startup time and provisioned concurrency (i.e., cached instances), which is further exacerbated by the frequent need for remote container initialization. This paper presents MITOSIS, an operating system primitive that provides fast remote fork by exploiting a deep codesign of the OS kernel with RDMA. By leveraging the fast remote read capability of RDMA and partial state transfer across serverless containers, MITOSIS bridges the performance gap between local and remote container initialization. MITOSIS is the first to fork over 10,000 new containers from one instance across multiple machines within a second, while allowing the new containers to efficiently transfer the pre-materialized states of the forked one. We have implemented MITOSIS on Linux and integrated it with FN, a popular serverless platform. Under load spikes in real-world serverless workloads, MITOSIS reduces function tail latency by 89% with orders of magnitude lower memory usage. For serverless workflows that require state transfer, MITOSIS improves execution time by 86%.

[OSDI '23] Automated Verification of Idempotence for Stateful Serverless Applications

  dblp   doi

Abstract

Serverless computing has become a popular cloud computing paradigm. By default, when a serverless function fails, the serverless platform re-executes the function to tolerate the failure. However, such a retry-based approach requires functions to be idempotent, which means that functions should expose the same behavior regardless of retries. This requirement is challenging for developers, especially when functions are stateful. Failures may cause functions to repeatedly read and update shared states, potentially corrupting data consistency. This paper presents Flux, the first toolkit that automatically verifies the idempotence of serverless applications. It proposes a new correctness definition, idempotence consistency, which stipulates that a serverless function’s retry is transparent to users. To verify idempotence consistency, Flux defines a novel property, idempotence simulation, which decomposes the proof for a concurrent serverless application into the reasoning of individual functions. Furthermore, Flux extends existing verification techniques to realize automated reasoning, enabling Flux to identify idempotence-violating operations and fix them with existing log-based methods. We demonstrate the efficacy of Flux with 27 representative serverless applications. Flux has successfully identified previously unknown issues in 12 applications. Developers have confirmed 8 issues. Compared to state-of-the-art systems (namely Beldi and Boki) that log every operation, Flux achieves up to 6× lower latency and 10× higher peak throughput, as it logs only the identified idempotence-violating ones.
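
For intuition, here is a toy example (not Flux's code) of the kind of idempotence-violating operation such a tool flags, together with the log-based style of fix: a plain read-modify-write on shared state double-charges when the platform retries, whereas recording the outcome under the request id makes the retry observe the first execution's effect, matching the idempotence-consistency notion above.

```python
# In-memory stand-ins for an external datastore and an operation log.
balance = {"alice": 100}
op_log = {}

def charge_naive(request_id, user, amount):
    """Not idempotent: a platform retry applies the debit twice."""
    balance[user] -= amount

def charge_logged(request_id, user, amount):
    """Idempotent: the first execution records its effect under request_id,
    and any retry replays that record instead of re-updating the state."""
    if request_id in op_log:
        return op_log[request_id]
    balance[user] -= amount
    op_log[request_id] = balance[user]
    return op_log[request_id]

charge_naive("r1", "alice", 10); charge_naive("r1", "alice", 10)   # retry of r1
print(balance["alice"])   # 80: the retry corrupted the state

balance["alice"] = 100
charge_logged("r2", "alice", 10); charge_logged("r2", "alice", 10)  # retry of r2
print(balance["alice"])   # 90: the retry is transparent to users
```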

[NSDI '23] Following the Data, Not the Function: Rethinking Function Orchestration in Serverless Computing

  dblp   doi

Abstract

Serverless applications are typically composed of function workflows in which multiple short-lived functions are triggered to exchange data in response to events or state changes. Current serverless platforms coordinate and trigger functions by following high-level invocation dependencies but are oblivious to the underlying data exchanges between functions. This design is neither efficient nor easy to use in orchestrating complex workflows – developers often have to manage complex function interactions by themselves, with customized implementation and unsatisfactory performance.In this paper, we argue that function orchestration should follow a data-centric approach. In our design, the platform provides a data bucket abstraction to hold the intermediate data generated by functions. Developers can use a rich set of data trigger primitives to control when and how the output of each function should be passed to the next functions in a workflow. By making data consumption explicit and allowing it to trigger functions and drive the workflow, complex function interactions can be easily and efficiently supported. We present Pheromone – a scalable, low-latency serverless platform following this data-centric design. Compared to well-established commercial and open-source platforms, Pheromone cuts the latencies of function interactions and data exchanges by orders of magnitude, scales to large workflows, and enables easy implementation of complex applications.
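
A minimal sketch of what a data-trigger primitive could look like (the API names here are invented for illustration, not Pheromone's): a bucket collects named intermediate objects, and a registered trigger fires the downstream function once its condition over the bucket's contents holds, e.g., a fan-in that waits for all mapper outputs.

```python
class Bucket:
    """Toy data bucket: functions put intermediate objects here, and registered
    triggers decide when (and with what data) to invoke the next function."""

    def __init__(self):
        self.objects = {}
        self.triggers = []      # list of (condition, target_function)

    def register_trigger(self, condition, target):
        self.triggers.append((condition, target))

    def put(self, key, value):
        self.objects[key] = value
        for condition, target in self.triggers:
            if condition(self.objects):
                target(dict(self.objects))   # drive the workflow from the data

# Example: fan-in -- invoke `aggregate` only once both mappers have produced output.
def aggregate(objs):
    print("aggregate sees:", sorted(objs))

bucket = Bucket()
bucket.register_trigger(lambda objs: {"map-0", "map-1"} <= objs.keys(), aggregate)
bucket.put("map-0", [1, 2])      # nothing fires yet
bucket.put("map-1", [3, 4])      # condition met -> aggregate is invoked
```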

[NSDI '23] Doing More with Less: Orchestrating Serverless Applications without an Orchestrator

  dblp   doi

Abstract

Standalone orchestrators simplify the development of serverless applications by providing higher-level programming interfaces, coordinating function interactions, and ensuring exactly-once execution. However, they limit application flexibility and are expensive to use. We show that these specialized orchestration services are unnecessary. Instead, application-level orchestration, deployed as a library, can support the same programming interfaces, complex interactions, and execution guarantees, utilizing only basic serverless components that are already universally supported and billed on a fine-grained per-use basis. Furthermore, application-level orchestration affords applications more flexibility and reduces costs for both providers and users. To demonstrate this, we present Unum, an application-level serverless orchestration system. Unum introduces an intermediate representation that partitions higher-level application definitions at compile time and provides orchestration as a runtime library that executes in situ with user-defined FaaS functions. On unmodified serverless infrastructures, Unum functions coordinate and ensure correctness in a decentralized manner by leveraging strongly consistent data stores. Compared with AWS Step Functions, a state-of-the-art standalone orchestrator, our evaluation shows that Unum performs well, costs significantly less, and grants applications greater flexibility to employ application-specific patterns and optimizations. For a representative set of applications, Unum runs as much as 2x faster and costs 9x cheaper.
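
A minimal sketch of how library-level orchestration can get exactly-once continuation triggering from a strongly consistent store (an in-memory dict with put-if-absent stands in for e.g. a conditional write in a cloud datastore; names are illustrative, not Unum's API): whichever retry claims a step first invokes the next function, and every duplicate attempt becomes a no-op.

```python
import threading

class ConsistentStore:
    """Stand-in for a strongly consistent KV store offering put-if-absent."""
    def __init__(self):
        self._data, self._lock = {}, threading.Lock()

    def put_if_absent(self, key, value):
        with self._lock:
            if key in self._data:
                return False
            self._data[key] = value
            return True

store = ConsistentStore()

def run_step(workflow_id, step_name, payload, invoke_next):
    """Orchestration-as-a-library: claim the step atomically, then invoke the
    next function exactly once even if this function is retried."""
    if store.put_if_absent(f"{workflow_id}/{step_name}", "claimed"):
        invoke_next(payload)          # only the first claimant reaches this
    # duplicates simply return: the continuation was already triggered

calls = []
for _ in range(3):                    # simulate duplicate retries of the same step
    run_step("wf-42", "resize-done", {"img": "a.png"}, calls.append)
print(len(calls))                     # 1
```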

[MICRO '23] Memento: Architectural Support for Ephemeral Memory Management in Serverless Environments

  dblp   doi

Abstract

Serverless computing is an increasingly attractive paradigm in the cloud due to its ease of use and fine-grained pay-for-what-you-use billing. However, serverless computing poses new challenges to system design due to its short-lived function execution model. Our detailed analysis reveals that memory management is responsible for a major share of function execution cycles. This is because functions pay the full critical-path costs of memory management in both userspace and the operating system without the opportunity to amortize these costs over their short lifetimes. To address this problem, we propose Memento, a new hardware-centric memory management design based upon our insights that memory allocations in serverless functions are typically small, and either quickly freed after allocation or freed when the function exits. Memento alleviates the overheads of serverless memory management by introducing two key mechanisms: (i) a hardware object allocator that performs in-cache memory allocation and free operations based on arenas, and (ii) a hardware page allocator that manages a small pool of physical pages used to replenish arenas of the object allocator. Together these mechanisms alleviate memory management overheads and bypass costly userspace and kernel operations. Memento naturally integrates with existing software stacks through a set of ISA extensions that enable seamless integration with multiple language runtimes. Finally, Memento leverages the newly exposed memory allocation semantics in hardware to introduce a main memory bypass mechanism and avoid unnecessary DRAM accesses for newly allocated objects. We evaluate Memento with full-system simulations across a diverse set of containerized serverless workloads and language runtimes. The results show that Memento achieves function execution speedups ranging between 8–28% and 16% on average. Furthermore, Memento's hardware allocators and main memory bypass mechanism drastically reduce main memory traffic by 30% on average. The combined effects of Memento reduce the pricing cost of function execution by 29%. Finally, we demonstrate the applicability of Memento beyond functions, to major serverless platform operations and long-running data processing applications.

[ISCA '23] MXFaaS: Resource Sharing in Serverless Environments for Parallelism and Efficiency

  dblp   doi

Abstract

Although serverless computing is a popular paradigm, current serverless environments have high overheads. Recently, it has been shown that serverless workloads frequently exhibit bursts of invocations of the same function. Such a pattern is not handled well in current platforms. Supporting it efficiently can speed up serverless execution substantially. In this paper, we target this dominant pattern with a new serverless platform design named MXFaaS. MXFaaS improves function performance by efficiently multiplexing (i.e., sharing) processor cycles, I/O bandwidth, and memory/processor state between concurrently executing invocations of the same function. MXFaaS introduces a new container abstraction called MXContainer. To enable efficient use of processor cycles, an MXContainer carefully helps schedule same-function invocations for minimal response time. To enable efficient use of I/O bandwidth, an MXContainer coalesces remote storage accesses and remote function calls from same-function invocations. Finally, to enable efficient use of memory/processor state, an MXContainer first initializes the state of its container and only later, on demand, spawns a process per function invocation, so that all invocations can share unmodified memory state and hence minimize the memory footprint. We implement MXFaaS in two serverless platforms and run diverse serverless benchmarks. With MXFaaS, serverless environments are much more efficient. Compared to a state-of-the-art serverless environment, MXFaaS on average speeds up execution by 5.2×, reduces P99 tail latency by 7.4×, and improves throughput by 4.8×. In addition, it reduces average memory usage by 3.4×.

[INFOCOM '23] DisProTrack: Distributed Provenance Tracking over Serverless Applications

  dblp   doi

Abstract

Provenance tracking has been widely used in the recent literature to debug system vulnerabilities and find the root causes behind faults, errors, or crashes in a running system. However, existing approaches primarily developed graph-based models for provenance tracking over monolithic applications running directly on the operating system kernel. In contrast, the modern DevOps-based service-oriented architecture relies on distributed platforms, like serverless computing, that use container-based sandboxing over the kernel. Provenance tracking over such a distributed micro-service architecture is challenging, as the application and system logs are generated asynchronously and follow heterogeneous nomenclature and logging formats. This paper develops a novel approach to combining system and micro-service logs to generate a Universal Provenance Graph (UPG) that can be used for provenance tracking over serverless architectures. We develop a Loadable Kernel Module (LKM) for runtime unit identification over the logs by intercepting system calls with help from control flow graphs over the static application binaries. Finally, we design a regular expression-based log optimization method for reverse query parsing over the generated UPG. A thorough evaluation of the proposed UPG model with different benchmarked serverless applications shows the system's effectiveness.

[INFOCOM '23] Enabling Age-Aware Big Data Analytics in Serverless Edge Clouds

  dblp   doi

Abstract

With the fast development of artificial intelligence applications, large volumes of big data generated at the edge of networks await real-time analysis so that valuable information can be unveiled. Developers of big data analytics applications usually face the burden of managing the underlying cloud resources, which greatly slows analytic development. Serverless computing is envisioned as an enabling technology to relieve developers of this management burden and to enable agile big data analytics. That is, big data analytics can be implemented in short-lived functions via the Function-as-a-Service (FaaS) programming paradigm. In this paper, we aim to fill the gap between serverless computing and mobile edge computing by enabling query evaluation for big data analytics in short-lived functions of a serverless edge cloud (SEC). Specifically, we formulate novel age-aware big data query evaluation problems in an SEC so that the age of data is minimized, where the age of data is defined as the time difference between the current time and the generation time of the dataset. We propose approximation algorithms for the age-aware big data query evaluation problem with a single query, using a novel parameterized virtualization technique that strives for a fine trade-off between short-lived functions and the large resource demands of big data queries. We also devise an online learning algorithm with bounded regret for the problem with multiple queries arriving dynamically and without prior knowledge of the queries' resource demands. We finally evaluate the performance of the proposed algorithms by extensive simulations. Simulation results show that the performance of our algorithms is promising.

[INFOCOM '23] On Efficient Zygote Container Planning toward Fast Function Startup in Serverless Edge Cloud

  dblp   doi

Abstract

Container cold startup is regarded as a crucial problem for the performance of serverless computing, especially in resource-constrained edge clouds. Pre-warming hot containers has been proven an effective solution but comes at the expense of high memory consumption. Instead of pre-warming a complete container for a function, recent studies advocate the Zygote container, which pre-imports some packages and imports the remaining dependent packages at runtime, so as to avoid the cold startup problem. However, as different functions have different package dependencies, how to plan Zygote generation and pre-warming in a resource-constrained edge cloud becomes a critical challenge. In this paper, aiming to minimize the overall function startup time subject to resource capacity constraints, we formulate this problem as a Quadratic Integer Programming (QIP) problem. We further propose a Randomized Rounding based Zygote Planning (RRZP) algorithm. The efficiency of our algorithm is proven via both theoretical analysis and trace-driven simulations. The results show that our algorithm can significantly reduce the startup time by 25.6%.

[INFOCOM '23] Online Container Scheduling for Data-intensive Applications in Serverless Edge Computing

  dblp   doi

Abstract

Introducing the emerging serverless paradigm into edge computing could avoid over- and under-provisioning of limited edge resources and make complex edge resource management transparent to application developers, which largely facilitates the cost-effectiveness, portability, and short time-to-market of edge applications. However, the computation/data dispersion and device/network heterogeneity of edge environments prevent current serverless computing platforms from acclimating to the network edge. In this paper, we address such challenges by formulating a container placement and data flow routing problem, which fully considers the heterogeneity of edge networks and the overhead of operating serverless platforms on resource-limited edge servers. We design an online algorithm to solve the problem. We further show its local optimum for each arriving container and prove its theoretical guarantee to the optimal offline solution. We also conduct extensive simulations based on practical experiment results to show the advantages of the proposed algorithm over existing baselines.

[INFOCOM '23] Time and Cost-Efficient Cloud Data Transmission based on Serverless Computing Compression

  dblp   doi

Abstract

Nowadays, there is substantial demand for cross-region data transmission in the cloud. It is promising to use serverless computing to compress data and thus reduce the amount of data transmitted. However, it is challenging to estimate the data transmission time and monetary cost with serverless compression. In addition, minimizing the data transmission cost is non-trivial due to the enormous parameter space and joint optimization. This paper focuses on this problem and makes the following contributions: (1) We propose empirical data transmission time and monetary cost models based on serverless compression. (2) For single-task cloud data transmission, we propose two efficient parameter search methods based on Sequential Quadratic Programming (SQP) and Eliminate then Divide and Conquer (EDC), which are theoretically proven with error upper bounds. (3) Furthermore, for multi-task cloud data transmission, a parameter search method based on dynamic programming and numerical computation is proposed to reduce the algorithm complexity from exponential to linear. We have implemented the entire system and evaluated it with various workloads and application cases on the real-world AWS serverless computing platform. Experimental results on a cross-region public cloud show that the proposed approach can improve parameter search efficiency by more than 3× compared with state-of-the-art parameter search methods and achieves better parameter quality. Compared with other competing cloud data transmission approaches, our approach achieves higher time efficiency and lower monetary cost.

[INFOCOM '23] EAVS: Edge-assisted Adaptive Video Streaming with Fine-grained Serverless Pipelines

  dblp   doi

Abstract

Recent years have witnessed video streaming gradually evolve into one of the most popular Internet applications. With the rapidly growing personalized demand for real-time video streaming services, maximizing their Quality of Experience (QoE) is a long-standing challenge. The emergence of the serverless computing paradigm has the potential to meet this challenge through its fine-grained management and highly parallel computing structures. However, it remains unclear how to implement and configure serverless components to optimize video streaming services. In this paper, we propose EAVS, an Edge-assisted Adaptive Video streaming system with Serverless pipelines, which facilitates fine-grained management for multiple concurrent video transmission pipelines. Then, we design a chunk-level optimization scheme to address video bitrate adaptation. We propose a Deep Reinforcement Learning (DRL) algorithm based on Proximal Policy Optimization (PPO) with a trinal-clip mechanism to make bitrate decisions efficiently for better QoE. Finally, we implement the serverless video streaming system prototype and evaluate the performance of EAVS on various real-world network traces. Our results show that EAVS significantly improves QoE and reduces the video stall rate, achieving over 9.1% QoE improvement and 60.2% latency reduction compared to state-of-the-art solutions.

[HPCA '23] SpecFaaS: Accelerating Serverless Applications with Speculative Function Execution

  dblp   doi

Abstract

Serverless computing has emerged as a popular cloud computing paradigm. Serverless environments are convenient to users and efficient for cloud providers. However, they can induce substantial application execution overheads, especially in applications with many functions. In this paper, we propose to accelerate serverless applications with a novel approach based on software-supported speculative execution of functions. Our proposal is termed Speculative Function-as-a-Service (SpecFaaS). It is inspired by out-of-order execution in modern processors, and is grounded in a characterization analysis of FaaS applications. In SpecFaaS, functions in an application are executed early, speculatively, before their control and data dependences are resolved. Control dependences are predicted like in pipeline branch prediction, and data dependences are speculatively satisfied with memoization. With this support, the execution of downstream functions is overlapped with that of upstream functions, substantially reducing the end-to-end execution time of applications. We prototype SpecFaaS on Apache OpenWhisk, an open-source serverless computing platform. For a set of applications in a warmed-up environment, SpecFaaS attains an average speedup of 4.6×. Further, on average, the application throughput increases by 3.9× and the tail latency decreases by 58.7%.
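
As a rough illustration of the idea (not SpecFaaS itself), the Python sketch below speculatively launches a downstream function using a memoized prediction of the upstream function's output, validates the prediction, and falls back to re-execution on mis-speculation. The two-function chain, the memo table, and all values are hypothetical.

```python
import concurrent.futures as cf

# Hypothetical two-function chain: f1's output is f2's input.
def f1(x):
    return x * 2

def f2(y):
    return y + 1

MEMO = {"f1": {}}  # past input -> output, used to predict f1's result

def run_chain_speculatively(x, executor):
    predicted = MEMO["f1"].get(x)
    # Start f2 early on the predicted value, overlapping it with f1's execution.
    spec = executor.submit(f2, predicted) if predicted is not None else None
    actual = f1(x)
    MEMO["f1"][x] = actual
    if spec is not None and predicted == actual:
        return spec.result()          # speculation validated: reuse overlapped work
    if spec is not None:
        spec.cancel()                 # mis-speculation: discard and re-execute
    return f2(actual)

with cf.ThreadPoolExecutor() as ex:
    print(run_chain_speculatively(3, ex))  # cold: no prediction, runs sequentially
    print(run_chain_speculatively(3, ex))  # warm: f2 starts before f1 finishes
```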

[EUROSYS '23] Palette Load Balancing: Locality Hints for Serverless Functions

  dblp   doi

Abstract

Function-as-a-Service (FaaS) serverless computing enables a simple programming model with almost unbounded elasticity. Unfortunately, current FaaS platforms achieve this flexibility at the cost of lower performance for data-intensive applications compared to a serverful deployment. The ability to have computation close to data is a key missing feature. We introduce Palette load balancing, which offers FaaS applications a simple mechanism to express locality to the platform, through hints we term "colors". Palette maintains the serverless nature of the service - users are still not allocating resources - while allowing the platform to place successive invocations related to each other on the same executing node. We compare a prototype of the Palette load balancer to a state-of-the-art locality-oblivious load balancer on representative examples of three applications. For a serverless web application with a local cache, Palette improves the hit ratio by 6x. For a serverless version of Dask, Palette improves run times by 46% and 40% on Task Bench and TPC-H, respectively. On a serverless version of NumS, Palette improves run times by 37%. These improvements largely bridge the gap to serverful implementations of the same systems.
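
As a rough illustration (not Palette's implementation), the sketch below routes invocations by hashing a caller-supplied "color" hint onto a consistent-hashing ring, so invocations sharing a color land on the same node and can reuse its local state; node names and the hashing scheme are assumptions.

```python
import hashlib
from bisect import bisect_right

class ColorAwareBalancer:
    """Toy load balancer: invocations sharing a 'color' hint land on the same node."""

    def __init__(self, nodes, vnodes=64):
        # Build a consistent-hashing ring so adding/removing nodes moves few colors.
        self.ring = sorted(
            (self._hash(f"{n}#{i}"), n) for n in nodes for i in range(vnodes)
        )
        self.keys = [k for k, _ in self.ring]

    @staticmethod
    def _hash(s):
        return int(hashlib.md5(s.encode()).hexdigest(), 16)

    def route(self, function, color=None):
        # Uncolored invocations fall back to hashing the function name alone.
        key = self._hash(color if color is not None else function)
        idx = bisect_right(self.keys, key) % len(self.ring)
        return self.ring[idx][1]

balancer = ColorAwareBalancer(["node-a", "node-b", "node-c"])
# Two invocations with the same color hit the same node, warming its local cache.
print(balancer.route("render_thumbnail", color="user-42"))
print(balancer.route("render_thumbnail", color="user-42"))
```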

[EUROSYS '23] With Great Freedom Comes Great Opportunity: Rethinking Resource Allocation for Serverless Functions

  dblp   doi

Abstract

Current serverless offerings give users limited flexibility for configuring the resources allocated to their function invocations. This simplifies the interface for users to deploy serverless computations but creates deployments that are resource inefficient. In this paper, we take a principled approach to the problem of resource allocation for serverless functions, analyzing the effects of automating this choice in a way that leads to the best combination of performance and cost. In particular, we systematically explore the opportunities that come with decoupling memory and CPU resource allocations and also enabling the use of different VM types, and we find a rich trade-off space between performance and cost. The provider can use this in a number of ways, e.g., exposing all these parameters to the user; eliding preferences for performance and cost from users and simply offering the same performance at lower cost; or exposing a small number of choices for users to trade performance for cost. Our results show that, by decoupling memory and CPU allocation, there is the potential to have up to 40% lower execution cost than the preset coupled configurations that are the norm in current serverless offerings. Similarly, making the correct choice of VM instance type can provide up to 50% better execution time. Furthermore, we demonstrate that providers have the flexibility to choose different instance types for the same functions to maximize resource utilization while providing performance within 10--20% of the best resource configuration for each respective function.
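
To make the trade-off space concrete, here is a hedged sketch of the kind of search a provider could run over decoupled memory/CPU allocations and VM types: enumerate profiled configurations and pick the cheapest one that meets an optional latency target. The profile table and prices are invented for illustration and are not the paper's measurements.

```python
# Hypothetical offline profile: (vm_type, cpu_cores, memory_mb) -> measured latency (s).
PROFILE = {
    ("general", 1, 512): 2.4, ("general", 2, 512): 1.4, ("general", 2, 1024): 1.3,
    ("compute", 1, 512): 1.8, ("compute", 2, 512): 1.0, ("compute", 2, 1024): 0.95,
}
# Illustrative per-second prices for cores and memory on each VM type.
PRICE = {"general": {"core": 0.000020, "mb": 0.00000002},
         "compute": {"core": 0.000034, "mb": 0.00000002}}

def invocation_cost(cfg, latency_s):
    vm, cores, mem = cfg
    rate = cores * PRICE[vm]["core"] + mem * PRICE[vm]["mb"]
    return rate * latency_s

def best_config(latency_slo_s=None):
    # Keep only profiled configurations meeting the SLO, then take the cheapest.
    feasible = [(cfg, lat) for cfg, lat in PROFILE.items()
                if latency_slo_s is None or lat <= latency_slo_s]
    return min(feasible, key=lambda p: invocation_cost(*p), default=None)

print(best_config())                   # cheapest configuration overall
print(best_config(latency_slo_s=1.2))  # cheapest configuration meeting a 1.2 s target
```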

[ASPLOS '23] ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning

  dblp   doi

Abstract

This paper proposes ElasticFlow, an elastic serverless training platform for distributed deep learning. ElasticFlow provides a serverless interface with two distinct features: (i) users specify only the deep neural network (DNN) model and hyperparameters for a job, but not the number of GPUs; (ii) users specify the deadline for a job, but not the amount of time to occupy GPUs. In contrast to existing server-centric platforms, ElasticFlow provides performance guarantees in terms of meeting deadlines while alleviating tedious, low-level, and manual resource management for deep learning developers. The characteristics of distributed training introduce two challenges. First, the training throughput scales non-linearly with the number of GPUs. Second, the scaling efficiency is affected by worker placement. To address these challenges, we propose Minimum Satisfactory Share to capture the resource usage of training jobs to meet deadlines, and ElasticFlow performs admission control based on it. We develop a greedy algorithm that dynamically allocates resources to admitted jobs based on diminishing returns. We apply buddy allocation to worker placement to eliminate the effect of topology. Evaluation results on a cluster of 128 GPUs show that ElasticFlow increases the number of jobs that can meet their deadlines by 1.46–7.65× compared to existing solutions.
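
A minimal sketch of the admission-control idea under an assumed (hypothetical) non-linear scaling profile: find the smallest GPU share that still meets the deadline and admit the job only if that share is available. The numbers are illustrative, not ElasticFlow's.

```python
# Hypothetical non-linear scaling profile: GPU count -> training throughput (samples/s).
THROUGHPUT = {1: 100, 2: 190, 4: 350, 8: 600, 16: 950}

def minimum_satisfactory_share(remaining_samples, seconds_to_deadline):
    """Smallest profiled GPU count that still finishes before the deadline, or None."""
    for gpus in sorted(THROUGHPUT):
        if remaining_samples / THROUGHPUT[gpus] <= seconds_to_deadline:
            return gpus
    return None

def admit(job, free_gpus):
    # Admit only if the job's minimum satisfactory share fits in what is free.
    share = minimum_satisfactory_share(job["samples"], job["deadline_s"])
    return share is not None and share <= free_gpus

job = {"samples": 1_000_000, "deadline_s": 3600}
print(minimum_satisfactory_share(job["samples"], job["deadline_s"]))  # 4 for this profile
print(admit(job, free_gpus=2))                                        # False: would miss deadline
```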

[ASPLOS '23] DataFlower: Exploiting the Data-flow Paradigm for Serverless Workflow Orchestration

  dblp   doi

Abstract

Serverless computing that runs functions with auto-scaling is a popular task execution pattern in the cloud-native era. By connecting serverless functions into workflows, tenants can achieve complex functionality. Prior research adopts the control-flow paradigm to orchestrate a serverless workflow. However, the control-flow paradigm inherently results in long response latency, due to the heavy data persistence overhead, sequential resource usage, and late function triggering. Our investigation shows that the data-flow paradigm has the potential to resolve the above problems, with careful design and optimization. We propose DataFlower, a scheme that achieves the data-flow paradigm for serverless workflows. In DataFlower, a container is abstracted to be a function logic unit and a data logic unit. The function logic unit runs the functions, and the data logic unit handles the data transmission asynchronously. Moreover, a host-container collaborative communication mechanism is used to support efficient data transfer. Our experimental results show that compared to state-of-the-art serverless designs, DataFlower reduces the 99%-ile latency of the benchmarks by up to 35.4%, and improves the peak throughput by up to 3.8X.

[ASPLOS '23] Flame: A Centralized Cache Controller for Serverless Computing

  dblp   doi

Abstract

Caching functions is a promising way to mitigate cold-start overhead in serverless computing. However, as caching also increases the resource cost significantly, how to make caching decisions is still challenging. We find that the prior "local cache control" designs are insufficient to achieve high cache efficiency due to the workload skewness across servers. In this paper, inspired by the idea of software-defined network management, we propose Flame, an efficient cache system to manage cached functions with a "centralized cache control" design. By decoupling the cache control plane from local servers and setting up a separate centralized controller, Flame is able to make caching decisions considering a global view of cluster status, enabling an optimized cache-hit ratio and resource efficiency. We evaluate Flame with real-world workloads and the evaluation results show that it can reduce the cache resource usage by 36% on average while improving the cold-start ratio by nearly 7x compared to the state-of-the-art method.
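
As a toy illustration of centralized cache control (not Flame's algorithm), the sketch below ranks functions cluster-wide by cold-start cost avoided per MB of cache and packs them onto servers greedily; all rates, sizes, and costs are made up.

```python
def plan_cached_functions(servers, invocation_rate, mem_need, init_cost):
    """Toy centralized controller: rank functions cluster-wide by cold-start cost
    avoided per MB of cache, then pack them onto servers greedily."""
    ranked = sorted(invocation_rate,
                    key=lambda f: invocation_rate[f] * init_cost[f] / mem_need[f],
                    reverse=True)
    free = dict(servers)  # server -> free cache memory (MB)
    placement = {}
    for func in ranked:
        # Put the function on the server with the most free memory that still fits it.
        target = max((s for s in free if free[s] >= mem_need[func]),
                     key=lambda s: free[s], default=None)
        if target is not None:
            free[target] -= mem_need[func]
            placement.setdefault(target, []).append(func)
    return placement

servers = {"s1": 1024, "s2": 512}                  # cache capacity per server (MB)
rate = {"auth": 50, "thumb": 5, "report": 0.2}     # invocations per minute
mem = {"auth": 256, "thumb": 512, "report": 512}   # cached-container size (MB)
cold = {"auth": 0.3, "thumb": 1.2, "report": 2.5}  # cold-start cost (s)
print(plan_cached_functions(servers, rate, mem, cold))
```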

[ASPLOS '23] λFS: A Scalable and Elastic Distributed File System Metadata Service using Serverless Functions

  dblp   doi

Abstract

The metadata service (MDS) sits on the critical path for distributed file system (DFS) operations, and therefore it is key to the overall performance of a large-scale DFS. Common "serverful" MDS architectures, such as a single server or cluster of servers, have a significant shortcoming: either they are not scalable, or they make it difficult to achieve an optimal balance of performance, resource utilization, and cost. A modern MDS requires a novel architecture that addresses this shortcoming. To this end, we design and implement λFS, an elastic, high-performance metadata service for large-scale DFSes. λFS scales a DFS metadata cache elastically on a FaaS (Function-as-a-Service) platform and synthesizes a series of techniques to overcome the obstacles that are encountered when building large, stateful, and performance-sensitive applications on FaaS platforms. λFS takes full advantage of the unique benefits offered by FaaS---elastic scaling and massive parallelism---to realize a highly-optimized metadata service capable of sustaining up to 4.13X higher throughput, 90.40% lower latency, 85.99% lower cost, 3.33X better performance-per-cost, and better resource utilization and efficiency than a state-of-the-art DFS for an industrial workload.

[USENIX '22] RunD: A Lightweight Secure Container Runtime for High-density Deployment and High-concurrency Startup in Serverless Computing

  dblp   doi

Abstract

Secure containers, each hosting a single container in a micro virtual machine (microVM), are now used in serverless computing, as the containers are isolated through the microVMs. There are high demands for high-density container deployment and high-concurrency container startup to improve both resource utilization and user experience, as user functions are fine-grained in serverless platforms. Our investigation shows that the entire software stack, comprising the cgroups in the host operating system, the guest operating system, and the container rootfs for the function workload, results in low deployment density and slow startup performance at high concurrency. We therefore propose and implement a lightweight secure container runtime, named RunD, to resolve the above problems through a holistic guest-to-host solution. With RunD, over 200 secure containers can be started in a second, and over 2500 secure containers can be deployed on a node with 384GB of memory. RunD has been adopted as the Alibaba serverless container runtime to support high-density deployment and high-concurrency startup.

[USENIX '22] Help Rather Than Recycle: Alleviating Cold Startup in Serverless Computing Through Inter-Function Container Sharing

  dblp   doi

Abstract

In serverless computing, each function invocation is executed in a container (or a virtual machine), and container cold startup results in long response latency. We observe that some functions suffer from cold container startup while the warm containers of other functions sit idle. Based on this observation, rather than booting a new container for a function from scratch, we propose to alleviate cold startup by re-purposing a warm but idle container from another function. We implement a container management scheme, named Pagurus, to this end. Pagurus comprises an intra-function manager that converts an idle warm container into a container that other functions can use without introducing additional security issues, an inter-function scheduler for scheduling containers between functions, and a sharing-aware function balancer at the cluster level for balancing the workload across different nodes. Experiments using Azure serverless traces show that Pagurus alleviates 84.6% of cold startups, and the cold startup latency is reduced from hundreds of milliseconds to 16 milliseconds when alleviated.
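
A minimal sketch of the borrowing idea (not Pagurus's actual manager): when a function has no warm container of its own, hand it an idle warm container lent by another function instead of cold-starting. The pool structure and timing constants below are assumptions.

```python
import time

class ContainerPool:
    """Toy pool: prefer this function's own warm containers, then re-purpose an
    idle warm container lent by another function, and only then cold-start."""

    COLD_START_S = 0.5   # illustrative cold-start penalty
    REPURPOSE_S = 0.016  # illustrative cost of specializing a lender container

    def __init__(self):
        self.idle = {}  # function name -> list of idle warm containers

    def release(self, func, container):
        self.idle.setdefault(func, []).append(container)

    def acquire(self, func):
        if self.idle.get(func):
            return self.idle[func].pop(), 0.0
        for other, containers in self.idle.items():
            if other != func and containers:
                return containers.pop(), self.REPURPOSE_S
        return f"new-{func}-{time.time():.0f}", self.COLD_START_S

pool = ContainerPool()
pool.release("resize_image", "ctr-1")
print(pool.acquire("transcode_video"))  # borrows ctr-1 instead of cold-starting
```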

[USENIX '22] Tetris: Memory-efficient Serverless Inference through Tensor Sharing

  dblp   doi

Abstract

Executing complex, memory-intensive deep learning inference services poses a major challenge for serverless computing frameworks, which must densely deploy and maintain inference models at high throughput. We observe an excessive memory consumption problem in serverless inference systems due to large models and high data redundancy. We present Tetris, a serverless platform catered to inference services with an order of magnitude lower memory footprint. Tetris’s design carefully considers the extensive memory sharing of runtimes and tensors. It supports minimizing the runtime redundancy through a combined optimization of batching and concurrent execution and eliminates tensor redundancy across instances from either the same or different functions using a lightweight and safe tensor mapping mechanism. Our comprehensive evaluation demonstrates that Tetris saves up to 93% memory footprint for inference services, and increases the function density by 30× without impairing the latency.

[SIGCOMM '22] SPRIGHT: extracting the server from serverless computing! high-performance eBPF-based event-driven, shared-memory processing

  dblp   doi

Abstract

Serverless computing promises an efficient, low-cost compute capability in cloud environments. However, existing solutions, epitomized by open-source platforms such as Knative, include heavyweight components that undermine this goal of serverless computing. Additionally, such serverless platforms lack dataplane optimizations to achieve efficient, high-performance function chains that facilitate the popular microservices development paradigm. Their use of unnecessarily complex and duplicate capabilities for building function chains severely degrades performance. 'Cold-start' latency is another deterrent. We describe SPRIGHT, a lightweight, high-performance, responsive serverless framework. SPRIGHT exploits shared memory processing and dramatically improves the scalability of the dataplane by avoiding unnecessary protocol processing and serialization-deserialization overheads. SPRIGHT extensively leverages event-driven processing with the extended Berkeley Packet Filter (eBPF). We creatively use eBPF's socket message mechanism to support shared memory processing, with overheads being strictly load-proportional. Compared to constantly-running, polling-based DPDK, SPRIGHT achieves the same dataplane performance with 10× less CPU usage under realistic workloads. Additionally, eBPF benefits SPRIGHT, by replacing heavyweight serverless components, allowing us to keep functions 'warm' with negligible penalty. Our preliminary experimental results show that SPRIGHT achieves an order of magnitude improvement in throughput and latency compared to Knative, while substantially reducing CPU usage, and obviates the need for 'cold-start'.

[SC '22] DayDream: Executing Dynamic Scientific Workflows on Serverless Platforms with Hot Starts

  dblp   doi

Abstract

HPC applications are increasingly being designed as dynamic workflows for the ease of development and scaling. This work demonstrates how the serverless computing model can be leveraged for efficient execution of complex, real-world scientific workflows, although serverless computing was not originally designed for executing scientific workflows. This work characterizes, quantifies, and improves the execution of three real-world, complex, dynamic scientific workflows: ExaFEL (workflow for investigating molecular structures via X-ray diffraction), CosmoScout VR (workflow for large-scale virtual reality simulation), and Core Cosmology Library (a cosmology workflow for investigating dark matter). The proposed technique, DayDream, employs the hot start mechanism for warming up the components of the workflows by decoupling the runtime environment from the component function code to mitigate cold start overhead. DayDream optimizes the service time and service cost jointly to reduce the service time by 45% and service cost by 23% over the state-of-the-art HPC workload manager.

[SC '22] SFS: Smart OS Scheduling for Serverless Functions

  dblp   doi

Abstract

Serverless computing enables a new way of building and scaling cloud applications by allowing developers to write fine-grained serverless or cloud functions. The execution duration of a cloud function is typically short, ranging from a few milliseconds to hundreds of seconds. However, due to resource contentions caused by public clouds' deep consolidation, the function execution duration may get significantly prolonged and fail to accurately account for the function's true resource usage. We observe that the function duration can be highly unpredictable with huge amplification of more than 50× for an open-source FaaS platform (OpenLambda). Our experiments show that the OS scheduling policy of cloud functions' host server can have a crucial impact on performance. The default Linux scheduler, CFS (Completely Fair Scheduler), being oblivious to workloads, frequently context-switches short functions, causing a turnaround time that is much longer than their service time. We propose SFS (Smart Function Scheduler), which works entirely in the user space and carefully orchestrates existing Linux FIFO and CFS schedulers to approximate Shortest Remaining Time First (SRTF). SFS uses two-level scheduling that seamlessly combines a new FILTER policy with Linux CFS, to trade off increased duration of long functions for significant performance improvement for short functions. We implement SFS in the Linux user space and port it to OpenLambda. Evaluation results show that SFS significantly improves short functions' duration with a small impact on relatively longer functions, compared to CFS.
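
A toy sketch of the general two-level idea (not SFS's implementation): newly arrived invocations run in an express FIFO class for a small budget and are demoted to a fair-share class once they exceed it, so short functions finish quickly while long ones fall back to fair sharing. The budget and bookkeeping are illustrative assumptions.

```python
FILTER_BUDGET_S = 0.1  # illustrative: time a new invocation may run at FIFO priority

class TwoLevelScheduler:
    """Toy two-level policy: fresh invocations run in a FIFO 'express' class for a
    small budget; anything that exceeds it is demoted to a fair-share class,
    roughly approximating shortest-remaining-time-first for short functions."""

    def __init__(self):
        self.express, self.fair = [], []

    def admit(self, invocation_id):
        self.express.append({"id": invocation_id, "ran_s": 0.0})

    def pick_next(self):
        # Express (FIFO) tasks always run ahead of fair-share tasks.
        return self.express[0] if self.express else (self.fair[0] if self.fair else None)

    def account(self, task, ran_s, finished):
        task["ran_s"] += ran_s
        if finished:
            (self.express if task in self.express else self.fair).remove(task)
        elif task in self.express and task["ran_s"] >= FILTER_BUDGET_S:
            self.express.remove(task)
            self.fair.append(task)  # long function: demote to the fair-share class

sched = TwoLevelScheduler()
sched.admit("short-fn"); sched.admit("long-fn")
t = sched.pick_next(); sched.account(t, 0.03, finished=True)   # short fn completes fast
t = sched.pick_next(); sched.account(t, 0.12, finished=False)  # long fn exceeds budget
print([x["id"] for x in sched.express], [x["id"] for x in sched.fair])
```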

[PPOPP '22] Mashup: making serverless computing useful for HPC workflows via hybrid execution

  dblp   doi

Abstract

This work introduces Mashup, a novel strategy that leverages the serverless computing model to execute scientific workflows in a hybrid fashion by taking advantage of both the traditional VM-based cloud computing platform and the emerging serverless platform. Mashup outperforms the state-of-the-art workflow execution engines by an average of 34% and 43% in terms of execution time reduction and cost reduction, respectively, for widely-used HPC workflows on the Amazon Cloud platform (EC2 and Lambda).

[OSDI '22] ORION and the Three Rights: Sizing, Bundling, and Prewarming for Serverless DAGs

  dblp   doi

Abstract

Serverless applications represented as DAGs have been growing in popularity. For many of these applications, it would be useful to estimate the end-to-end (E2E) latency and to allocate resources to individual functions so as to meet probabilistic guarantees for the E2E latency. This goal has not been met till now due to three fundamental challenges. The first is the high variability and correlation in the execution time of individual functions, the second is the skew in execution times of the parallel invocations, and the third is the incidence of cold starts. In this paper, we introduce ORION to achieve these goals. We first analyze traces from a production FaaS infrastructure to identify three characteristics of serverless DAGs. We use these to motivate and design three features. The first is a performance model that accounts for runtime variabilities and dependencies among functions in a DAG. The second is a method for co-locating multiple parallel invocations within a single VM thus mitigating content-based skew among these invocations. The third is a method for pre-warming VMs for subsequent functions in a DAG with the right look-ahead time. We integrate these three innovations and evaluate ORION on AWS Lambda with three serverless DAG applications. Our evaluation shows that compared to three competing approaches, ORION achieves up to 90% lower P95 latency without increasing $ cost, or up to 53% lower $ cost without increasing tail latency.
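
A minimal sketch of the prewarming arithmetic (a simplification of one of ORION's three "rights"): start warming each downstream function's VM one cold-start duration before its predicted invocation time, never earlier than the DAG's start. The offsets and cold-start constant are hypothetical.

```python
COLD_START_S = 0.8  # illustrative time to boot and initialize a VM for a function

def prewarm_schedule(dag_start, predicted_offsets):
    """For each downstream function, fire the prewarm COLD_START_S before its
    predicted invocation time (but never before the DAG itself starts)."""
    return {fn: max(dag_start, dag_start + offset - COLD_START_S)
            for fn, offset in predicted_offsets.items()}

# Hypothetical per-function offsets (seconds after the DAG is triggered).
offsets = {"extract": 0.0, "transform": 2.1, "aggregate": 5.4}
print(prewarm_schedule(dag_start=100.0, predicted_offsets=offsets))
```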

[ISCA '22] Lukewarm serverless functions: characterization and optimization

  dblp   doi

Abstract

Serverless computing has emerged as a widely-used paradigm for running services in the cloud. In serverless, developers organize their applications as a set of functions, which are invoked on-demand in response to events, such as an HTTP request. To avoid long start-up delays of launching a new function instance, cloud providers tend to keep recently-triggered instances idle (or warm) for some time after the most recent invocation in anticipation of future invocations. Thus, at any given moment on a server, there may be thousands of warm instances of various functions whose executions are interleaved in time based on incoming invocations. This paper observes that (1) there is a high degree of interleaving among warm instances on a given server; (2) the individual warm functions are invoked relatively infrequently, often at the granularity of seconds or minutes; and (3) many function invocations complete within a few milliseconds. Interleaved execution of rarely invoked functions on a server leads to thrashing of each function's microarchitectural state between invocations. Meanwhile, the short execution time of a function impedes amortization of the warm-up latency of the cache hierarchy, causing a 31--114% increase in CPI compared to execution with warm microarchitectural state. We identify on-chip misses for instructions as a major contributor to the performance loss. In response we propose Jukebox, a record-and-replay instruction prefetcher specifically designed for reducing the start-up latency of warm function instances. Jukebox requires just 32KB of metadata per function instance and boosts performance by an average of 18.7% for a wide range of functions, which translates into a corresponding throughput improvement.

[ISCA '22] HiveMind: a hardware-software system stack for serverless edge swarms

  dblp   doi

Abstract

Swarms of autonomous devices are increasing in ubiquity and size, making the need for rethinking their hardware-software system stack critical. We present HiveMind, the first swarm coordination platform that enables programmable execution of complex task workflows between cloud and edge resources in a performant and scalable manner. HiveMind is a software-hardware platform that includes a domain-specific language to simplify programmability of cloud-edge applications, a program synthesis tool to automatically explore task placement strategies, a centralized controller that leverages serverless computing to elastically scale cloud resources, and a reconfigurable hardware acceleration fabric for network and remote memory accesses. We design and build the full end-to-end HiveMind system on two real edge swarms comprised of drones and robotic cars. We quantify the opportunities and challenges serverless introduces to edge applications, as well as the trade-offs between centralized and distributed coordination. We show that HiveMind achieves significantly better performance predictability and battery efficiency compared to existing centralized and decentralized platforms, while also incurring lower network traffic. Using both real systems and a validated simulator we show that HiveMind can scale to thousands of edge devices without sacrificing performance or efficiency, demonstrating that centralized platforms can be both scalable and performant.

[INFOCOM '22] Retention-Aware Container Caching for Serverless Edge Computing

  dblp   doi

Abstract

Serverless edge computing adopts an event-based model where Internet-of-Things (IoT) services are executed in lightweight containers only when requested, leading to significantly improved edge resource utilization. Unfortunately, the startup latency of containers degrades the responsiveness of IoT services dramatically. Container caching, while masking this latency, requires retaining resources thus compromising resource efficiency. In this paper, we study the retention-aware container caching problem in serverless edge computing. We leverage the distributed and heterogeneous nature of edge platforms and propose to optimize container caching jointly with request distribution. We reveal step by step that this joint optimization problem can be mapped to the classic ski-rental problem. We first present an online competitive algorithm for a special case where request distribution and container caching are based on a set of carefully designed probability distribution functions. Based on this algorithm, we propose an online algorithm called O-RDC for the general case, which incorporates the resource capacity and network latency by opportunistically distributing requests. We conduct extensive experiments to examine the performance of the proposed algorithms with both synthetic and real-world serverless computing traces. Our results show that O-RDC outperforms existing caching strategies of current serverless computing platforms by up to 94.5% in terms of the overall system cost.
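
The keep-alive decision can indeed be phrased as the classic ski-rental problem. The sketch below shows the standard deterministic 2-competitive rule (keep the container warm until the idle cost paid equals one cold start, then release it), with illustrative costs; it is the textbook rule, not the paper's O-RDC algorithm.

```python
def keep_alive_budget(cold_start_cost, keep_alive_cost_per_s):
    """Classic deterministic ski-rental: keep the container warm until the money
    spent keeping it idle equals one cold start, then release it (2-competitive)."""
    return cold_start_cost / keep_alive_cost_per_s

def should_evict(idle_seconds, cold_start_cost=0.020, keep_alive_cost_per_s=0.0004):
    # Illustrative default costs: $0.02 per cold start, $0.0004 per idle second.
    return idle_seconds >= keep_alive_budget(cold_start_cost, keep_alive_cost_per_s)

print(keep_alive_budget(0.020, 0.0004))  # keep warm for 50 s of idleness
print(should_evict(30), should_evict(60))
```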

[INFOCOM '22] StepConf: SLO-Aware Dynamic Resource Configuration for Serverless Function Workflows

  dblp   doi

Abstract

Function-as-a-Service (FaaS) offers a fine-grained resource provision model, enabling developers to build highly elastic cloud applications. User requests are handled by a series of serverless functions step by step, which forms a function-based workflow. Developers are required to set proper resource configurations for functions so as to meet service-level objectives (SLOs) and save cost. However, developing the resource configuration strategy is challenging. This is mainly because the execution of cloud functions often suffers from cold starts and performance fluctuations, which requires a dynamic configuration strategy to guarantee the SLOs. In this paper, we present StepConf, a framework that automates the resource configuration for functions as the workflow runs. StepConf optimizes memory size for each function step in the workflow and takes inter- and intra-function parallelism into consideration. We evaluate StepConf on AWS Lambda. Compared with baselines, the experimental results show that StepConf can save up to 40.9% of cost while ensuring the SLOs.
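
A hedged sketch of per-step configuration under an end-to-end SLO: exhaustively pick one memory size per step from a small profiled table and keep the cheapest plan that fits the latency budget. The step profiles are invented, and this brute-force search merely stands in for StepConf's optimization.

```python
from itertools import product

# Hypothetical per-step profiles: memory (MB) -> (latency s, cost $ per invocation).
STEPS = {
    "parse":  {512: (1.2, 0.00010), 1024: (0.7, 0.00012), 2048: (0.5, 0.00017)},
    "detect": {512: (3.0, 0.00025), 1024: (1.6, 0.00027), 2048: (1.0, 0.00034)},
    "render": {512: (0.9, 0.00008), 1024: (0.6, 0.00010), 2048: (0.5, 0.00017)},
}

def cheapest_plan(slo_s):
    """Exhaustively pick one memory size per step; return the cheapest plan whose
    summed latency stays within the workflow SLO (fine for a handful of steps)."""
    names = list(STEPS)
    best = None
    for combo in product(*(STEPS[n] for n in names)):
        lat = sum(STEPS[n][m][0] for n, m in zip(names, combo))
        cost = sum(STEPS[n][m][1] for n, m in zip(names, combo))
        if lat <= slo_s and (best is None or cost < best[0]):
            best = (cost, dict(zip(names, combo)))
    return best

print(cheapest_plan(slo_s=3.5))  # (total cost, memory size chosen per step)
```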

[EUROSYS '22] Fireworks: a fast, efficient, and safe serverless framework using VM-level post-JIT snapshot

  dblp   doi

Abstract

Serverless computing is a new paradigm that is rapidly gaining popularity in Cloud computing. One unique property in serverless computing is that the unit of deployment and execution is a serverless function, which is much smaller than a typical server program. Serverless computing introduces a new pay-as-you-go billing model and provides a high economic benefit from highly elastic resource provisioning. However, serverless computing also brings new challenges such as (1) long start-up times compared to relatively short function execution times, (2) security risks from a highly consolidated environment, and (3) memory efficiency problems from unpredictable function invocations. These problems not only degrade performance but also lower the economic benefits of Cloud providers. To address these challenges without any compromises, we propose a novel VM-level post-JIT snapshot approach and develop a new serverless framework, Fireworks. Our key idea is to synergistically leverage a virtual machine (VM)-level snapshot with a language runtime-level just-in-time (JIT) compilation in tandem. Fireworks leverages JITted serverless function code to reduce both start-up time and execution time of functions and improves memory efficiency by sharing the JITted code. Also, Fireworks can provide a high level of isolation by using a VM as a sandbox to execute a serverless function. Our evaluation results show that Fireworks outperforms state-of-the-art serverless platforms by 20.6× and provides higher memory efficiency of up to 7.3×.

[EUROSYS '22] Jiffy: elastic far-memory for stateful serverless analytics

  dblp   doi

Abstract

Stateful serverless analytics can be enabled using a remote memory system for inter-task communication, and for storing and exchanging intermediate data. However, existing systems allocate memory resources at job granularity---jobs specify their memory demands at the time of submission, and the system allocates memory equal to the job's demand for the entirety of its lifetime. This leads to resource underutilization and/or performance degradation when intermediate data sizes vary during job execution. This paper presents Jiffy, an elastic far-memory system for stateful serverless analytics that meets the instantaneous memory demand of a job at seconds timescales. Jiffy efficiently multiplexes memory capacity across concurrently running jobs, reducing the overheads of reads and writes to slower persistent storage, resulting in 1.6 -- 2.5× improvements in job execution time over production workloads. The Jiffy implementation currently runs on Amazon EC2, enables a wide variety of distributed programming models including MapReduce, Dryad, StreamScope, and Piccolo, and natively supports a large class of analytics applications on AWS Lambda.

[EUROSYS '22] Memory deduplication for serverless computing with Medes

  dblp   doi

Abstract

Serverless platforms today impose rigid trade-offs between resource use and user-perceived performance. Limited controls, provided via toggling sandboxes between warm and cold states and keep-alives, force operators to sacrifice significant resources to achieve good performance. We present a serverless framework, Medes, that breaks the rigid trade-off and allows operators to navigate the trade-off space smoothly. Medes leverages the fact that the warm sandboxes running on serverless platforms have a high fraction of duplication in their memory footprints. We exploit these redundant chunks to develop a new sandbox state, called a dedup state, that is more memory-efficient than the warm state and faster to restore from than the cold state. We develop novel mechanisms to identify memory redundancy at minimal overhead while ensuring that the dedup containers' memory footprint is small. Finally, we develop a simple sandbox management policy that exposes a narrow, intuitive interface for operators to trade off performance for memory by jointly controlling warm and dedup sandboxes. Detailed experiments with a prototype using real-world serverless workloads demonstrate that Medes can provide up to 1×-2.75× improvements in the end-to-end latencies. The benefits of Medes are enhanced in memory pressure situations, where Medes can provide up to 3.8× improvements in end-to-end latencies. Medes achieves this by reducing the number of cold starts incurred by 10--50% against the state-of-the-art baselines.
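
A minimal sketch of snapshot deduplication (much simpler than Medes's dedup state): split sandbox memory images into fixed-size chunks, store each distinct chunk once, and rebuild an image from its chunk recipe. Chunk size and inputs are illustrative.

```python
import hashlib

CHUNK = 4096  # illustrative chunk size in bytes

def dedup(snapshot: bytes, store: dict):
    """Split a sandbox memory snapshot into fixed-size chunks and keep only one
    copy of each distinct chunk; return the list of chunk hashes (the 'recipe')."""
    recipe = []
    for off in range(0, len(snapshot), CHUNK):
        chunk = snapshot[off:off + CHUNK]
        digest = hashlib.sha256(chunk).hexdigest()
        store.setdefault(digest, chunk)
        recipe.append(digest)
    return recipe

def restore(recipe, store):
    # Rebuild the snapshot by concatenating the stored chunks named in the recipe.
    return b"".join(store[d] for d in recipe)

store = {}
a = dedup(b"\x00" * 16384 + b"runtime-A", store)
b = dedup(b"\x00" * 16384 + b"runtime-B", store)
print(len(store))              # far fewer chunks stored than referenced
print(restore(a, store)[-9:])  # b'runtime-A'
```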

[ASPLOS '22] AQUATOPE: QoS-and-Uncertainty-Aware Resource Management for Multi-stage Serverless Workflows

  dblp   doi

Abstract

Multi-stage serverless applications, i.e., workflows with many computation and I/O stages, are becoming increasingly representative of FaaS platforms. Despite their advantages in terms of fine-grained scalability and modular development, these applications are subject to suboptimal performance, resource inefficiency, and high costs to a larger degree than previous simple serverless functions. We present Aquatope, a QoS-and-uncertainty-aware resource scheduler for end-to-end serverless workflows that takes into account the inherent uncertainty present in FaaS platforms, and improves performance predictability and resource efficiency. Aquatope uses a set of scalable and validated Bayesian models to create pre-warmed containers ahead of function invocations, and to allocate appropriate resources at function granularity to meet a complex workflow's end-to-end QoS, while minimizing resource cost. Across a diverse set of analytics and interactive multi-stage serverless workloads, Aquatope significantly outperforms prior systems, reducing QoS violations by 5X, and cost by 34% on average and up to 52% compared to other QoS-meeting methods.

[ASPLOS '22] IceBreaker: warming serverless functions better with heterogeneity

  dblp   doi

Abstract

Serverless computing, an emerging computing model, relies on "warming up" functions prior to their anticipated execution for faster and cost-effective service to users. Unfortunately, warming up functions can be inaccurate and incur prohibitively expensive cost during the warmup period (i.e., keep-alive cost). In this paper, we introduce IceBreaker, a novel technique that reduces the service time and the "keep-alive" cost by composing a system with heterogeneous nodes (costly and cheap). IceBreaker does so by dynamically determining the cost-effective node type to warm up a function based on the function's time-varying probability of the next invocation. By employing heterogeneity, IceBreaker allows for a larger number of nodes under the same cost budget and hence keeps more functions warm and reduces the wait time during high load. Our real-system evaluation confirms that IceBreaker reduces the overall keep-alive cost by 45% and execution time by 27% using representative serverless applications and industry-grade workload trace. IceBreaker is the first technique to employ and leverage the idea of mixing expensive and cheaper nodes to improve both service time and keep-alive cost for serverless functions -- opening up a new research avenue of serverless computing on heterogeneous servers for researchers and practitioners.
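
A toy expected-cost comparison in the spirit of the heterogeneous warm-up decision (not IceBreaker's model): given the probability of the next invocation, compare keeping the function warm on a fast node, on a cheap slower node, or not at all. Every price and constant below is an assumption.

```python
def keep_warm_choice(p_invoke, exec_time_s, fast_price_s, cheap_price_s, slowdown):
    """Pick where to keep a function warm for the next window: the expensive node,
    the cheap (slower) node, or nowhere, by comparing expected keep-alive + run cost.
    'slowdown' is how much longer the function runs on the cheap node."""
    window_s = 60.0       # illustrative keep-alive window
    cold_penalty_s = 1.0  # illustrative extra latency charged as cost if kept cold
    options = {
        "fast-node": window_s * fast_price_s + p_invoke * exec_time_s * fast_price_s,
        "cheap-node": window_s * cheap_price_s
                      + p_invoke * exec_time_s * slowdown * cheap_price_s,
        "no-warmup": p_invoke * (exec_time_s + cold_penalty_s) * fast_price_s,
    }
    return min(options, key=options.get), options

choice, costs = keep_warm_choice(p_invoke=0.2, exec_time_s=0.3,
                                 fast_price_s=0.00005, cheap_price_s=0.00002,
                                 slowdown=1.8)
print(choice, costs)
```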

[ASPLOS '22] INFless: a native serverless system for low-latency, high-throughput inference

  dblp   doi

Abstract

Modern websites increasingly rely on machine learning (ML) to improve their business efficiency. Developing and maintaining ML services incurs high costs for developers. Although serverless systems are a promising solution to reduce costs, we find that current general-purpose serverless systems cannot meet the low-latency, high-throughput demands of ML services. While simply "patching" general serverless systems does not resolve the problem completely, we propose that such a system should natively combine the features of inference with a serverless paradigm. We present INFless, the first ML domain-specific serverless platform. It provides a unified, heterogeneous resource abstraction between CPU and accelerators, and achieves high throughput using built-in batching and non-uniform scaling mechanisms. It also supports low latency through coordinated management of batch queuing time, execution time and cold-start rate. We evaluate INFless using both a local cluster testbed and a large-scale simulation. Experimental results show that INFless outperforms state-of-the-art systems by 2×-5× on system throughput, meeting the latency goals of ML services.

[ASPLOS '22] Serverless computing on heterogeneous computers

  dblp   doi

Abstract

Existing serverless computing platforms are built upon homogeneous computers, limiting the function density and restricting serverless computing to limited scenarios. We introduce Molecule, the first serverless computing system utilizing heterogeneous computers. Molecule enables both general-purpose devices (e.g., Nvidia DPU) and domain-specific accelerators (e.g., FPGA and GPU) for serverless applications that significantly improve function density (50% higher) and application performance (up to 34.6x). To achieve these results, we first propose XPU-Shim, a distributed shim to bridge the gap between underlying multi-OS systems (when using general-purpose devices) and our serverless runtime (i.e., Molecule). We further introduce vectorized sandbox, a sandbox abstraction to abstract hardware heterogeneity (when using domain-specific accelerators). Moreover, we also review state-of-the-art serverless optimizations on startup and communication latency and overcome the challenges to implement them on heterogeneous computers. We have implemented Molecule on real platforms with Nvidia DPUs and Xilinx FPGAs and evaluate it using benchmarks and real-world applications.

[USENIX '21] SONIC: Application-aware Data Passing for Chained Serverless Applications

  dblp   doi

Abstract

Data analytics applications are increasingly leveraging serverless execution environments for their ease-of-use and pay-as-you-go billing. The structure of such applications is usually composed of multiple functions that are chained together to form a workflow. The current approach of exchanging intermediate (ephemeral) data between functions is through remote storage (such as S3), which introduces significant performance overhead. We compare three data-passing methods, which we call VM-Storage, Direct-Passing, and state-of-practice Remote-Storage. Crucially, we show that no single data-passing method prevails under all scenarios and the optimal choice depends on dynamic factors such as the size of input data, the size of intermediate data, the application's degree of parallelism, and network bandwidth. We propose SONIC, a data-passing manager that optimizes application performance and cost, by transparently selecting the optimal data-passing method for each edge of a serverless workflow DAG and implementing communication-aware function placement. SONIC monitors application parameters and uses simple regression models to adapt its hybrid data passing accordingly. We integrate SONIC with OpenLambda and evaluate the system on Amazon EC2 with three analytics applications, popular in the serverless environment. SONIC provides lower latency (raw performance) and higher performance/$ across diverse conditions, compared to four baselines: SAND, vanilla OpenLambda, OpenLambda with Pocket, and AWS Lambda.
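
A hedged sketch of per-edge method selection: estimate the latency of VM-Storage, Direct-Passing, and Remote-Storage from a few edge parameters and pick the minimum. The cost formulas here are invented placeholders, not SONIC's fitted regression models.

```python
def pick_passing_method(data_mb, fanout, bandwidth_mbps, remote_rtt_s=0.05):
    """Estimate per-edge latency of the three data-passing methods and pick the
    lowest. The models are made up for illustration only."""
    estimates = {
        # Keep data on the producer VM; consumers must be scheduled there.
        "vm-storage": data_mb / bandwidth_mbps * 0.1,
        # Send the data directly from the producer to each consumer.
        "direct-passing": data_mb * fanout / bandwidth_mbps,
        # Upload once to remote storage, then each consumer downloads it.
        "remote-storage": 2 * remote_rtt_s + data_mb * (1 + fanout) / bandwidth_mbps,
    }
    return min(estimates, key=estimates.get), estimates

print(pick_passing_method(data_mb=200, fanout=4, bandwidth_mbps=125))
```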

[USENIX '21] FaaSNet: Scalable and Fast Provisioning of Custom Serverless Container Runtimes at Alibaba Cloud Function Compute

  dblp   doi

Abstract

Serverless computing, or Function-as-a-Service (FaaS), enables a new way of building and scaling applications by allowing users to deploy fine-grained functions while providing fully-managed resource provisioning and auto-scaling. Custom FaaS container support is gaining traction as it enables better control over OSes, versioning, and tooling for modernizing FaaS applications. However, providing rapid container provisioning introduces non-trivial challenges for FaaS providers, since container provisioning is costly, and real-world FaaS workloads exhibit highly dynamic patterns. In this paper, we design FaaSNet, a highly-scalable middleware system for accelerating FaaS container provisioning. FaaSNet is driven by the workload and infrastructure requirements of the FaaS platform at one of the world's largest cloud providers, Alibaba Cloud Function Compute. FaaSNet enables scalable container provisioning via a lightweight, adaptive function tree (FT) structure. FaaSNet uses an I/O efficient, on-demand fetching mechanism to further reduce provisioning costs at scale. We implement and integrate FaaSNet in Alibaba Cloud Function Compute. Evaluation results show that FaaSNet: (1) finishes provisioning 2,500 function containers on 1,000 virtual machines in 8.3 seconds, (2) scales 13.4× and 16.3× faster than Alibaba Cloud's current FaaS platform and a state-of-the-art P2P container registry (Kraken), respectively, and (3) sustains a bursty workload using 75.2% less time than an optimized baseline.

[SOSP '21] Boki: Stateful Serverless Computing with Shared Logs

  dblp   doi

Abstract

Boki is a new serverless runtime that exports a shared log API to serverless functions. Boki shared logs enable stateful serverless applications to manage their state with durability, consistency, and fault tolerance. Boki shared logs achieve high throughput and low latency. The key enabler is the metalog, a novel mechanism that allows Boki to address ordering, consistency and fault tolerance independently. The metalog orders shared log records with high throughput and it provides read consistency while allowing service providers to optimize the write and read path of the shared log in different ways. To demonstrate the value of shared logs for stateful serverless applications, we build Boki support libraries that implement fault-tolerant workflows, durable object storage, and message queues. Our evaluation shows that shared logs can speed up important serverless workloads by up to 4.7x.

[SOSP '21] Faster and Cheaper Serverless Computing on Harvested Resources

  dblp   doi

Abstract

Serverless computing is becoming increasingly popular due to its ease of programming, fast elasticity, and fine-grained billing. However, the serverless provider still needs to provision, manage, and pay the IaaS provider for the virtual machines (VMs) hosting its platform. This ties the cost of the serverless platform to the cost of the underlying VMs. One way to significantly reduce cost is to use spare resources, which cloud providers rent at a massive discount. Harvest VMs offer such cheap resources: they grow and shrink to harvest all the unallocated CPU cores in their host servers, but may be evicted to make room for more expensive VMs. Thus, using Harvest VMs to run the serverless platform comes with two main challenges that must be carefully managed: VM evictions and dynamically varying resources in each VM. In this work, we explore the challenges and benefits of hosting serverless (Function as a Service or simply FaaS) platforms on Harvest VMs. We characterize the serverless workloads and Harvest VMs of Microsoft Azure, and design a serverless load balancer that is aware of evictions and resource variations in Harvest VMs. We modify OpenWhisk, a widely-used open-source serverless platform, to monitor harvested resources and balance the load accordingly, and evaluate it experimentally. Our results show that adopting harvested resources improves efficiency and reduces cost. Under the same cost budget, running serverless platforms on harvested resources achieves 2.2x to 9.0x higher throughput compared to using dedicated resources. When using the same amount of resources, running serverless platforms on harvested resources achieves 48% to 89% cost savings with lower latency due to better load balancing.

[SC '21] Understanding, predicting and scheduling serverless workloads under partial interference

  dblp   doi

Abstract

Interference among distributed cloud applications can be classified into three types: full, partial and zero. While prior research merely focused on full interference, the partial interference that occurs at parts of applications is far more common yet still lacks in-depth study. Serverless computing, which structures applications into small-sized, short-lived functions, further exacerbates partial interference. We characterize the features of partial interference in serverless as exhibiting high volatility, spatial-temporal variation, and propagation. Given these observations, we propose an incremental learning predictor, named Gsight, which can achieve high precision by harnessing the spatial-temporal overlap codes and profiles of functions via an end-to-end call path. Experimental results show that Gsight can achieve an average error of 1.71%. Its convergence speed is at least 3X faster than that in a serverful system. A scheduling case study shows that the proposed method can improve function density by ≥ 18.79% while guaranteeing the quality of service (QoS).

[OSDI '21] Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads

  dblp   doi

Abstract

A graph neural network (GNN) enables deep learning on structured graph data. There are two major GNN training obstacles: 1) it relies on high-end servers with many GPUs which are expensive to purchase and maintain, and 2) limited memory on GPUs cannot scale to today's billion-edge graphs. This paper presents Dorylus: a distributed system for training GNNs. Uniquely, Dorylus can take advantage of serverless computing to increase scalability at a low cost. The key insight guiding our design is computation separation. Computation separation makes it possible to construct a deep, bounded-asynchronous pipeline where graph and tensor parallel tasks can fully overlap, effectively hiding the network latency incurred by Lambdas. With the help of thousands of Lambda threads, Dorylus scales GNN training to billion-edge graphs. Currently, for large graphs, CPU servers offer the best performance-per-dollar over GPU servers. Just using Lambdas on top of CPU servers offers up to 2.75✕ more performance-per-dollar than training only with CPU servers. Concretely, Dorylus is 1.22✕ faster and 4.83✕ cheaper than GPU servers for massive sparse graphs. Dorylus is up to 3.8✕ faster and 10.7✕ cheaper compared to existing sampling-based systems.

[NSDI '21] Caerus: NIMBLE Task Scheduling for Serverless Analytics

  dblp   doi

Abstract

Serverless platforms facilitate transparent resource elasticity and fine-grained billing, making them an attractive choice for data analytics. We find that while server-centric analytics frameworks typically optimize for job completion time (JCT), resource utilization and isolation via inter-job scheduling policies, serverless analytics requires optimizing for JCT and cost of execution instead, introducing a new scheduling problem. We present Caerus, a task scheduler for serverless analytics frameworks that employs a fine-grained NIMBLE scheduling algorithm to solve this problem. NIMBLE efficiently pipelines task executions within a job, minimizing execution cost while being Pareto-optimal between cost and JCT for arbitrary analytics jobs. To this end, NIMBLE models a wide range of execution parameters --- pipelineable and non-pipelineable data dependencies, data generation, consumption and processing rates, etc. --- to determine the ideal task launch times. Our evaluation results show that in practice, Caerus is able to achieve both optimal cost and JCT for queries across a wide range of analytics workloads.

[ISCA '21] Confidential Serverless Made Efficient with Plug-In Enclaves

  dblp   doi

Abstract

Serverless computing has become a fact of life on modern clouds. A serverless function may process sensitive data from clients. Protecting such a function against untrusted clouds using hardware enclaves is attractive for user privacy. In this work, we run existing serverless applications in SGX enclaves, and observe that the performance degradation can be as high as 5.6× to even 422.6×. Our investigation identifies that these slowdowns are related to architectural features, mainly from page-wise enclave initialization. Leveraging insights from our overhead analysis, we revisit SGX hardware design and make minimal modification to its enclave model. We extend SGX with a new primitive—region-wise plugin enclaves that can be mapped into existing enclaves to reuse attested common states amongst functions. By remapping plugin enclaves, an enclave allows in-situ processing to avoid expensive data movement in a function chain. Experiments show that our design reduces the enclave function latency by 94.74-99.57%, and boosts the autoscaling throughput by 19-179×.

[ASPLOS '21] Nightcore: efficient and scalable serverless computing for latency-sensitive, interactive microservices

  dblp   doi

Abstract

The microservice architecture is a popular software engineering approach for building flexible, large-scale online services. Serverless functions, or function as a service (FaaS), provide a simple programming model of stateless functions which are a natural substrate for implementing the stateless RPC handlers of microservices, as an alternative to containerized RPC servers. However, current serverless platforms have millisecond-scale runtime overheads, making them unable to meet the strict sub-millisecond latency targets required by existing interactive microservices. We present Nightcore, a serverless function runtime with microsecond-scale overheads that provides container-based isolation between functions. Nightcore’s design carefully considers various factors having microsecond-scale overheads, including scheduling of function requests, communication primitives, threading models for I/O, and concurrent function executions. Nightcore currently supports serverless functions written in C/C++, Go, Node.js, and Python. Our evaluation shows that when running latency-sensitive interactive microservices, Nightcore achieves 1.36×–2.93× higher throughput and up to 69% reduction in tail latency.

[ASPLOS '21] FaasCache: keeping serverless computing alive with greedy-dual caching

  dblp   doi

Abstract

Functions as a Service (also called serverless computing) promises to revolutionize how applications use cloud resources. However, functions suffer from cold-start problems due to the overhead of initializing their code and data dependencies before they can start executing. Keeping functions alive and warm after they have finished execution can alleviate the cold-start overhead. Keep-alive policies must keep functions alive based on their resource and usage characteristics, which is challenging due to the diversity in FaaS workloads. Our insight is that keep-alive is analogous to caching. Our caching-inspired Greedy-Dual keep-alive policy can be effective in reducing the cold-start overhead by more than 3× compared to current approaches. Caching concepts such as reuse distances and hit-ratio curves can also be used for auto-scaled server resource provisioning, which can reduce the resource requirement of FaaS providers by 30% for real-world dynamic workloads. We implement caching-based keep-alive and resource provisioning policies in our FaasCache system, which is based on OpenWhisk. We hope that our caching analogy opens the door to more principled and optimized keep-alive and resource provisioning techniques for future FaaS workloads and platforms.
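
A minimal sketch of a Greedy-Dual-style keep-alive policy (a simplification of FaasCache's): each warm container gets a priority of clock + frequency * initialization cost / memory size, and the lowest-priority container is evicted under memory pressure, so recently and frequently used, expensive-to-initialize, small containers stay warm longest. All numbers are illustrative.

```python
class GreedyDualKeepAlive:
    """Toy Greedy-Dual keep-alive: priority = clock + freq * init_cost / memory.
    On memory pressure, evict the warm container with the lowest priority and
    advance the clock to that priority (the 'aging' step of Greedy-Dual)."""

    def __init__(self, capacity_mb):
        self.capacity, self.used, self.clock = capacity_mb, 0, 0.0
        self.warm = {}  # func -> dict(freq, init_cost, mem, priority)

    def _evict_until(self, need_mb):
        while self.warm and self.used + need_mb > self.capacity:
            victim = min(self.warm, key=lambda f: self.warm[f]["priority"])
            self.clock = self.warm[victim]["priority"]
            self.used -= self.warm.pop(victim)["mem"]

    def on_finish(self, func, init_cost, mem):
        entry = self.warm.get(func, {"freq": 0, "init_cost": init_cost, "mem": mem})
        if func not in self.warm:
            self._evict_until(mem)
            self.used += mem
        entry["freq"] += 1
        entry["priority"] = self.clock + entry["freq"] * entry["init_cost"] / entry["mem"]
        self.warm[func] = entry

cache = GreedyDualKeepAlive(capacity_mb=1024)
cache.on_finish("thumbs", init_cost=400, mem=512)
cache.on_finish("ocr", init_cost=900, mem=512)
cache.on_finish("hello", init_cost=50, mem=512)  # evicts the lowest-priority entry
print(sorted(cache.warm))
```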

[ASPLOS '21] Benchmarking, analysis, and optimization of serverless function snapshots

  dblp   doi

Abstract

Serverless computing has seen rapid adoption due to its high scalability and flexible, pay-as-you-go billing model. In serverless, developers structure their services as a collection of functions, sporadically invoked by various events like clicks. High inter-arrival time variability of function invocations motivates the providers to start new function instances upon each invocation, leading to significant cold-start delays that degrade user experience. To reduce cold-start latency, the industry has turned to snapshotting, whereby an image of a fully-booted function is stored on disk, enabling a faster invocation compared to booting a function from scratch. This work introduces vHive, an open-source framework for serverless experimentation with the goal of enabling researchers to study and innovate across the entire serverless stack. Using vHive, we characterize a state-of-the-art snapshot-based serverless infrastructure, based on industry-leading Containerd orchestration framework and Firecracker hypervisor technologies. We find that the execution time of a function started from a snapshot is 95% higher, on average, than when the same function is memory-resident. We show that the high latency is attributable to frequent page faults as the function's state is brought from disk into guest memory one page at a time. Our analysis further reveals that functions access the same stable working set of pages across different invocations of the same function. By leveraging this insight, we build REAP, a light-weight software mechanism for serverless hosts that records functions' stable working set of guest memory pages and proactively prefetches it from disk into memory. Compared to baseline snapshotting, REAP slashes the cold-start delays by 3.7x, on average.

[USENIX '20] Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider

  dblp   doi

Abstract

Function as a Service (FaaS) has been gaining popularity as a way to deploy computations to serverless backends in the cloud. This paradigm shifts the complexity of allocating and provisioning resources to the cloud provider, which has to provide the illusion of always-available resources (i.e., fast function invocations without cold starts) at the lowest possible resource cost. Doing so requires the provider to deeply understand the characteristics of the FaaS workload. Unfortunately, there has been little to no public information on these characteristics. Thus, in this paper, we first characterize the entire production FaaS workload of Azure Functions. We show for example that most functions are invoked very infrequently, but there is an 8-order-of-magnitude range of invocation frequencies. Using observations from our characterization, we then propose a practical resource management policy that significantly reduces the number of function cold starts, while spending fewer resources than state-of-the-practice policies.

[USENIX '20] Faasm: Lightweight Isolation for Efficient Stateful Serverless Computing

  dblp   doi

Abstract

Serverless computing is an excellent fit for big data processing because it can scale quickly and cheaply to thousands of parallel functions. Existing serverless platforms isolate functions in ephemeral, stateless containers, preventing them from directly sharing memory. This forces users to duplicate and serialise data repeatedly, adding unnecessary performance and resource costs. We believe that a new lightweight isolation approach is needed, which supports sharing memory directly between functions and reduces resource overheads. We introduce Faaslets, a new isolation abstraction for high-performance serverless computing. Faaslets isolate the memory of executed functions using \emph{software-fault isolation} (SFI), as provided by WebAssembly, while allowing memory regions to be shared between functions in the same address space. Faaslets can thus avoid expensive data movement when functions are co-located on the same machine. Our runtime for Faaslets, Faasm, isolates other resources, e.g. CPU and network, using standard Linux cgroups, and provides a low-level POSIX host interface for networking, file system access and dynamic loading. To reduce initialisation times, Faasm restores Faaslets from already-initialised snapshots. We compare Faasm to a standard container-based platform and show that, when training a machine learning model, it achieves a 2× speed-up with 10× less memory; for serving machine learning inference, Faasm doubles the throughput and reduces tail latency by 90%.
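A tiny sketch of the zero-copy sharing idea follows, using Python's multiprocessing.shared_memory as a stand-in for the shared regions Faaslets expose within one address space; the WebAssembly-based software-fault isolation that makes such sharing safe is not modelled here.

```python
# Two co-located "functions" operate on a single shared memory region instead
# of serialising state through external storage.
import numpy as np
from multiprocessing import shared_memory

region = shared_memory.SharedMemory(create=True, size=8 * 1024)

def producer_function():
    data = np.ndarray((1024,), dtype=np.float64, buffer=region.buf)
    data[:] = np.arange(1024)                 # written in place, no copy

def consumer_function():
    data = np.ndarray((1024,), dtype=np.float64, buffer=region.buf)
    return float(data.sum())                  # read in place, no deserialisation

producer_function()
print(consumer_function())                    # 523776.0
region.close()
region.unlink()
```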

[SC '20] Batch: machine learning inference serving on serverless platforms with adaptive batching

  dblp   doi

Abstract

Serverless computing is a new pay-per-use cloud service paradigm that automates resource scaling for stateless functions and can potentially facilitate bursty machine learning serving. Batching is critical for latency performance and cost-effectiveness of machine learning inference, but unfortunately it is not supported by existing serverless platforms due to their stateless design. Our experiments show that without batching, machine learning serving cannot reap the benefits of serverless computing. In this paper, we present BATCH, a framework for supporting efficient machine learning serving on serverless platforms. BATCH uses an optimizer to provide inference tail latency guarantees and cost optimization and to enable adaptive batching support. We prototype BATCH atop AWS Lambda and popular machine learning inference systems. The evaluation verifies the accuracy of the analytic optimizer and demonstrates performance and cost advantages over the state-of-the-art method MArk and the state-of-the-practice tool SageMaker.
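The core mechanism implied here is a buffer that flushes on either a size or a timeout trigger, with the optimizer choosing those two knobs. Below is a minimal sketch of such a dispatcher; the fixed batch size, timeout, and handler are placeholders, not BATCH's tuned values.

```python
# Hold incoming inference requests until either a target batch size or a
# timeout is reached, then dispatch them together.
import threading
import time

class BatchingDispatcher:
    def __init__(self, handler, max_batch=8, timeout_s=0.05):
        self.handler = handler                # called with a list of requests
        self.max_batch = max_batch
        self.timeout_s = timeout_s
        self.buffer = []
        self.deadline = None
        self.lock = threading.Lock()

    def submit(self, request):
        with self.lock:
            if not self.buffer:
                self.deadline = time.monotonic() + self.timeout_s
            self.buffer.append(request)
            if len(self.buffer) >= self.max_batch:
                self._flush_locked()          # size trigger

    def poll(self):
        """Call periodically; flushes a partial batch once its timeout expires."""
        with self.lock:
            if self.buffer and time.monotonic() >= self.deadline:
                self._flush_locked()          # timeout trigger

    def _flush_locked(self):
        batch, self.buffer = self.buffer, []
        self.handler(batch)

dispatcher = BatchingDispatcher(lambda batch: print(f"inference on {len(batch)} inputs"))
for i in range(10):
    dispatcher.submit({"input": i})
time.sleep(0.06)
dispatcher.poll()                             # prints batches of 8 and 2
```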

[OSDI '20] Fault-tolerant and transactional stateful serverless workflows

  dblp   doi

Abstract

This paper introduces Beldi, a library and runtime system for writing and composing fault-tolerant and transactional stateful serverless functions. Beldi runs on existing providers and lets developers write complex stateful applications that require fault tolerance and transactional semantics without the need to deal with tasks such as load balancing or maintaining virtual machines. Beldi’s contributions include extending the log-based fault-tolerant approach in Olive (OSDI 2016) with new data structures, transaction protocols, function invocations, and garbage collection. They also include adapting the resulting framework to work over a federated environment where each serverless function has sovereignty over its own data. We implement three applications on Beldi, including a movie review service, a travel reservation system, and a social media site. Our evaluation on 1,000 AWS Lambdas shows that Beldi’s approach is effective and affordable.
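A conceptual sketch of the logging discipline such systems rely on: each step's result is recorded under a unique key, so a re-executed function replays logged results instead of repeating side effects. The dict-based log, the step names, and the reserve_flight helper are illustrative; Beldi's actual data structures, transaction protocol, and garbage collection are richer.

```python
# Idempotent step execution via a durable step log (an in-memory dict here
# stands in for the log Beldi keeps in cloud storage).
step_log = {}   # (workflow_id, step_id) -> recorded result

def run_step(workflow_id, step_id, effect, *args):
    key = (workflow_id, step_id)
    if key in step_log:                 # re-execution after a crash or retry
        return step_log[key]            # replay; do not repeat the side effect
    result = effect(*args)              # perform the step
    step_log[key] = result              # record it before moving on
    return result

def reserve_flight(order_id):
    print(f"reserving flight for order {order_id}")   # the visible side effect
    return {"confirmation": f"FL-{order_id}"}

# The first run performs the reservation; a retried run only replays the log.
print(run_step("wf-42", "step-1", reserve_flight, "order-7"))
print(run_step("wf-42", "step-1", reserve_flight, "order-7"))
```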

[NSDI '20] Firecracker: Lightweight Virtualization for Serverless Applications

  dblp   doi

Abstract

Serverless containers and functions are widely used for deploying and managing software in the cloud. Their popularity is due to reduced cost of operations, improved utilization of hardware, and faster scaling than traditional deployment methods. The economics and scale of serverless applications demand that workloads from multiple customers run on the same hardware with minimal overhead, while preserving strong security and performance isolation. The traditional view is that there is a choice between virtualization with strong security and high overhead, and container technologies with weaker security and minimal overhead. This tradeoff is unacceptable to public infrastructure providers, who need both strong security and minimal overhead. To meet this need, we developed Firecracker, a new open source Virtual Machine Monitor (VMM) specialized for serverless workloads, but generally useful for containers, functions and other compute workloads within a reasonable set of constraints. We have deployed Firecracker in two publicly available serverless compute services at AWS (Lambda and Fargate), where it supports millions of production workloads, and trillions of requests per month. We describe how specializing for serverless informed the design of Firecracker, and what we learned from seamlessly migrating AWS Lambda customers to Firecracker.

[INFOCOM '20] COSE: Configuring Serverless Functions using Statistical Learning

  dblp   doi

Abstract

Serverless computing has emerged as a new compelling paradigm for the deployment of applications and services. It represents an evolution of cloud computing with a simplified programming model that aims to abstract away most operational concerns. Running serverless functions requires users to configure multiple parameters, such as memory, CPU, cloud provider, etc. While this configuration is simpler than managing infrastructure, choosing the parameters correctly while minimizing cost and meeting delay constraints is not trivial. In this paper, we present COSE, a framework that uses Bayesian Optimization to find the optimal configuration for serverless functions. COSE uses statistical learning techniques to intelligently collect samples and predict the cost and execution time of a serverless function across unseen configuration values. Our framework uses the predicted cost and execution time to select the "best" configuration parameters for running a single function or a chain of functions while satisfying customer objectives. In addition, COSE has the ability to adapt to changes in the execution time of a serverless function. We evaluate COSE not only on a commercial cloud provider, where we successfully found optimal/near-optimal configurations in as few as five samples, but also over a wide range of simulated distributed cloud environments that confirm the efficacy of our approach.
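As an illustration of the kind of search loop such a framework runs, the sketch below uses a Gaussian process and expected improvement to pick the next memory size to sample; the measure_cost stub, the candidate grid, and the single cost objective are assumptions, and COSE's real models also cover execution time, delay constraints, function chains, and multiple providers.

```python
# Bayesian-optimization loop over a function's memory configuration.
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor

def measure_cost(memory_mb):
    """Stub: deploy at memory_mb, invoke the function, return the observed cost."""
    exec_s = 2000.0 / memory_mb + 0.2               # made-up latency curve
    return (memory_mb / 1024.0) * exec_s            # roughly GB-seconds

candidates = np.arange(128, 3009, 64, dtype=float).reshape(-1, 1)
X = np.array([[128.0], [3008.0]])                   # two seed samples
y = np.array([measure_cost(m[0]) for m in X])

for _ in range(5):
    gp = GaussianProcessRegressor(normalize_y=True).fit(X, y)
    mu, sigma = gp.predict(candidates, return_std=True)
    imp = y.min() - mu                               # improvement (minimization)
    z = np.where(sigma > 0, imp / sigma, 0.0)
    ei = imp * norm.cdf(z) + sigma * norm.pdf(z)     # expected improvement
    nxt = candidates[np.argmax(ei)]
    X = np.vstack([X, nxt])
    y = np.append(y, measure_cost(nxt[0]))

print(f"best memory size sampled so far: {X[np.argmin(y)][0]:.0f} MB")
```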

[FAST '20] InfiniCache: Exploiting Ephemeral Serverless Functions to Build a Cost-Effective Memory Cache

  dblp   doi

Abstract

Internet-scale web applications are becoming increasingly storage-intensive and rely heavily on in-memory object caching to attain required I/O performance. We argue that the emerging serverless computing paradigm provides a well-suited, cost-effective platform for object caching. We present InfiniCache, a first-of-its-kind in-memory object caching system that is completely built and deployed atop ephemeral serverless functions. InfiniCache exploits and orchestrates serverless functions' memory resources to enable elastic pay-per-use caching. InfiniCache's design combines erasure coding, intelligent billed duration control, and an efficient data backup mechanism to maximize data availability and cost-effectiveness while balancing the risk of losing cached state and performance. We implement InfiniCache on AWS Lambda and show that it: (1) achieves 31 – 96× tenant-side cost savings compared to AWS ElastiCache for a large-object-only production workload, (2) can effectively provide 95.4% data availability for each one hour window, and (3) achieves performance comparable to a typical in-memory cache.
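For intuition about the erasure-coding ingredient, here is the simplest possible scheme: k data chunks plus one XOR parity chunk, which can rebuild any single lost chunk. InfiniCache itself uses Reed-Solomon coding (tolerating multiple losses) together with billed-duration control and backups, so this is only a toy example.

```python
# Stripe an object into k data chunks plus one XOR parity chunk; rebuild any
# single lost chunk from the survivors.
def encode(data, k):
    chunk_len = -(-len(data) // k)                       # ceil division
    chunks = [data[i * chunk_len:(i + 1) * chunk_len].ljust(chunk_len, b"\0")
              for i in range(k)]
    parity = bytearray(chunk_len)
    for chunk in chunks:
        for i, byte in enumerate(chunk):
            parity[i] ^= byte
    return chunks + [bytes(parity)]                      # k data chunks + parity

def recover(stripes, lost_index):
    chunk_len = len(next(s for s in stripes if s is not None))
    rebuilt = bytearray(chunk_len)
    for idx, stripe in enumerate(stripes):
        if idx == lost_index:
            continue
        for i, byte in enumerate(stripe):
            rebuilt[i] ^= byte                           # XOR of survivors = lost chunk
    return bytes(rebuilt)

stripes = encode(b"cached object payload", k=4)
stripes[2] = None                                        # one function's memory was reclaimed
stripes[2] = recover(stripes, 2)
print(b"".join(stripes[:4]).rstrip(b"\0"))               # b'cached object payload'
```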

[EUROSYS '20] A fault-tolerance shim for serverless computing

  dblp   doi

Abstract

Serverless computing has grown in popularity in recent years, with an increasing number of applications being built on Functions-as-a-Service (FaaS) platforms. By default, FaaS platforms support retry-based fault tolerance, but this is insufficient for programs that modify shared state, as they can unwittingly persist partial sets of updates in case of failures. To address this challenge, we would like atomic visibility of the updates made by a FaaS application. In this paper, we present aft, an atomic fault tolerance shim for serverless applications. aft interposes between a commodity FaaS platform and storage engine and ensures atomic visibility of updates by enforcing the read atomic isolation guarantee. aft supports new protocols to guarantee read atomic isolation in the serverless setting. We demonstrate that aft introduces minimal overhead relative to existing storage engines and scales smoothly to thousands of requests per second, while preventing a significant number of consistency anomalies.
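A minimal sketch of the atomic-visibility rule: install a transaction's versions first, write the commit record last, and have readers skip versions without a commit record, so they never see a partial set of updates. The dict-based storage and helper names are assumptions; aft's full read atomic protocol, which also keeps repeated reads mutually consistent, is omitted.

```python
# Writes become visible only once their transaction's commit record exists.
import uuid

versions = {}        # key -> list of (txn_id, value)
committed = set()    # txn ids that have a commit record

def run_transaction(writes):
    txn = uuid.uuid4().hex
    for key, value in writes.items():                  # install versions first
        versions.setdefault(key, []).append((txn, value))
    committed.add(txn)                                 # commit record goes last
    return txn

def read(key):
    for txn, value in reversed(versions.get(key, [])):
        if txn in committed:                           # ignore uncommitted writes
            return value
    return None

run_transaction({"cart:alice": ["book"], "total:alice": 12})
versions.setdefault("total:alice", []).append(("crashed-txn", 99))   # never committed
print(read("cart:alice"), read("total:alice"))         # ['book'] 12
```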

[EUROSYS '20] SEUSS: skip redundant paths to make serverless fast

  dblp   doi

Abstract

This paper presents a system-level method for achieving the rapid deployment and high-density caching of serverless functions in a FaaS environment. For reduced start times, functions are deployed from unikernel snapshots, bypassing expensive initialization steps. To reduce the memory footprint of snapshots we apply page-level sharing across the entire software stack that is required to run a function. We demonstrate the effects of our techniques by replacing Linux on the compute node of a FaaS platform architecture. With our prototype OS, the deployment time of a function drops from hundreds of milliseconds to under 10 ms. Platform throughput improves by 51x on a workload composed entirely of new functions. We are able to cache over 50,000 function instances in memory as opposed to 3,000 using standard OS techniques. In combination, these improvements give the FaaS platform a new ability to handle large-scale bursts of requests.

[ASPLOS '20] Catalyzer: Sub-millisecond Startup for Serverless Computing with Initialization-less Booting

  dblp   doi

Abstract

Serverless computing promises cost-efficiency and elasticity for highly productive software development. To achieve this, the serverless sandbox system must address two challenges: strong isolation between function instances, and low startup latency to ensure user experience. While strong isolation can be provided by virtualization-based sandboxes, the initialization of the sandbox and application causes non-negligible startup overhead. Conventional sandbox systems fall short in low-latency startup due to their application-agnostic nature: they can only reduce the latency of sandbox initialization through hypervisor and guest kernel customization, which is inadequate and does not mitigate the majority of startup overhead. This paper proposes Catalyzer, a serverless sandbox system design providing both strong isolation and extremely fast function startup. Instead of booting from scratch, Catalyzer restores a virtualization-based function instance from a well-formed checkpoint image and thereby skips the initialization on the critical path (init-less). Catalyzer boosts the restore performance by on-demand recovering both user-level memory state and system state. We also propose a new OS primitive, sfork (sandbox fork), to further reduce the startup latency by directly reusing the state of a running sandbox instance. Fundamentally, Catalyzer removes the initialization cost by reusing state, which enables general optimizations for diverse serverless functions. The evaluation shows that Catalyzer reduces startup latency by orders of magnitude, achieves <1 ms latency in the best case, and significantly reduces the end-to-end latency for real-world workloads. Catalyzer has been adopted by Ant Financial, and we also present lessons learned from industrial development.
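The restore-instead-of-initialize contrast can be illustrated at the application level with a checkpoint file, as in the sketch below; the pickle-based checkpoint and cold_init stub are stand-ins, since Catalyzer restores entire sandboxes (guest memory and system state) and adds the sfork primitive, neither of which is modelled here.

```python
# Initialize once, checkpoint the resulting state, and let later starts load
# the checkpoint instead of rebuilding everything.
import os
import pickle
import tempfile
import time

CHECKPOINT = os.path.join(tempfile.gettempdir(), "function_state.pkl")   # illustrative path

def cold_init():
    time.sleep(0.5)                            # stands in for loading models/config
    return {"model": list(range(100_000))}

def invoke():
    try:
        with open(CHECKPOINT, "rb") as f:      # init-less path: restore saved state
            state = pickle.load(f)
    except FileNotFoundError:                  # very first start: init, then checkpoint
        state = cold_init()
        with open(CHECKPOINT, "wb") as f:
            pickle.dump(state, f)
    return len(state["model"])

for _ in range(2):
    start = time.perf_counter()
    invoke()
    print(f"invocation took {time.perf_counter() - start:.3f}s")
```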

[NSDI '19] Shuffling, Fast and Slow: Scalable Analytics on Serverless Infrastructure

  dblp   doi

Abstract

Serverless computing is poised to fulfill the long-held promise of transparent elasticity and millisecond-level pricing. To achieve this goal, service providers impose a fine-grained computational model where every function has a maximum duration, a fixed amount of memory and no persistent local storage. We observe that the fine-grained elasticity of serverless is key to achieve high utilization for general computations such as analytics workloads, but that resource limits make it challenging to implement such applications as they need to move large amounts of data between functions that don't overlap in time. In this paper, we present Locus, a serverless analytics system that judiciously combines (1) cheap but slow storage with (2) fast but expensive storage, to achieve good performance while remaining cost-efficient. Locus applies a performance model to guide users in selecting the type and the amount of storage to achieve the desired cost-performance trade-off. We evaluate Locus on a number of analytics applications including TPC-DS, CloudSort, and Big Data Benchmark, and show that Locus can navigate the cost-performance trade-off, leading to 4×-500× performance improvements over a slow-storage-only baseline, reducing resource usage by up to 59% while achieving comparable performance to a cluster of virtual machines, and staying within 1.99× of Redshift's performance.
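A toy version of the storage-mix decision such a performance model makes is sketched below, assuming invented throughput and price numbers; it only shows the shape of the trade-off, not Locus's calibrated model.

```python
# Estimate shuffle time and cost for each candidate amount of fast storage and
# keep the cheapest option that meets a deadline.
def pick_storage_mix(shuffle_gb, deadline_s,
                     slow_gbps=1.0, fast_gbps=10.0,
                     slow_cost_per_gb=0.00004, fast_cost_per_gb_s=0.00002):
    best = None
    for fast_gb in range(0, int(shuffle_gb) + 1, 10):
        slow_gb = shuffle_gb - fast_gb
        time_s = max(slow_gb / slow_gbps, fast_gb / fast_gbps)   # read in parallel
        cost = slow_gb * slow_cost_per_gb + fast_gb * fast_cost_per_gb_s * time_s
        if time_s <= deadline_s and (best is None or cost < best[2]):
            best = (fast_gb, time_s, cost)
    return best   # (GB of fast storage, estimated seconds, estimated $) or None

print(pick_storage_mix(shuffle_gb=500, deadline_s=120))
```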

[INFOCOM '19] Distributed Machine Learning with a Serverless Architecture

  dblp   doi

Abstract

The need to scale up machine learning, in the presence of a rapid growth of data both in volume and in variety, has sparked broad interest in developing distributed machine learning systems, typically based on parameter servers. However, since these systems are based on a dedicated cluster of physical or virtual machines, they impose non-trivial cluster management overhead on machine learning practitioners and data scientists. In addition, there exists an inherent mismatch between the dynamically varying resource demands during a model training job and the inflexible resource provisioning model of current cluster-based systems. In this paper, we propose SIREN, an asynchronous distributed machine learning framework based on the emerging serverless architecture, with which stateless functions can be executed in the cloud without the complexity of building and maintaining virtual machine infrastructures. With SIREN, we are able to achieve a higher level of parallelism and elasticity by using a swarm of stateless functions, each working on a different batch of data, while greatly reducing system configuration overhead. Furthermore, we propose a scheduler based on Deep Reinforcement Learning to dynamically control the number and memory size of the stateless functions that should be used in each training epoch. The scheduler learns from the training process itself, in pursuit of the minimum possible training time at a given cost. With our real-world prototype implementation on AWS Lambda, extensive experimental results show that SIREN can reduce model training time by up to 44%, as compared to traditional machine learning training benchmarks on AWS EC2 at the same cost.

[USENIX '18] SOCK: Rapid Task Provisioning with Serverless-Optimized Containers

  dblp   doi

Abstract

Serverless computing promises to provide applications with cost savings and extreme elasticity. Unfortunately, slow application and container initialization can hurt common-case latency on serverless platforms. In this work, we analyze Linux container primitives, identifying scalability bottlenecks related to storage and network isolation. We also analyze Python applications from GitHub and show that importing many popular libraries adds about 100ms to startup. Based on these findings, we implement SOCK, a container system optimized for serverless workloads. Careful avoidance of kernel scalability bottlenecks gives SOCK an 18x speedup over Docker. A generalized-Zygote provisioning strategy yields an additional 3x speedup. A more sophisticated three-tier caching strategy based on Zygotes provides a 45x speedup over SOCK without Zygotes. Relative to AWS Lambda and OpenWhisk, OpenLambda with SOCK reduces platform overheads by 2.8x and 5.3x respectively in an image processing case study.
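The generalized-Zygote idea can be sketched with a plain fork: a long-lived parent pays the import cost once and forks a child per request, so children start with packages already in memory. The example below is Unix-only and uses json and hashlib as stand-ins for heavy libraries; real SOCK Zygotes also set up container, namespace, and cgroup isolation.

```python
# Zygote-style provisioning: fork warm children instead of cold-importing.
import os
import json, hashlib                          # "expensive" imports, done once in the zygote

def handle(request):
    return hashlib.sha256(json.dumps(request).encode()).hexdigest()

def serve_from_zygote(requests):
    for req in requests:
        pid = os.fork()                       # child inherits the warm imports
        if pid == 0:                          # child: run the handler, then exit
            print(f"pid {os.getpid()}: {handle(req)[:12]}")
            os._exit(0)
        os.waitpid(pid, 0)                    # parent: reap the child, fork again

if __name__ == "__main__":
    serve_from_zygote([{"n": 1}, {"n": 2}, {"n": 3}])
```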

[USENIX '18] Peeking Behind the Curtains of Serverless Platforms

  dblp   doi

Abstract

Serverless computing is an emerging paradigm in which an application's resource provisioning and scaling are managed by third-party services. Examples include AWS Lambda, Azure Functions, and Google Cloud Functions. Behind these services' easy-to-use APIs are opaque, complex infrastructure and management ecosystems. Taking on the viewpoint of a serverless customer, we conduct the largest measurement study to date, launching more than 50,000 function instances across these three services, in order to characterize their architectures, performance, and resource management efficiency. We explain how the platforms isolate the functions of different accounts, using either virtual machines or containers, which has important security implications. We characterize performance in terms of scalability, cold-start latency, and resource efficiency, with highlights including that AWS Lambda adopts a bin-packing-like strategy to maximize VM memory utilization, that severe contention between functions can arise in AWS and Azure, and that Google had bugs that allowed customers to use resources for free.

[USENIX '18] Understanding Ephemeral Storage for Serverless Analytics

  dblp   doi

Abstract

Serverless computing frameworks allow users to launch thousands of concurrent tasks with high elasticity and fine-grain resource billing without explicitly managing computing resources. While already successful for IoT and web microservices, there is increasing interest in leveraging serverless computing to run data-intensive jobs, such as interactive analytics. A key challenge in running analytics workloads on serverless platforms is enabling tasks in different execution stages to efficiently communicate data between each other via a shared data store. In this paper, we explore the suitability of different cloud storage services (e.g., object stores and distributed caches) as remote storage for serverless analytics. Our analysis leads to key insights to guide the design of an ephemeral cloud storage system, including the performance and cost efficiency of Flash storage for serverless application requirements and the need for a pay-what-you-use storage service that can support the high throughput demands of highly parallel applications.

[USENIX '18] SAND: Towards High-Performance Serverless Computing

  dblp   doi

Abstract

Serverless computing has emerged as a new cloud computing paradigm, where an application consists of individual functions that can be separately managed and executed. However, existing serverless platforms normally isolate and execute functions in separate containers, and do not exploit the interactions among functions for performance. These practices lead to high startup delays for function executions and inefficient resource usage. This paper presents SAND, a new serverless computing system that provides lower latency, better resource efficiency and more elasticity than existing serverless platforms. To achieve these properties, SAND introduces two key techniques: 1) application-level sandboxing, and 2) a hierarchical message bus. We have implemented and deployed a complete SAND system. Our results show that SAND outperforms the state-of-the-art serverless platforms significantly. For example, in a commonly-used image processing application, SAND achieves a 43% speedup compared to Apache OpenWhisk.
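The hierarchical message bus is only named in the abstract; as an illustration of the routing idea it suggests, the sketch below sends same-host triggers through a local in-memory queue and only cross-host triggers through a global publish. The host names, the queue, and publish_global() are assumptions for illustration, not SAND's API.

```python
# Local-first event routing between chained functions.
from collections import deque

local_queue = deque()

def publish_global(event):
    print(f"global bus <- {event}")           # stand-in for a real message broker

def trigger(event, target_host, this_host="host-A"):
    if target_host == this_host:
        local_queue.append(event)             # short path: same-host function chaining
    else:
        publish_global(event)                 # long path: cross-host delivery

trigger({"fn": "resize", "img": "a.png"}, target_host="host-A")
trigger({"fn": "store", "img": "a.png"}, target_host="host-B")
print(f"locally queued: {list(local_queue)}")
```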

[OSDI '18] Pocket: Elastic Ephemeral Storage for Serverless Analytics

  dblp   doi

Abstract

Serverless computing is becoming increasingly popular, enabling users to quickly launch thousands of short-lived tasks in the cloud with high elasticity and fine-grain billing. These properties make serverless computing appealing for interactive data analytics. However, exchanging intermediate data between execution stages in an analytics job is a key challenge, as direct communication between serverless tasks is difficult. The natural approach is to store such ephemeral data in a remote data store. However, existing storage systems are not designed to meet the demands of serverless applications in terms of elasticity, performance, and cost. We present Pocket, an elastic, distributed data store that automatically scales to provide applications with desired performance at low cost. Pocket dynamically rightsizes resources across multiple dimensions (CPU cores, network bandwidth, storage capacity) and leverages multiple storage technologies to minimize cost while ensuring applications are not bottlenecked on I/O. We show that Pocket achieves similar performance to ElastiCache Redis for serverless analytics applications while reducing cost by almost 60%.