Cloud Platforms

Reducing costs for shuffle-heavy Apache Spark workloads with serverless storage for Ama...

This signal matters because cloud data platforms are increasingly evaluated on delivery speed, governance, and the ability to scale reliable analytics without operational sprawl.

AB • 2026-03-10

Reducing costs for shuffle-heavy Apache Spark workloads with serverless storage for Amazon EMR Serverless

In this post, we explore the cost improvements we observed when benchmarking Apache Spark jobs with serverless storage on EMR Serverless. We take a deeper look at how serverless storage helps reduce costs for shuffle-...

Open source reference