Apache Spark is a popular framework for developing distributed, parallel data processing applications. Our solution for Apache Spark on Kubernetes has made significant progress in the year since we launched, including support for Apache Iceberg and a new GPU-accelerated image.
We’ve also been busy adding initial support for Apache Kyuubi to our Charmed Spark solution, so that you can deploy an enterprise-grade, fault-tolerant, ANSI-SQL-compliant data warehouse on your Kubernetes data lake infrastructure, building a so-called ‘lakehouse’. You can deploy a comprehensive, hyper-automated data lake infrastructure using our all-open-source control plane, software-defined storage and cloud-native compute infrastructure solutions. We’ve even built a couple of runbooks that should get you started in both cloud and on-premises contexts.
There are many benefits to adopting the cloud-native approach to building a data lake:
While we do have some work to do until our Kyuubi integration is fully ready for business, you can already try it out – see our docs for the lowdown.
Another thing I’ve been itching to announce is our new Spark 4 beta image. This new beta image joins our collection of Spark 3 images – and whilst the beta image isn’t eligible for official support from Canonical, it gives you an easy way to try out the latest upstream Apache Spark 4 beta features today!
Some of the new features of Spark 4 include:
There are some cool new things there – well oriented towards advanced data management at whopping scale – so if you’d like to take them for a spin, head on over to our user docs to learn how to quickly set up our Charmed Spark solution for Apache Spark on Kubernetes.
You can freely access our Apache Spark 4 beta container image in GitHub Container Registry right here –
If you’d like to learn more about getting enterprise-grade support for Apache Spark from Canonical, contact us and we’ll be happy to jump on a call with you to discuss further, or you can browse our Charmed Spark product page if you prefer.