#cloud-computing

9 posts

24 Jun

Nash Vincent 24 Jun 2026 8 min read

How CRED made disaster recovery drills push-button simple

Our members trust CRED with their money. They rely on us to clear bills, manage credit, and track their financial lives. This means that if we go down, someone’s EMI is late, someone’s credit score takes a hit, someone’s financial plan falls apart. With over 1,000 microservices and multiple regulated business lines, resilience is the entire product. Most companies treat…

infrastructure-as-codecloud-computing site-reliability-engineerci-cd-pipelineplatform-engineering

28 May

Shaurya Kethireddy 28 May 2026 14 min read

Slack AI: The Path to Multi-Cloud

Slack

In early 2023, Slack faced a foundational challenge: serving Large Language Models (LLMs) at enterprise scale with the security, reliability, and performance our customers expect. Over three years, we evolved from basic infrastructure to orchestrating a sophisticated multi-cloud architecture. We didn’t just want shiny new models; we needed a system resilient to regional outages and…

uncategorized aws backend cloud-computing collaboration

15 May

Phoebe Sajor 15 May 2026 1 min read

No Dumb Questions: What is cloud computing and why is everyone doing it? ‌ ‍ ‍‍‌‍ ‌ ‍‌‍‍‌‌‍‌ ‌‍‍‌‌‍ ‍‍‍ ‍‍‍‍‌ ‌‍‌‌‍ ‍‌‍‍‌‌ ‌‌ ‍‌‍ ‍‌‍‍‌‌‍ ‍‍‍ ‍‍‌‍‍‌ ‍‌‍‌‌‌‍‌‍‍‍ ‍‍‍‍‌‍‍‌ ‌‌ ‌‌ ‌ ‍‍‍ ‍ ‌‍ ‌‍ ‌‌ ‍ ‍‌ ‌ ‌‌‍‌‌‍ ‌‍‍ ‌‍ ‌ ‌‍‌‍‌‌‌ ‍‌‍‌‍‌‍ ‌‍ ‌ ‌ ‍ ‍‌‍ ‌‍ ‍ ‌‍‍‌‌‍ ‍‌ ‌‌‍‌‌‌‍ ‍‌ ‌‍ ‌‍‌‌‌‍‌‌‍‍‌‌ ‌‍ ‌‍ ‌‌‍ ‌‍‌‌‍‌‌ ‌‌ ‌ ‍‌‍‌‌‌ ‌‍‌‌‌‍ ‍‌ ‌‌‍‌‌ ‌‌‍‍‌‌‍ ‌‍ ‍ ‍ ‌‍‍‌‌‍‌ ‌ ‌‌‌‍‌‌‌‍‌‌‍‌ ‌ ‌ ‌‍ ‌ ‌‍‌‍‍ ‌ ‌ ‍‌‍‌ ‍‍ ‌ ‍‌‍‌ ‍‌‍‌‍‍ ‌ ‍ ‌‌‍ ‍‌ ‌ ‍‌ ‍‌‍‌ ‌ ‌‍‌‌‍‌ ‍ ‌ ‌‌ ‍‌‌ ‌‍‌‌ ‌‌‍‍‌‍ ‌‍ ‌‍‌ ‌‌‌‍ ‌ ‌ ‌ ‍ ‌ ‌‍‌‌ ‌‌‍‍ ‌‌ ‌‌‍‍‌‌ ‌‌‍ ‌‍‌‌ ‌‍‍‌‍‌‌ ‌‍‌‌‌‌‌‌‌ ‍‌‍ ‌‌‍‍‌ ‌‌ ‌‌ ‌ ‍‌‌ ‌‌‍‌‌ ‍‌‌‍‍‌‌ ‍‌‌‍‌‍ ‌‍ ‌‌ ‍ ‍‌ ‌ ‌‌‍‌‌‍ ‌‍‍ ‌‍ ‌ ‌‍‌‍‌‌‌ ‍‌‍‌‍‌‍ ‌‍ ‌ ‌ ‍ ‍‌‍ ‌‍ ‍‌‍‌‍‍‌‌‍‌ ‌ ‌‌‌‍‌‌‌‍‌‌‍‌ ‌ ‌ ‌‍ ‌ ‌‍‌‍‍ ‌ ‌ ‍‌‍‌ ‍‍ ‌ ‍‌‍‌ ‍‌‍‌‍‍ ‌ ‍ ‌‌‍ ‍‌ ‌ ‍‌ ‍‌‍‌ ‌ ‌‍‌‌‍‌‍‌‍‌ ‌‌ ‍‌‌ ‌‍‌‌ ‌‌‍‍‌‍ ‌‍ ‌‍‌ ‌‌‌‍ ‌ ‌ ‌‍‌‍‌ ‌‍‌‌ ‌‌‍‍ ‌‌ ‌‌‍‍‌‌ ‌‌‍ ‌‍‌‌‍‌‍‌ ‌‍‌‌‌ ‍‌ ‌ ‌‍‌‌‌‍ ‌ ‌‌‍‍‌‌ ‌‍‌‍‌‌ ‌‌ ‌ ‌‌‌‍‍‌‍ ‌‍‍‌‌ ‌‍‍‌‍‌‌‌‍‌‍‍‌ ‌

Stack Overflow

In this No Dumb Questions, Phoebe is joined by Stack Overflow’s tech lead for the infrastructure team, Josh Zhang, to learn about the cloud, compute, and data centers. ‌ ‍ ‍‍‌‍ ‌ ‍‌‍‍‌‌‍‌ ‌‍‍‌‌‍ ‍‍‍ ‍‍‍‍‌ ‌‍‌‌‍ ‍‌‍‍‌‌ ‌‌ ‍‌‍ ‍‌‍‍‌‌‍ ‍‍‍ ‍‍‌‍‍‌ ‍‌‍‌‌‌‍‌‍‍‍ ‍‍‍‍‌‍‍‌ ‌‌ ‌‌ ‌ ‍‍‍ ‍ ‌‍ ‌‍ ‌‌ ‍ ‍‌ …

no-dumb-questions cloud cloud-computingcloud-nativecpu

21 Feb 2024

Ilay Chen 21 Feb 2024 12 min read

Leveraging Spark 3 and NVIDIA’s GPUs to Reduce Cloud Cost by up to 70% for Big Data Pipelines

Paypal

By Ilay Chen and Tomer Akirav At PayPal, hundreds of thousands of Apache Spark jobs run on an hourly basis, processing petabytes of data and requiring a high volume of resources. To handle the growth of machine learning solutions, PayPal requires scalable environments, cost awareness and constant innovation. This blog explains how Apache Spark 3 and GPUs can help enterprises…

cloud-computing gpu big-data machine-learning apache-spark

12 Dec 2023

Archie Gunasekara 12 Dec 2023 10 min read

Our Journey Migrating to AWS IMDSv2

Slack

We are heavy users of Amazon Compute Compute Cloud (EC2) at Slack — we run approximately 60,000 EC2 instances across 17 AWS regions while operating hundreds of AWS accounts. A multitude of teams own and manage our various instances. The Instance Metadata Service (IMDS) is an on-instance component that can be used to gain an…

uncategorized aws cloud-computing infrastructure security

21 Mar 2023

Tricia Bogen 21 Mar 2023 9 min read

Technology Lifecycle

Slack

This blog post discusses the strategies that Slack uses to manage the lifecycle (development, support, and eventual retirement) of infrastructure projects, through the lens of the migration through three successive internal “platform” offerings. Our challenges Circa 2020, our Cloud Engineering team (now evolved into multiple teams responsible for narrower aspects) was responsible for managing our…

uncategorized cloud-computing collaboration devops infrastructure

9 Mar 2022

Javier Turegano 9 Mar 2022 10 min read

Applying Product Thinking to Slack’s Internal Compute Platform

Slack

According to a recent Thoughtworks radar, “the industry is increasingly gaining experience with platform engineering product teams that create and support internal platforms.” They caveated this with a piece of advice: “When creating a platform, it’s critical to have clearly defined customers and products that will benefit from it rather than building in a vacuum.”…

uncategorized cloud-computing infrastructureproduct-management

20 Oct 2021

Archie Gunasekara 20 Oct 2021 10 min read

Building the Next Evolution of Cloud Networks at Slack – A Retrospective

Slack

About a year ago, I wrote a blog post called Building the Next Evolution of Cloud Networks at Slack. In it, we discussed how Slack’s AWS infrastructure has evolved over the years and the pain points that drove us to spin up a brand-new network architecture redesign project called Whitecastle. If you have not had…

uncategorized aws cloud-computing devops infrastructure

17 Feb 2019

Jeff Atwood 17 Feb 2019 5 min read

The Cloud Is Just Someone Else’s Computer

Jeff Atwood

When we started Discourse in 2013, our server requirements were high: 1GB RAM modern, fast dual core CPU speedy solid state drive with 20+ GB I’m not talking about a cheapo shared cpanel server, either, I mean a dedicated virtual private server with those specifications. We

cloud computingdiscourse