How GitLab's Red Team automates C2 testing
XXIIVV — permacomputing
Figma’s infrastructure: What goes into powering a web-based design tool | Figma Blog
Scaling infrastructure at a fast-growth startup
GitHub - chaance/state-machine-from-scratch: Building a state machine and a custom transition hook for your React app 🤖
Building a state machine and a custom transition hook for your React app 🤖
Using the OpenAI platform to analyse automated test failures
A look at how to develop a Nightwatch.js plugin that sends test failures and their associated errors to a service integrated with the OpenAI platform, which analyses the errors and returns actionable feedback
Create your first containerized Azure Functions
Get started by deploying your first function app from a Linux image in a container registry to Azure Functions.
Forget your Microservices! The Unparalleled Benefits of Pool Architecture.
When monoliths are the solution to your scaling challenges.
Debugging network stalls on Kubernetes
In this deep-dive, we identified and worked through sporadic latency issues with services running on Kubernetes in our environment.
Life of a Netflix Partner Engineer — The case of extra 40 ms
By: John Blair, Netflix Partner Engineering
Online migrations at scale
Online payment processing for internet businesses. Stripe is a suite of payment APIs that powers commerce for businesses of all sizes.
Open-sourcing a 10x reduction in Apache Cassandra tail latency
At Instagram, we have one of the world’s largest deployments of the Apache Cassandra database. We began using Cassandra in 2012 to replace…
Finding Kafka’s throughput limit in Dropbox infrastructure
The growing pains of database architecture | Figma Blog
How the Figma infrastructure team reduced potential instability by scaling to multiple databases
Sharding & IDs at Instagram
With more than 25 photos and 90 likes every second, we store a lot of data here at Instagram. To make sure all of our important data fits…
Sharding Pinterest: How we scaled our MySQL fleet
This is a technical dive into how we split our data across many MySQL servers. We finished launching this sharding approach in early 2012, and it’s still the system we use today to store our core…
Scaling Kubernetes to 7,500 nodes
We’ve scaled Kubernetes clusters to 7,500 nodes, producing a scalable infrastructure for large models like GPT-3, CLIP, and DALL·E, but also for rapid small-scale iterative research such as Scaling Laws for Neural Language Models.
Rebuilding our tech stack for the new Facebook.com
Facebook.com launched in 2004 as a simple, server-rendered PHP website. Over time, we’ve added layer upon layer of new technology to deliver more interactive features. Each of these new features an…
TinaCMS is Now Fully Open-Source
An open-source CMS backed by Git
Converting stateful application to stateless using AWS services | Amazon Web Services
Designing a system to be either stateful or stateless is an important choice with tradeoffs regarding its performance and scalability. In a stateful system, data from one session is carried over to the next. A stateless system doesn’t preserve data between sessions and depends on external entities such as databases or cache to manage state. […]
Stop Paying $3.5/million requests for AWS API Gateway v1 or v2. Use Lambda Function URL instead
For businesses and developers utilizing Amazon Web Services (AWS), API Gateway has been a go-to choice for creating and deploying…
What happens when databases crash?
Databases have tables and indexes stored in files. As you create rows, the database system writes to data pages in memory, which are then written to data files on disk. There is a problem though, what…
The Roots of Today's Modern Backend Engineering Practices
What accidentally taking down Amazon.com in 1997 taught Joshua Burgin, tech industry veteran and one of Amazon’s first 100 employees
How a 3rd Party API Can Ruin Your Weekend
Lessons from my longest production crisis
Introducing the Next Generation of AWS Amplify’s Fullstack Development Experience | Amazon Web Services
AWS Amplify just announced a public preview of a new code-first developer experience that empowers frontend developers to quickly build and deploy fullstack apps with their existing TypeScript or JavaScript skills. The first generation of the tooling offered a tooling-first experience, using a CLI/Console-based interactive workflow to create a backend. Gen 2 transitions to a […]
Serverless at Scale: Lessons From 200 Million Lambda Invocations
Serverless computing, with Lambda functions at the heart of it, has irrevocably changed the way we build and scale applications, more than…
Serverlesspresso
Serverlesspresso workshop
Building Serverlesspresso: Creating event-driven architectures
Serverlesspresso is an event-driven, serverless workload that uses Amazon EventBridge and AWS Step Functions to coordinate events across microservices and su...
The Architecture of Serverless Data Systems — Jack Vanlightly
I recently blogged about why I believe the future of cloud data services is large-scale and multi-tenant, citing, among others, S3. “Top tier SaaS services like S3 are able to deliver amazing simplicity, reliability, durability, scalability, and low price because their technologies are s
New for Amazon SQS – Update the AWS SDK to reduce latency | Amazon Web Services
With Amazon SQS, you can send and receive messages between software components at any scale. It was one of the first AWS services I used and as a Solutions Architect, I helped many customers take advantage of asynchronous communications using message queues. In fact, Amazon SQS has been generally available since July 2006 and, under […]
10 insights on real world container use | Datadog
Our latest report examines more than 2.4 billion containers run by tens of thousands of Datadog customers to understand the state of the container ecosystem.