Doordash.engineering

How to Handle Kubernetes Health Checks

Here are our recommended steps to avoid similar issues: 1. Understand the different applications of the various Kubernetes probes. Ensure the entire department is aware of …

URL: https://doordash.engineering/2022/08/09/how-to-handle-kubernetes-health-checks/

Why Burnout is Hard to Detect

Burnout refers specifically to phenomena in the occupational context and should not be applied to describe experiences in other areas of life.” Burnout is extremely hard to …

Category:  Health Go Health

The 4 Principles to boost DoorDash's Experimentation by 1000%

How we scaled our experimentation 1000%. In order to scale our experiment capacity we focused on building an approach with the following four attributes: Operating with a fail …

Using Active Probing to Detect Infrastructure Failures

The complexity of a microservice architecture requires fine-grained observability to spot impending service and infrastructure failures, and quickly mitigate them. Recognizing this …

How DoorDash Defines Great Engineering Management

Engineering Excellence encompasses three subcategories: Quality, Velocity, and Efficiency. Quality refers to the craft of engineering, delivering software that is reliable, efficient, …

Tips for Building High-Quality Django Apps at Scale

One option is to try Django’s built-in squashing feature. Another option, which has worked well for us, is to just do this manually. Drop everything in the django_migrations table, …

Optimizing OpenTelemetry’s Span Processor for High Throughput …

Optimizing OpenTelemetry’s Span Processor for High Throughput and Low CPU Costs. Santosh Banda. When companies move to microservices, they need to address a new …

6 Best Practices to Manage Pull Request Creation and Feedback

Write descriptive and consistent names. Create a clear PR title and description. Keep PRs short (same applies to files and functions). Manage PR disagreements through direct …

Enforce Timeout: A DoorDash Reliability Methodology

Timeouts are oftentimes overlooked in the world of databases. A simple explanation is that lots of developers are very optimistic and trying to live in an ideal world of infinite …

Failure Mitigation for Microservices: An Intro to Aperture

Cong Ma, Matt Ranney. When dealing with failures in a microservice system, localized mitigation mechanisms like load shedding and circuit breakers have always been used, …

6 Principles for Building a World Class TPM Team

The advantages of a centralized TPM team include: Centralized TPM teams create synergy which leads to best practices, shared learnings, and a rapid way to pinpoint answers to …

How DoorDash Manages Mobile Releases

In that spirit - and with a six-year background in the release processes for both the DoorDash and Caviar consumer apps - here is a high-level overview of DoorDash’s …

Addressing the Challenges of Sample Ratio Mismatch in A/B Testing

One of these challenges is sample ratio mismatch, or SRM. SRM represents one of the most egregious data quality issues in A/B tests because it fundamentally compromises the …

Switchback Tests and Randomized Experimentation Under …

To A/B or not to A/B, that is the question. Overview. On the Dispatch team at DoorDash, we use simulation, empirical observation, and experimentation to make progress towards …

Using CloudFront Signed URLs with Built-In S3 Authenticated Access

Step six: Generate a key pair. Signed URLs will secure our endpoint. Cryptographic signatures are usually done using a private key to “sign” some data; any holder of the …

Scaling DoorDash’s Geospatial Innovation with a Location-Based …

DoorDash’s Geo team built a delivery simulator to automate a formerly manual process of testing new location-based logic on our platform. This new tool, which lets us simulate …

Improving Web Page Performance Server-Side Rendering with …

Results. Migrating our pages to Next.js achieved +12% and +15% page load time improvements on Home and Store. LCP (one of Google’s core speed metrics) has …

Tackling technical challenges to build a global logistics platform

There are a variety of novel challenges involved when a growing tech company wants to expand quickly and efficiently into new markets. This was very much true at DoorDash, …

Improving ETA Prediction Accuracy for Long-tail Events

Figure 1: When the ETA time that customers see before making an order ends up being wrong, it hurts the customer experience and degrades trust in our platform. To solve this …
