logistics_ops · workflow
DoorDash retrains ML demand prediction models to restore accuracy after COVID-19 disruption
COVID-19 broke the historical demand patterns DoorDash's ML models relied on, making predictions inaccurate at a time when getting supply-demand balance right had direct consequences for both customers and Dasher earnings.
How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.
Stage 1 · COVID-19 breaks demand patterns
COVID-19 restrictions caused demand to become higher and more volatile than ever before, making model retraining necessary.
Tools used
Apache AirflowDatabricksPandasLightGBM
Outcome
The retrained DPM reduced prediction variance from 7.9% to 2.8% and increased the share of predictions within the SLA percentile range from 67% to 95%, yielding a model that was much more accurate while just as precise.
What failed first
The old Demand Prediction Model began severely underpredicting demand once COVID-19 restrictions took effect. During retraining, dataset manipulation hypotheses — including downsampling pre-pandemic data and removing holiday training data — proved negligible or detrimental to model performance.
Results
Volume2.8%
Grounding & classification
Source type: technical build writeup
23 fields verified against source quotes, 1 dropped as unverifiable.
forecastingpredictive analyticsfailure mode describedmetric backednamed customerproduction runtime claimedsource backedtools describedworkflow describedecommercelogisticsaccuracy improvementerror reductiontechnical build writeuplogistics opssupply chain