back_office_ops · workflow

Dropbox improves ML-powered content suggestions, increasing clicked-suggestion sessions by over 50%

Dropbox's content suggestion system had to handle multiple disparate content types—files, folders, Google Docs, Microsoft Office documents, and Dropbox Paper—each stored in different systems with different metadata, making a unified ML pipeline infeasible.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · User event signals collected

User interactions with files and folders over the past 90 days are collected as candidate input signals for the model.

Tools used

dbxlearnchar-RNNBayesian OptimizationGoogle DocsMicrosoft Office 365

Outcome

Dropbox increased the percentage of user sessions with at least one clicked suggestion by more than 50%, validating the combined multi-model approach via online A/B tests before rolling out at scale.

What failed first

Initial approaches were insufficient: one-hot encoding of file extensions produced high-dimensional sparse vectors, a bag-of-characters model for filenames failed to capture semantic meaning, and naive grid search for hyperparameter tuning was costly and ineffective.

Results

Volumemore than 50%

Running sinceApril

Source

https://dropbox.tech/machine-learning/using-machine-learning-to-predict-what-file-you-need-next-part-2

How we source this →

Grounding & classification

Source type: technical build writeup

20 fields verified against source quotes.

personalizationpredictive analyticsrecommendation systemknowledge basefailure mode describedmetric backednamed customerproduction runtime claimedtools describedworkflow describedsoftwareconversion increaseemployee productivitytechnical build writeupback office opsextract classify route