data_entry_ops · media · workflow

How Spotify Generated Millions of ML Content Annotations Using a Scalable Annotation Platform

Spotify's ML teams needed high-quality annotations at massive scale — covering hundreds of millions of tracks and podcast episodes — but the manual annotation process was inefficient, disconnected, and lacked the right context for engineers and domain experts to operate effectively.

How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.
Stage 1 · Annotation case sampling
Scripts sample predictions from ML classification projects to produce annotation cases for operator review.
Tools used
LLM
Outcome

The annotation platform increased the annotation corpus by 10 times and achieved three times the improvement in annotator productivity, while significantly reducing the time it takes to develop new ML models.

Results
Time savedsignificantly reduce the time it takes to develop new ML models
Volume10 times
Source

https://engineering.atspotify.com/2024/10/how-we-generated-millions-of-content-annotations/

How we source this →

Grounding & classification
Source type: technical build writeup
25 fields verified against source quotes.
anomaly detectioncontent generationdocument classificationbuilder submittedhuman review describedmetric backednamed customerproduction runtime claimedtools describedworkflow describedmediasoftwareemployee productivitythroughput increasetime savedtechnical build writeupdata entry opsquality assuranceai draft human approvalescalation workflowhuman review queue