data_entry_ops · media · workflow

How Spotify Generated Millions of ML Content Annotations Using a Scalable Annotation Platform

Spotify's ML teams needed high-quality annotations at massive scale — covering hundreds of millions of tracks and podcast episodes — but the manual annotation process was inefficient, disconnected, and lacked the right context for engineers and domain experts to operate effectively.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · Annotation case sampling

Scripts sample predictions from ML classification projects to produce annotation cases for operator review.

Tools used

LLM

Outcome

The annotation platform increased the annotation corpus by 10 times and achieved three times the improvement in annotator productivity, while significantly reducing the time it takes to develop new ML models.

Results

Time savedsignificantly reduce the time it takes to develop new ML models

Volume10 times

Source

https://engineering.atspotify.com/2024/10/how-we-generated-millions-of-content-annotations/

How we source this →

Grounding & classification

Source type: technical build writeup

25 fields verified against source quotes.

anomaly detectioncontent generationdocument classificationbuilder submittedhuman review describedmetric backednamed customerproduction runtime claimedtools describedworkflow describedmediasoftwareemployee productivitythroughput increasetime savedtechnical build writeupdata entry opsquality assuranceai draft human approvalescalation workflowhuman review queue