compliance_monitoring · media · workflow

How Pinterest Built a Real-Time Radar for Violative Content Using AI

Pinterest's Trust & Safety teams could only measure policy-violating content prevalence through expensive human review studies run roughly every six months. User reports alone were incomplete, missing under-reported harms and lacking statistical power for rare categories.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · Daily impressions stream sampled

Images are sampled from the daily user impressions stream to start the prevalence measurement workflow.

Tools used

multimodal LLM

Outcome

The AI-assisted workflow enables daily prevalence measurement at 15x faster labeling turnaround and orders of magnitude lower operational cost than a human-only workflow, while preserving comparable decision quality and enabling continuous monitoring with real-time alerting.

What failed first

Human-only prevalence studies required at least two independent reviewers per item plus adjudication, were subject to instability, ran infrequently, and produced post-intervention comparisons that were slow and hard to trust.

Results

Time savedroughly every six months

Volume15x faster

Cost replacedorders of magnitude lower operational cost

Source

https://medium.com/pinterest-engineering/how-pinterest-built-a-real-time-radar-for-violative-content-using-ai-d5a108e02ac2

How we source this →

Grounding & classification

Source type: technical build writeup

26 fields verified against source quotes.

anomaly detectioncomputer visiondocument classificationsocial media posthuman review describedmetric backednamed customerproduction runtime claimedtools describedworkflow describedmediasoftwarecost reductionthroughput increasetime savedtechnical build writeupcompliance monitoringquality assuranceextract classify routemonitor detect alert