Pinterest improves search relevance using an LLM-based teacher-student distillation pipeline

Pinterest Search relied on engagement signals for ranking, but needed a genuine relevance model to ensure that displayed content was pertinent to the user's query rather than driven by past behaviour. The system also lacked coverage for multilingual queries and for new seasonal concepts absent from the limited human-annotated data.

How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases and not tied to any one vendor's stack. Specific products that implement these stages appear under “Tools used” below.
Stage 1 · User search query received
Users submit queries on Pinterest Search to discover content that aligns with their information needs.
Tools used
Llama-3-8B, BLIP, qLoRA, BM25, mDeBERTa-V3-base, XLM-RoBERTa-large, multilingual BERT-base, T5-base
Outcome

The LLM-based relevance pipeline improved search feed relevance by 2.18%, measured by nDCG@20. Online A/B experiments showed improvements of more than 1% in search feed relevance and more than 1.5% in search fulfillment rates. The multilingual teacher also generalised to languages not seen during training.
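
For readers unfamiliar with the metric, nDCG@20 can be sketched as follows. This is a minimal illustration, not the source's evaluation code: it assumes the common exponential gain (2^rel - 1) with a log2 position discount, and the function names and graded labels are hypothetical.

```python
import math
from typing import Sequence


def dcg_at_k(relevances: Sequence[float], k: int) -> float:
    """Discounted cumulative gain over the top-k results, in ranked order."""
    # Assumption: exponential gain and log2 discount, a widely used variant.
    return sum(
        (2 ** rel - 1) / math.log2(rank + 2)
        for rank, rel in enumerate(relevances[:k])
    )


def ndcg_at_k(relevances: Sequence[float], k: int = 20) -> float:
    """DCG of the actual ranking divided by the DCG of the ideal ranking."""
    ideal_dcg = dcg_at_k(sorted(relevances, reverse=True), k)
    if ideal_dcg == 0:
        return 0.0  # no relevant results at all, so the score is defined as 0
    return dcg_at_k(relevances, k) / ideal_dcg


# Hypothetical graded labels for one query's results, in the order shown.
perfect = ndcg_at_k([3, 2, 1, 0])
swapped = ndcg_at_k([0, 1], k=2)
```

A ranking that already places the most relevant items first scores 1.0, and any misordering lowers the score, so a relative gain in nDCG@20 means relevant results moved closer to the top of the first 20 positions.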

What failed first

The LLM-based cross-encoder teacher model was effective at predicting relevance but could not be deployed directly for real-time serving due to latency and cost constraints.
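
The usual way around this constraint is teacher-student distillation: the expensive teacher scores query-result pairs offline, and a lightweight student is trained to reproduce those scores. The sketch below shows one common form of the training objective, a cross-entropy between the teacher's soft relevance distribution and the student's prediction. It is an assumption for illustration only; the source does not specify the loss, and all names here are hypothetical.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over relevance-class logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(
    student_logits: Sequence[float], teacher_probs: Sequence[float]
) -> float:
    """Cross-entropy of the student's distribution against teacher soft labels."""
    student_probs = softmax(student_logits)
    # Clamp to avoid log(0) when the student assigns near-zero probability.
    return -sum(
        t * math.log(max(s, 1e-12))
        for t, s in zip(teacher_probs, student_probs)
    )


# Hypothetical teacher output for one query-result pair over three relevance classes.
teacher_probs = softmax([2.0, 0.5, -1.0])
loss_agreeing = distillation_loss([2.0, 0.5, -1.0], teacher_probs)
loss_disagreeing = distillation_loss([-1.0, 0.5, 2.0], teacher_probs)
```

The loss is smallest when the student's distribution matches the teacher's, so minimising it over many teacher-labelled pairs pushes the cheap student toward the teacher's judgments while keeping serving latency low.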

Results
Volume: +2.18%
Source

https://medium.com/pinterest-engineering/improving-pinterest-search-relevance-using-large-language-models-4cd938d4e892

Grounding & classification
Source type: technical build writeup
34 fields verified against source quotes, 2 dropped as unverifiable.
document classification, knowledge search, predictive analytics, knowledge base, social media post, builder submitted, failure mode described, human review described, metric backed, named customer, production runtime claimed, source backed, tools described, workflow described, media, accuracy improvement, conversion increase, technical build writeup, ecommerce ops, quality assurance, extract classify route