Pinterest improves search relevance using an LLM-based teacher-student distillation pipeline

Pinterest Search relied on engagement signals for ranking, but needed a genuine relevance model to ensure displayed content was pertinent to user queries rather than driven by past behaviour. The system also lacked coverage for multilingual queries and seasonal new concepts not found in limited human-annotated data.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases and not tied to any one vendor's stack. Specific products that implement these stages appear under “Tools used” below.

Stage 1 · User search query received

Users submit queries on Pinterest Search to discover content that aligns with their information needs.

Tools used

Llama-3-8B, BLIP, qLoRA, BM25, mDeBERTa-V3-base, XLM-RoBERTa-large, multilingual BERT-base, T5-base

Outcome

The LLM-based relevance pipeline achieved a +2.18% improvement in search feed relevance measured by nDCG@20, and online A/B experiments showed improvements of more than 1% in search feed relevance and more than 1.5% in search fulfillment rates. The multilingual teacher also generalised to languages not seen during training.
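For readers unfamiliar with the metric, nDCG@20 compares the discounted gain of the top 20 results as served against the gain of the best possible ordering of the same results. The sketch below is a minimal illustration, not Pinterest's evaluation code: the function names, the linear gain (some implementations use 2^rel - 1 instead), and the sample labels are all assumptions made for this example.

```python
import math
from typing import Sequence


def dcg_at_k(relevances: Sequence[float], k: int) -> float:
    """Discounted cumulative gain over the top-k results (linear gain, assumed)."""
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances[:k]))


def ndcg_at_k(relevances: Sequence[float], k: int = 20) -> float:
    """DCG of the ranking as served, divided by the DCG of the ideal ordering."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


# Hypothetical graded labels (higher = more relevant), in the order shown to the user.
served = [3, 2, 3, 0, 1, 2]
print(round(ndcg_at_k(served, k=20), 4))
```

A score of 1.0 means the served order already matches the ideal order; moving relevant results lower in the list reduces the score.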

What failed first

The LLM-based cross-encoder teacher model was effective at predicting relevance but could not be deployed directly for real-time serving due to latency and cost constraints.
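The usual response to this constraint is distillation: the expensive teacher labels query-result pairs offline, and a cheaper student is trained to reproduce those labels. The sketch below shows one common form of the training objective, cross-entropy between the teacher's soft labels and the student's predicted distribution. It is an illustration under stated assumptions, not the source's implementation: the three relevance levels, the logit values, and the names `softmax` and `distillation_loss` are hypothetical.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits: Sequence[float],
                      teacher_probs: Sequence[float]) -> float:
    """Cross-entropy of the student's distribution against teacher soft labels."""
    student_probs = softmax(student_logits)
    # Clamp to avoid log(0) if a student probability underflows.
    return -sum(t * math.log(max(s, 1e-12))
                for t, s in zip(teacher_probs, student_probs) if t > 0)


# Hypothetical teacher output over three relevance levels (assumed, not from the source).
teacher = [0.05, 0.15, 0.80]
print(distillation_loss([0.0, 0.0, 5.0], teacher))  # student agrees with the teacher
print(distillation_loss([5.0, 0.0, 0.0], teacher))  # student disagrees: larger loss
```

Because the teacher only runs offline in this setup, its latency and cost affect label generation rather than real-time serving.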

Results

Volume: +2.18%

Source

https://medium.com/pinterest-engineering/improving-pinterest-search-relevance-using-large-language-models-4cd938d4e892

Grounding & classification

Source type: technical build writeup

34 fields verified against source quotes, 2 dropped as unverifiable.

document classification, knowledge search, predictive analytics, knowledge base, social media post, builder submitted, failure mode described, human review described, metric backed, named customer, production runtime claimed, source backed, tools described, workflow described, media, accuracy improvement, conversion increase, technical build writeup, ecommerce ops, quality assurance, extract classify route