ecommerce_ops · ecommerce · workflow

Walmart achieves ~50% tail query cache hit rate with semantic caching and generative AI in e-commerce search

Traditional exact-match caching fell short for the nuances of human language in e-commerce search queries, and integrating generative AI introduced significant latency and cost challenges.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · Customer submits search query

A customer submits an intent-based search query to the e-commerce search system.

Tools used

LLMs

Outcome

Semantic caching achieved a tail query cache hit rate close to 50%, far exceeding the anticipated 10-20%, and the system can return contextually relevant product groupings for intent-based queries.

What failed first

Traditional exact-match caching could not handle the varied ways customers phrase search queries.

Results

Volumeclose to 50%

Source

https://portkey.ai/blog/transforming-e-commerce-search-with-generative-ai-insights-from-walmarts-journey/

How we source this →

Grounding & classification

Source type: vendor customer story

15 fields verified against source quotes.

enterprise searchrecommendation systemproduct catalogmetric backednamed customerproduction runtime claimedworkflow describedecommerceretailautomation rateresponse time reductionvendor customer storyecommerce ops