trivago's explore-exploit ranking model balances known and unknown accommodation inventory
trivago must surface the 25 best accommodations from millions of options for each user search, but most inventory is never seen due to a cold-start problem, and a greedy exploitation-only baseline missed high-potential unexplored listings.
How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.
Stage 1 · User search triggers ranking
A user search triggers selection of the best 25 accommodations from thousands of matching options.
Tools used
Spark
Outcome
trivago deployed an exploration mechanism in production that increases exposure of high-quality unexplored inventory at no short-term revenue cost, with no significant shift in advertiser clickshares and a tunable lambda parameter for ongoing control.
What failed first
The baseline ranking feature used only the beta-binomial posterior mean, which was too greedy and only exploited well-known accommodations. The naive exploration approach could not distinguish among accommodations with identical low exposure, and some model-based variants produced negative exploration or worse quality than the naive approach.