data_entry_ops · services · workflow

Kantar Worldpanel uses Databricks and GPT-4 to generate 120,000 training pairs at 94% accuracy

Kantar Worldpanel's legacy systems were inflexible, resource-intensive, and required an outdated programming skillset, blocking their ability to experiment with modern AI/ML models and scale their data insights business.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · Receipt-to-barcode matching need

Pairing receipt line descriptions and product barcode names is the upstream use case that starts the workflow.

Tools used

DatabricksMLflowMosaic AI Vector SearchUnity CatalogDatabricks MarketplaceLlamaMistralGPT-4GPT-3.5

Outcome

Kantar Worldpanel automatically generated a training dataset of about 120,000 pairs of receipt descriptions and barcode names at 94% accuracy in a couple of hours, freeing manual coding teams and engineering resources for higher-value work.

Results

Time savedin just a couple of hours

Volume94%

Cost replacedmore cost-effective but more performant

Source

https://www.databricks.com/customers/kantar-genai

How we source this →

Grounding & classification

Source type: vendor customer story

34 fields verified against source quotes.

content generationdata extractionpredictive analyticsproduct catalogreceipthuman review describedmetric backednamed customerproduction runtime claimedsource backedtools describedworkflow describedprofessional servicesaccuracy improvementcost reductionemployee productivitytime savedvendor customer storyback office opsdata entry opsdocument to record