kyc_aml · finance · workflow

Adyen AI Applied Research Engineering: Integrity Risk Agents, Uplift, and Data Analysis

Traditional Integrity Risk operations required large human teams to manually research, analyze, and label cases; companies broadly struggle to apply AI meaningfully in critical large-scale production flows; and current SOTA AI agents achieve only around 16% accuracy on complex multi-step financial data reasoning tasks.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · GNN surfaces Integrity Risk cases

ML models, like Graph Neural Networks, surface cases to human analysts.

Tools used

Graph Neural NetworksLLMsPPODABStepAdyen UpliftGenAI platformdenoising autoencodersTransformer-based architecturesconference companionscode companions

Outcome

Adyen has deployed Uplift across every transaction on the platform and co-built the DABStep benchmark with Hugging Face using over 450 real-world tasks; ongoing AI agent research for Integrity Risk and foundational models carries potential for substantial productivity gains.

What failed first

Pre-LLM ML models surface Integrity Risk cases to human analysts but the framework still requires a human team to spend time on tasks that have the potential to be automated; current SOTA reasoning agents score only around 16% on Adyen's multi-step data analysis benchmark.

Results

Volumearound 16%

Cost replacedmore than USD 1.4 trillion

Source

https://www.adyen.com/knowledge-hub/unlocking-value-through-ai-applied-research-engineering

How we source this →

Grounding & classification

Source type: technical build writeup

40 fields verified against source quotes, 1 dropped as unverifiable.

agentic workflowai agentanomaly detectionfraud detectionknowledge searchpredictive analyticsknowledge basefailure mode describedhuman review describedmetric backednamed customerproduction runtime claimedtools describedworkflow describedbankingfinancial servicesautomation rateemployee productivitytechnical build writeupcompliance monitoringfinance opskyc amlagentic task executionextract classify routehuman review queue