claims_processing · services · workflow

Ricoh builds a scalable intelligent document processing platform on AWS for healthcare clients

Ricoh's healthcare document processing relied on bespoke engineering per customer, with each deployment requiring unique development, custom prompt engineering, model fine-tuning, and integration testing that could not be reused—consuming 40–60 hours of developer time per customer setup. With an anticipated sevenfold increase in document volume, these bottlenecks limited expansion. Strict compliance requirements (HITRUST, HIPAA, SOC 2) further constrained AI innovation.

How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.
Stage 1 · Document ingestion via S3
Healthcare documents enter the system through Amazon S3, triggering event-driven processing workflows.
Tools used
Amazon BedrockAmazon TextractAWS LambdaAmazon S3Amazon DynamoDBAmazon SQSAmazon EventBridgeAWS KMSAmazon CloudWatchAWS CloudTrailAWS CloudFormationAWS SAMIDP-CommonOCR
Outcome

Ricoh reduced customer onboarding time by over 90% (from 4–6 weeks to 2–3 days) and cut engineering hours per deployment by over 90% (from ~80 hours to under 5 hours). Monthly throughput is projected to grow sevenfold to over 70,000 documents. The solution achieves 98–99% extraction accuracy while reducing manual review costs by 60–70% compared to fully manual processing, with annual savings exceeding 1,900 person-hours.

What failed first

A simple API call to Amazon Bedrock—sending a scanned document alongside a prompt—was insufficient for complex multi-part healthcare documents. Template-based extraction also proved ineffective given the high variability in layouts, naming conventions, and document structures across healthcare providers.

Results
Time saved4–6 weeks to 2–3 days, >90% reduction
Volume~80 hours to <5 hours, >90% reduction
Cost replaced60–70%
Source

https://aws.amazon.com/blogs/machine-learning/how-ricoh-built-a-scalable-intelligent-document-processing-solution-on-aws?tag=soumet-20

How we source this →

Grounding & classification
Source type: technical build writeup
53 fields verified against source quotes, 1 dropped as unverifiable.
agentic workflowdata extractiondocument classificationidpocrclinical noteinsurance claimmedical recordfailure mode describedhuman review describedmetric backednamed customerproduction runtime claimedsource backedtools describedworkflow describedhealthcareprofessional servicesaccuracy improvementautomation ratecost reductioncycle time reductionemployee productivitythroughput increasetechnical build writeupback office opsclaims processingmedical records processingdocument to recordextract classify routehuman review queue