back_office_ops · saas · workflow

How Amazon Bedrock Custom Model Import streamlined LLM deployment for Salesforce

Salesforce's AI platform team faced significant operational overhead deploying fine-tuned LLMs—spending months on instance selection, serving engine configuration, and GPU capacity reservations that were expensive, time-consuming, and difficult to maintain across frequent model releases.

How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.
Stage 1 · CI/CD saves model artifacts to S3
The CI/CD process saves model artifacts to an Amazon S3 model store bucket.
Tools used
Amazon Bedrock Custom Model ImportAmazon SageMaker InferenceAmazon S3AWS Identity and Access ManagementAmazon Bedrock Knowledge BasesAmazon Bedrock GuardrailsAmazon Bedrock AgentsLlamaQwenMistralvLLMTensorRT-LLMAmazon API GatewayAWS LambdaAgentforce
Outcome

Salesforce achieved a 30% reduction in model deployment time and up to 40% cost savings by adopting Amazon Bedrock Custom Model Import, while maintaining backward-compatibility with existing applications through a hybrid architecture using SageMaker proxy containers alongside Amazon Bedrock serverless inference.

Results
Time saved30%
Volume44%
Cost replacedup to 40%
Source

https://aws.amazon.com/blogs/machine-learning/how-amazon-bedrock-custom-model-import-streamlined-llm-deployment-for-salesforce?tag=soumet-20

How we source this →

Grounding & classification
Source type: technical build writeup
35 fields verified against source quotes.
agentic workflowbuilder submittedfailure mode describedmetric backednamed customerproduction runtime claimedtools describedvendor confirmedworkflow describedsoftwarecost reductioncycle time reductionemployee productivitytechnical build writeupback office ops