quality_assurance · saas · workflow

Portola empowers nontechnical domain experts to ship prompt improvements 4x faster using Braintrust

Building a trustworthy AI companion required deep domain expertise in psychology, storytelling, and conversation design—nuances that could not be captured by automated evals alone. Prompt changes required coordination between subject matter experts and engineers, creating bottlenecks that slowed iteration.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · Daily chat log review

Lily Doyle, their behavioral researcher, spends about an hour each day reading through chat logs in Braintrust, looking for patterns in conversation quality.

Tools used

BraintrustPlaygroundsLoop

Outcome

Nontechnical subject matter experts own the full cycle from problem identification to production deployment, resulting in a 4x improvement in iteration velocity and 4x the number of weekly prompt iterations.

Results

Time saved4x the number of weekly prompt iterations

Volume4x improvement

Source

https://www.braintrust.dev/blog/portola

How we source this →

Grounding & classification

Source type: vendor customer story

23 fields verified against source quotes.

conversational airagvoice aichat transcriptfailure mode describedhuman review describedmetric backednamed customerproduction runtime claimedsource backedtools describedworkflow describedsoftwarecycle time reductionemployee productivityvendor customer storyquality assuranceai draft human approvalhuman review queue