marketing_ops · saas · workflow
Cloudinary makes video production 70% faster with Descript's AI-powered editing
Cloudinary's Customer Education team spent excessive time on video editing due to inaccurate transcription of technical terminology, tedious filler-word removal, and inconsistent audio quality from remotely recorded experts, slowing production to a crawl.
How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases and not tied to any one vendor's stack. Specific products that implement these stages appear in “Tools commonly seen” below.
Stage 1 · Expert records live session
In-house experts record training sessions live and hand them off to the Customer Education team for editing.
Tools used
Descript · Studio Sound · transcription glossary
Outcome
Cloudinary reduced video production time by 70%—from 13 hours per episode to 4 hours—and increased podcast output from one episode every few months to two per month, at the same cost with the same team.
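The headline figure can be checked against the per-episode hours quoted above. The sketch below is only an arithmetic check; the function names are illustrative and not taken from the source.

```python
def reduction_pct(before_hours: float, after_hours: float) -> float:
    """Share of per-episode production time saved, as a percentage."""
    return (before_hours - after_hours) / before_hours * 100


def speedup(before_hours: float, after_hours: float) -> float:
    """How many episodes now fit in the time one episode used to take."""
    return before_hours / after_hours


saved = reduction_pct(13, 4)
factor = speedup(13, 4)
print(f"{saved:.1f}% less time per episode, {factor:.2f}x episodes per editing hour")
# prints: 69.2% less time per episode, 3.25x episodes per editing hour
```

Going from 13 hours to 4 hours is a 69.2% reduction, which the source rounds to 70%.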
What failed first
Neither human nor AI transcription services could accurately handle Cloudinary's domain-specific vocabulary, including technical terms, proprietary lingo, and acronyms, especially from non-native English speakers, and correcting the resulting errors took hours.
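The source lists a transcription glossary among the tools but does not describe how it works. As a rough illustration of the general idea only, and not Descript's implementation or API, the sketch below runs a post-processing pass that maps known mis-transcriptions back to canonical terms. The glossary entries and the function name `apply_glossary` are hypothetical.

```python
import re

# Hypothetical glossary: canonical term -> mis-transcriptions seen in drafts.
# These entries are invented for illustration, not taken from the source.
GLOSSARY = {
    "Cloudinary": ["cloud inary", "cloudy nary"],
    "CDN": ["cdn"],
}


def apply_glossary(text: str, glossary: dict[str, list[str]]) -> str:
    """Replace each known variant with its canonical term, ignoring case."""
    for canonical, variants in glossary.items():
        for variant in variants:
            # Word boundaries keep a variant from matching inside a longer word.
            pattern = re.compile(r"\b" + re.escape(variant) + r"\b", re.IGNORECASE)
            # A function replacement inserts the canonical term literally.
            text = pattern.sub(lambda _match, term=canonical: term, text)
    return text


draft = "Upload to cloud inary, then the cdn caches it."
print(apply_glossary(draft, GLOSSARY))
# prints: Upload to Cloudinary, then the CDN caches it.
```

A lookup like this only corrects errors someone has already seen and recorded, so it complements human review of the transcript and does not replace it.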
Results
Volume: 70% faster
Cost replaced: crystal-clear audio even when experts have bad mics or setups
Grounding & classification
Source type: vendor customer story
23 fields verified against source quotes, 1 dropped as unverifiable.
content generation · speech to text · meeting recording · failure mode described · human review described · metric backed · named customer · production runtime claimed · source backed · tools described · workflow described · software · employee productivity · throughput increase · time saved · vendor customer story · marketing ops · ai draft human approval