Workflow · saas · workflow

How Meta animates AI-generated images at scale using latency optimization and traffic management

Meta's animate feature needed to serve billions of users with fast generation times and minimal errors while remaining resource efficient, and early testing revealed that end-to-end latency was higher than expected due to global traffic routing adding seconds to generation time.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · User requests animation

A user requests a short animation of a generated image through Meta AI.

Tools used

DPM-Solverbfloat16TorchScriptPyTorchpytorch.compileU-Net

Outcome

After applying all optimizations, Meta deployed the animate feature with high availability and a minimum failure rate, handling global traffic with latency at expected levels.

What failed first

Global traffic routing added seconds to end-to-end generation time, and after the traffic management system was added to fix that, the retry polling system began cascading under load spikes because the router had neither delay nor backoff.

Results

Volume15

Source

https://engineering.fb.com/2024/08/14/production-engineering/how-meta-animates-ai-generated-images-at-scale/

How we source this →

Grounding & classification

Source type: technical build writeup

26 fields verified against source quotes.

content generationbuilder submittedfailure mode describedmetric backednamed customerproduction runtime claimedtools describedworkflow describedmediasoftwarecycle time reductionerror reductionthroughput increasetechnical build writeup