compliance_monitoring · media · workflow

How Roblox Uses AI to Moderate Content on a Massive Scale

Roblox's user-generated platform grew in both scale and speed far beyond what human moderators could handle alone, requiring scalable AI infrastructure to moderate billions of pieces of content in real time across dozens of languages.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · Content submitted by users

Users send chat messages, voice communications, and upload assets to the platform every day.

Tools used

transformer-based text filterPII filtervoice safety classifierlarge language models (LLMs)

Outcome

AI moderation now handles over 750,000 text filter RPS and 370,000 PII filter RPS at peak. The PII filter reduced false positives by 30% and increased automatic PII detection by 25%. The voice safety classifier achieves 92% higher recall than the initial version at a 1% false positive rate. Real-time feedback interventions reduced filtered chat messages by 5% and abuse-report consequences by 6%.

What failed first

An earlier rules-based text filter and CPU-based serving infrastructure could not keep pace with the volume and speed demands of the platform as it scaled.

Results

Time saved1.1 million hours

Volume6.1 billion

Running sinceapproximately five years

Source

https://corp.roblox.com/newsroom/2025/07/roblox-ai-moderation-massive-scale

How we source this →

Grounding & classification

Source type: technical build writeup

44 fields verified against source quotes, 2 dropped as unverifiable.

anomaly detectioncomputer visioncontent generationquality inspectioncall recordingchat transcriptfailure mode describedhuman review describedmetric backednamed customerproduction runtime claimedtools describedworkflow describedmediaaccuracy improvementautomation rateerror reductionthroughput increasetechnical build writeupcompliance monitoringquality assuranceescalation workflowextract classify routemonitor detect alert