September 15, 2025
8 minutes
Video generation AI has crossed a threshold: the outputs are convincing enough to deceive. A 5-second clip of a public figure saying something they never said. A fake news event that looks like drone footage. Synthetic "evidence" of crimes that never happened.
When images became trivially fakeable, we learned to be skeptical. Now video—the medium we've long trusted as "proof"—is next.
I evaluated three leading video generation systems (Runway Gen-3, Pika 2.0, and OpenAI Sora) to understand where they draw safety boundaries, how consistently those boundaries hold, and what risks slip through. The findings reveal an industry grappling with the same content moderation challenges that plagued image generation—but with higher stakes.
Images are single moments. Videos are narratives. They imply causality, sequence, and motion. That makes them more persuasive and more dangerous.
1. Deepfakes and Impersonation
2. Misinformation at Scale
3. Synthetic CSAM
4. Violence and Harm
5. Intellectual Property
The bar for what constitutes "dangerous output" is higher for video. A single compelling fake can cause geopolitical incidents.
Test Suite Design:
I used similar categories as my image generation testing, adapted for video-specific risks:
Approach:
Safety Approach: Conservative with focus on brand safety
Strengths:
Weaknesses:
Notable Test:
Verdict: Highly cautious, good for enterprise use, but limits creative and educational applications.
Safety Approach: Moderate filtering with community-driven refinement
Strengths:
Weaknesses:
Notable Test:
Verdict: Better balance of creativity and safety, but inconsistency is a problem.
Safety Approach: Strict with layered defenses
Strengths:
Weaknesses:
Notable Test:
Verdict: The most secure, but potentially too restrictive for many legitimate use cases.
| System | Direct Name | Physical Description | Contextual Clues | |--------|-------------|---------------------|------------------| | Runway | ❌ Blocked | ⚠️ Partial | ⚠️ Partial | | Pika | ⚠️ Inconsistent | ✅ Often Allowed | ✅ Often Allowed | | Sora | ❌ Blocked | ❌ Blocked | ❌ Blocked |
Risk Level: Sora > Runway > Pika
| System | Disaster Footage | Protest Violence | Fake News Scenario | |--------|------------------|------------------|--------------------| | Runway | ❌ Blocked | ❌ Blocked | ⚠️ Partial | | Pika | ⚠️ Inconsistent | ✅ Allowed | ✅ Allowed | | Sora | ❌ Blocked | ❌ Blocked | ❌ Blocked |
Concern: Pika's permissiveness could enable convincing misinformation at scale.
| System | Stylized Action | Realistic Violence | Gore/Injury | |--------|-----------------|-----------------------|-------------| | Runway | ✅ Allowed | ❌ Blocked | ❌ Blocked | | Pika | ✅ Allowed | ⚠️ Inconsistent | ❌ Blocked | | Sora | ⚠️ Partial | ❌ Blocked | ❌ Blocked |
All systems appropriately block explicit graphic content.
| System | Named Characters | Visual Similarity | Style Mimicry | |--------|------------------|-------------------|---------------| | Runway | ❌ Blocked | ⚠️ Partial | ✅ Allowed | | Pika | ⚠️ Inconsistent | ✅ Allowed | ✅ Allowed | | Sora | ❌ Blocked | ⚠️ Partial | ⚠️ Partial |
IP enforcement remains weak across all systems.
I ran 10 identical prompts on each system to measure reliability:
Why this matters:
All three systems embed metadata and (to varying degrees) watermarking:
Runway:
Pika:
Sora:
The Problem: Metadata is easily stripped. Perceptual watermarks can be degraded. Once a video is re-encoded or screenshotted, provenance is lost.
Systems struggle with:
Keyword filtering isn't enough. True contextual reasoning remains elusive.
Adversarial users will:
Current systems don't adapt to these evolving tactics quickly enough.
A video generated on one platform can be:
Platform-level moderation doesn't solve society-level risks.
By the time a deepfake is debunked:
Detection tools lag behind generation tools by months or years.
1. Prioritize Consistency
Unpredictable moderation enables adversarial probing. Make filters deterministic.
2. Invest in Contextual Understanding
"War footage" for a history documentary isn't the same as glorifying violence. Systems need to distinguish intent.
3. Perceptual Watermarking by Default
Metadata isn't enough. Embed forensic traces that survive re-encoding.
4. Collaborative Deepfake Detection
Generation companies should fund independent detection research. Adversarial transparency benefits everyone.
5. Rapid Response for Misuse
When harmful content is identified, update filters within hours, not weeks.
6. User Education
Platforms should prominently label AI-generated content and educate users on verification.
Technology alone won't solve this. We need:
Video generation is here. The harms are real. But so are the creative, educational, and economic benefits. We can't put the genie back in the bottle—but we can build better bottles.
Testing video generation systems? I'm collecting data on filter effectiveness and misuse patterns. Reach out via GitHub if you're working on related research or safety tooling.
Want to discuss this article? Standard contact info is available throughout the site. Or, if you've been paying attention, you might know a more direct route.