Production stays calm. The Lab gets curious.
Traveler can keep using the current private pipeline while this room tests better video options. The goal is not to chase novelty every day. The goal is to notice when a model becomes meaningfully safer for S.
- Use the same Traveler source image across all candidates in a run.
- Use the same city, motion brief, duration target, and safety constraints.
- Score the output before knowing which model made it whenever possible.
- Record funny failures because they are teaching material.
- Only promote a model after multiple clean private tests, not one lucky render.
The magic bag test
The latest private Traveler Short produced a useful hallucination: S's bag appeared carried by an invisible hand. That failure is now part of the rubric, not just a joke.
| Slot | Role | What we test | Promotion signal |
|---|---|---|---|
| Seedance | Current baseline | Long uninterrupted motion, Traveler cadence, current prompt guards. | Keep if it remains most reliable after object-contact and signage tightening. |
| Sora candidate | High-priority challenger | Realism, motion physics, identity preservation, natural camera language. | Promote only if it beats baseline in repeated private Traveler shots. |
| Veo / Runway / Kling / Pika | Rotating challengers | Continuity, hands, bags, signage, face drift, duration, cost, speed. | Earns a production trial by winning a controlled bakeoff, not by hype. |
| No-video fallback | Safety valve | Ken Burns stills, parallax, slow editorial motion, image sequence cuts. | Use when model video risks identity or physical weirdness on a public day. |
Use these together
Video testing depends on stable source images and a clean production gate.
Collect
Save the source image, model, prompt, cost, duration, and output URL for every serious test.
Score
Use the six-part rubric and write down the failure mode, even when the result is charming.
Private
Run at least three private Traveler uploads with the candidate before any public schedule change.
Promote
Switch production only when the new model is better, safer, and easier to explain than the current baseline.