Compared to Video Understanding

TwelveLabs offers two products for working with video. They use the same ingestion flow but support different reasoning scopes.

Video Understanding: Search, analyze, and embed individual videos with dedicated APIs.
Video agents (Jockey): Organize, track, and extract insights across video collections through a single API.
Use both together: Combine Video Understanding for embeddings and per-video analysis with Video agents (Jockey) for reasoning across video collections.

Detailed comparison by task

Task	Video Understanding	Video agents (Jockey)
Search for moments in videos	✅ Search API	✅ Responses API
Summarize a single video	✅ Analyze API	⚠️ Not recommended - use Analyze API
Organize videos by topic	❌	✅ Responses API with structured output
Track entities across videos	❌	✅ Responses API
Build agent workflows	❌	✅ Multi-turn sessions + structured output
Generate vector embeddings	✅ Embed API	❌

Video agents (Jockey) limitations

Video rendering and editing: The platform returns text, data references, and timestamps, not processed video. Use the references in your own rendering pipeline.
Single-video analysis: For per-video analysis, use the Analyze API. The Video agents (Jockey) works best when reasoning across multiple videos.
Vector embeddings: Use the Embed API to generate embeddings.

Next steps

Migrate from Video Understanding - move your existing workflows to Video agents (Jockey)
Guides - set up content, configure ingestion, and generate responses
Recipes - common tasks you can adapt to your use case