Vision AI Agents Developer Platform
APIs and architecture for ingesting, analyzing, indexing, and searching video intelligence.
Vision AI Agents enables developers and enterprises to ingest video content, run intelligence analysis, generate searchable indexes, and build video discovery applications powered by structured metadata and vector embeddings.
Platform Pipeline
Video Upload
↓
Video Ingest APIs
↓
Video Intelligence Analysis
↓
Optional Audience Testing
↓
Metadata + Vector Embeddings
↓
Search APIs
↓
Application Results
All downstream operations require the video_id generated during video ingest.
Quick Navigation
| Section | Description |
|---|---|
| Platform Overview | Overview of the Vision AI Agents platform and capabilities |
| Platform Workflows | Supported developer integration workflows |
| Video Ingest APIs | Upload video content and generate video IDs |
| Video Intelligence APIs | Run intelligence analysis pipelines |
| Audience Testing APIs | Measure viewer engagement signals |
| Search APIs | Query indexed intelligence signals |
| Getting Started | Quick integration guide |
| Platform Architecture | Technical architecture overview |
| Authentication | API key authentication model |
| Rate Limits & Usage Tiers | API usage scaling and quotas |
| Error Handling | API error codes and handling strategies |
| Search Integration Guide | Integrating semantic video search |
| Audience Testing Guide | Audience analytics integration |
| Job Status & Polling Guide | Monitoring asynchronous processing |
Platform Overview
Vision AI Agents provides an end-to-end pipeline for video intelligence processing, organized into the following layers.
| Platform Layer | Description |
|---|---|
| Video Ingest | Upload and register video content in the platform |
| Intelligence Analysis | Extract signals from video frames, audio, and script |
| Metadata Indexing | Generate structured metadata indexes |
| Vector Embeddings | Create semantic vector embeddings |
| Search Retrieval | Query indexed intelligence signals |
| Audience Testing | Measure viewer engagement analytics |
Platform Workflows
Vision AI Agents supports four primary developer workflows.
| Workflow | Description |
|---|---|
| Upload and Analyze | Upload new videos and run intelligence analysis |
| Analyze Existing Videos | Run analysis on videos already ingested |
| Batch Video Ingest | Upload large video libraries asynchronously |
| Search Delivery | Retrieve indexed results through APIs or hosted search |
Video Ingest APIs
Video ingest APIs are the entry point to the platform.
All analysis and search operations require a system-generated video_id returned by the ingest endpoint.
Example Endpoint
POST /api/video/ingest
| Capability | Description |
|---|---|
| Single Video Upload | Upload one video for analysis |
| Batch Video Upload | Upload multiple videos simultaneously |
| Automatic Video ID Generation | Generate system identifier for each video |
| Analysis Pipeline Trigger | Automatically start processing pipeline |
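As a rough sketch of how single and batch uploads might share one request shape, the helper below builds a JSON body for `POST /api/video/ingest`. Only the endpoint path comes from this page; the `videos` and `url` field names, and the idea of referencing videos by URL, are assumptions made for illustration.

```python
import json


def build_ingest_payload(video_urls: list[str]) -> dict:
    """Return a JSON-serializable body for POST /api/video/ingest.

    One URL models a single upload; several URLs model a batch upload.
    The "videos" and "url" keys are hypothetical field names.
    """
    if not video_urls:
        raise ValueError("at least one video URL is required")
    return {"videos": [{"url": url} for url in video_urls]}


# Single upload: one entry in the list.
single = build_ingest_payload(["https://cdn.example.com/trailer.mp4"])

# Batch upload: several entries in the same request body.
batch = build_ingest_payload([
    "https://cdn.example.com/ep1.mp4",
    "https://cdn.example.com/ep2.mp4",
])

print(json.dumps(batch, indent=2))
```

Whatever the real request shape is, the response is where the `video_id` for each video would be read and stored for later analysis and search calls.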
Video Intelligence APIs
Video intelligence APIs allow developers to run analytics modules on previously ingested videos.
Example Endpoint
POST /api/video/analyze
| Analysis Option | Description |
|---|---|
| Full Analysis | Runs all intelligence modules |
| Selective Analysis | Runs only specific requested modules |
Video Intelligence Domains
| Domain | Signals Extracted |
|---|---|
| Scene Actor Analytics | Actor emotion engagement, eye contact engagement, attention intensity |
| Scene Elements Analysis | Audio genre detection, rhythm analysis, script linguistics, color traversal |
| Scene Psychology Analysis | Emotion classification, sentiment signals, audience mirroring |
| Crescendo Detection | Narrative peaks and high-impact scenes |
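The difference between full and selective analysis can be sketched as a small payload builder for `POST /api/video/analyze`. The module identifiers below are hypothetical keys derived from the domain names in the table, and the `analysis` and `modules` fields are likewise assumptions; the actual API may name them differently.

```python
from typing import Optional

# Hypothetical module identifiers derived from the documented domain names.
KNOWN_MODULES = {
    "scene_actor_analytics",
    "scene_elements_analysis",
    "scene_psychology_analysis",
    "crescendo_detection",
}


def build_analyze_payload(video_id: str, modules: Optional[list[str]] = None) -> dict:
    """Build a request body for full analysis (modules=None) or selective analysis."""
    if not video_id:
        raise ValueError("video_id from the ingest step is required")
    if modules is None:
        # Full analysis: run every intelligence module.
        return {"video_id": video_id, "analysis": "full"}
    if not modules:
        raise ValueError("selective analysis needs at least one module")
    unknown = sorted(set(modules) - KNOWN_MODULES)
    if unknown:
        raise ValueError(f"unknown modules: {unknown}")
    # Selective analysis: run only the requested modules.
    return {"video_id": video_id, "analysis": "selective", "modules": list(modules)}


full = build_analyze_payload("vid_123")
selective = build_analyze_payload("vid_123", ["crescendo_detection"])
```

Validating module names on the client is a design choice in this sketch: it surfaces typos before a request is sent.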
Audience Testing APIs
Audience testing is optional and does not run as part of the default analysis pipeline; it must be requested explicitly through the API.
Endpoint
POST /api/audience/test
| Parameter | Type | Description |
|---|---|---|
| video_id | string | Video identifier generated during ingest |
| participants | integer | Number of audience participants (maximum 10) |
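A minimal sketch of client-side validation for the two documented parameters follows. The maximum of 10 participants comes from the table above; the lower bound of 1 is an assumption, as is the flat JSON body shape.

```python
MAX_PARTICIPANTS = 10  # documented maximum
MIN_PARTICIPANTS = 1   # assumed lower bound; not stated in the documentation


def build_audience_test_payload(video_id: str, participants: int) -> dict:
    """Validate inputs and return a body for POST /api/audience/test."""
    if not video_id:
        raise ValueError("video_id from the ingest step is required")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(participants, bool) or not isinstance(participants, int):
        raise TypeError("participants must be an integer")
    if not MIN_PARTICIPANTS <= participants <= MAX_PARTICIPANTS:
        raise ValueError(
            f"participants must be between {MIN_PARTICIPANTS} and {MAX_PARTICIPANTS}"
        )
    return {"video_id": video_id, "participants": participants}


payload = build_audience_test_payload("vid_123", participants=8)
```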
Audience Signals
| Signal | Description |
|---|---|
| Emotion Engagement | Viewer emotional reactions |
| Attention Zones | Areas viewers visually focus on |
| Attention Intensity | Strength of viewer attention |
| Engagement Zones | Highly engaging moments |
| Drop-off Zones | Moments where viewers lose interest |
Search APIs
Endpoint
POST /api/search/query
| Capability | Description |
|---|---|
| Metadata Search | Query structured metadata fields |
| Vector Similarity Search | Semantic search across embeddings |
| Structured Filtering | Filter results using metadata |
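One way the three capabilities might combine in a single request to `POST /api/search/query` is sketched below. The `query`, `filters`, and `top_k` fields are hypothetical names chosen for illustration, as are the example filter keys; the page does not document the request schema.

```python
from typing import Any, Optional


def build_search_payload(
    text: Optional[str] = None,
    filters: Optional[dict[str, Any]] = None,
    top_k: int = 10,
) -> dict[str, Any]:
    """Build a search body from a semantic query, metadata filters, or both.

    All field names here are assumptions, not documented parameters.
    """
    if text is None and not filters:
        raise ValueError("provide a semantic query, metadata filters, or both")
    if top_k < 1:
        raise ValueError("top_k must be at least 1")
    payload: dict[str, Any] = {"top_k": top_k}
    if text is not None:
        payload["query"] = text  # free text for vector similarity search
    if filters:
        payload["filters"] = dict(filters)  # structured metadata constraints
    return payload


# Semantic search narrowed by a metadata filter.
combined = build_search_payload(
    text="tense rooftop chase at night",
    filters={"emotion": "fear"},
    top_k=5,
)

# Metadata-only search with no semantic query.
metadata_only = build_search_payload(filters={"audio_genre": "orchestral"})
```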
Authentication
All API requests require authentication using API keys.
Example Header
Authorization: Bearer API_KEY
| Header | Required | Description |
|---|---|---|
| Authorization | Yes | API authentication token |
| Content-Type | Yes | JSON request format |
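The two required headers can be attached with the standard library alone, as in the sketch below. The host `https://api.example.com` is a placeholder, the `query` body field is hypothetical, and the request is built but deliberately not sent so the example runs offline.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com"  # placeholder host; substitute your deployment's URL


def build_request(path: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build an authenticated JSON POST request without sending it."""
    if not api_key:
        raise ValueError("an API key is required")
    return urllib.request.Request(
        url=BASE_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # documented header format
            "Content-Type": "application/json",    # JSON request body
        },
        method="POST",
    )


request = build_request(
    "/api/search/query",
    {"query": "rooftop chase"},  # hypothetical body field
    api_key="YOUR_API_KEY",
)
# Passing `request` to urllib.request.urlopen would send it.
```

In practice the key would be read from an environment variable or secret store, not hard-coded in source.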
Rate Limits
| Limit Type | Description |
|---|---|
| Concurrent Requests | Maximum simultaneous API calls |
| Daily Requests | Maximum API requests per day |
| Monthly Token Limits | Platform compute usage quota |
Getting Started
| Step | Action |
|---|---|
| 1 | Obtain API credentials |
| 2 | Upload a video using the ingest API |
| 3 | Run intelligence analysis |
| 4 | Optionally request audience testing |
| 5 | Retrieve indexed results using search APIs |
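Steps 2 through 5 can be sketched as one function that threads the `video_id` from ingest into each later call. The transport is injected as a `post(path, payload)` callable so the flow can be exercised without network access. The request and response field names are assumptions, apart from `video_id` and `participants`, and the sketch omits waiting for asynchronous processing to finish before searching.

```python
from typing import Callable, Optional

# A transport takes (path, JSON body) and returns the decoded JSON response.
PostFn = Callable[[str, dict], dict]


def run_quickstart(
    post: PostFn,
    video_url: str,
    search_text: str,
    participants: Optional[int] = None,
) -> dict:
    """Ingest, analyze, optionally audience-test, then search."""
    # Step 2: ingest the video and capture the system-generated video_id.
    ingest = post("/api/video/ingest", {"videos": [{"url": video_url}]})
    video_id = ingest["video_id"]  # assumed response field name

    # Step 3: request full intelligence analysis.
    post("/api/video/analyze", {"video_id": video_id, "analysis": "full"})

    # Step 4: audience testing runs only when explicitly requested.
    if participants is not None:
        post("/api/audience/test", {"video_id": video_id, "participants": participants})

    # Step 5: query the indexed results.
    return post("/api/search/query", {"query": search_text})


# Offline stand-in transport that records calls and returns canned responses.
calls: list[tuple[str, dict]] = []


def fake_post(path: str, payload: dict) -> dict:
    calls.append((path, payload))
    if path == "/api/video/ingest":
        return {"video_id": "vid_123"}
    if path == "/api/search/query":
        return {"results": []}
    return {"status": "accepted"}


outcome = run_quickstart(fake_post, "https://cdn.example.com/trailer.mp4", "opening scene")
```

Swapping `fake_post` for a function that sends authenticated HTTP requests turns the same flow into a live integration.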