Vision AI Agents Developer Platform

APIs and architecture for ingesting, analyzing, indexing, and searching video intelligence.

Vision AI Agents enables developers and enterprises to ingest video content, run intelligence analysis, generate searchable indexes, and build video discovery applications powered by structured metadata and vector embeddings.


Platform Pipeline

Video Upload
      ↓
Video Ingest APIs
      ↓
Video Intelligence Analysis
      ↓
Optional Audience Testing
      ↓
Metadata + Vector Embeddings
      ↓
Search APIs
      ↓
Application Results

All downstream operations require the video_id generated during video ingest.


Quick Navigation

| Section | Description |
|---|---|
| Platform Overview | Overview of the Vision AI Agents platform and capabilities |
| Platform Workflows | Supported developer integration workflows |
| Video Ingest APIs | Upload video content and generate video IDs |
| Video Intelligence APIs | Run intelligence analysis pipelines |
| Audience Testing APIs | Measure viewer engagement signals |
| Search APIs | Query indexed intelligence signals |
| Getting Started | Quick integration guide |
| Platform Architecture | Technical architecture overview |
| Authentication | API key authentication model |
| Rate Limits & Usage Tiers | API usage scaling and quotas |
| Error Handling | API error codes and handling strategies |
| Search Integration Guide | Integrating semantic video search |
| Audience Testing Guide | Audience analytics integration |
| Job Status & Polling Guide | Monitoring asynchronous processing |

Platform Overview

Vision AI Agents provides a full pipeline for video intelligence processing.

| Platform Layer | Description |
|---|---|
| Video Ingest | Upload and register video content in the platform |
| Intelligence Analysis | Extract signals from video frames, audio, and script |
| Metadata Indexing | Generate structured metadata indexes |
| Vector Embeddings | Create semantic vector embeddings |
| Search Retrieval | Query indexed intelligence signals |
| Audience Testing | Measure viewer engagement analytics |

Platform Workflows

Vision AI Agents supports four primary developer workflows.

| Workflow | Description |
|---|---|
| Upload and Analyze | Upload new videos and run intelligence analysis |
| Analyze Existing Videos | Run analysis on videos already ingested |
| Batch Video Ingest | Upload large video libraries asynchronously |
| Search Delivery | Retrieve indexed results through APIs or hosted search |

Video Ingest APIs

Video ingest APIs are the entry point to the platform.

All analysis and search operations require a system-generated video_id returned by the ingest endpoint.

Example Endpoint

POST /api/video/ingest

| Capability | Description |
|---|---|
| Single Video Upload | Upload one video for analysis |
| Batch Video Upload | Upload multiple videos simultaneously |
| Automatic Video ID Generation | Generate system identifier for each video |
| Analysis Pipeline Trigger | Automatically start processing pipeline |
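
As an illustration of the ingest step, the sketch below builds a request body for one or more videos and reads the returned identifiers. The source does not specify the request or response schema, so the `videos` and `source_url` fields and the response shape are assumptions made for this example; only the `video_id` name comes from the documentation.

```python
import json


def build_ingest_payload(video_urls: list[str]) -> dict:
    """Build a body for POST /api/video/ingest.

    The "videos" / "source_url" field names are hypothetical.
    A one-element list models single upload; a longer list models batch upload.
    """
    if not video_urls:
        raise ValueError("at least one video is required")
    return {"videos": [{"source_url": url} for url in video_urls]}


def extract_video_ids(response_body: str) -> list[str]:
    """Read video_id values from an assumed response shape:
    {"videos": [{"video_id": "..."}, ...]}
    """
    data = json.loads(response_body)
    return [item["video_id"] for item in data["videos"]]


payload = build_ingest_payload(["https://example.com/clip.mp4"])
# Stand-in for a real response; the platform's actual format may differ.
sample_response = '{"videos": [{"video_id": "vid_123"}]}'
video_ids = extract_video_ids(sample_response)
print(payload, video_ids)
```

Keeping the extracted identifiers is the important part: every later analysis, audience testing, or search call refers back to them.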

Video Intelligence APIs

Video intelligence APIs allow developers to run intelligence modules on previously ingested videos.

Example Endpoint

POST /api/video/analyze

| Analysis Option | Description |
|---|---|
| Full Analysis | Runs all intelligence modules |
| Selective Analysis | Runs only specific requested modules |

Video Intelligence Domains

| Domain | Signals Extracted |
|---|---|
| Scene Actor Analytics | Actor emotion engagement, eye contact engagement, attention intensity |
| Scene Elements Analysis | Audio genre detection, rhythm analysis, script linguistics, color traversal |
| Scene Psychology Analysis | Emotion classification, sentiment signals, audience mirroring |
| Crescendo Detection | Narrative peaks and high-impact scenes |
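
The difference between full and selective analysis can be sketched as a payload builder for POST /api/video/analyze. The module identifiers below are hypothetical names derived from the four domains listed above, and the `analysis` and `modules` fields are assumptions; the source does not document the request schema.

```python
from typing import Optional

# Hypothetical identifiers, one per documented intelligence domain.
KNOWN_MODULES = {
    "scene_actor_analytics",
    "scene_elements_analysis",
    "scene_psychology_analysis",
    "crescendo_detection",
}


def build_analyze_payload(video_id: str, modules: Optional[list[str]] = None) -> dict:
    """Build a body for POST /api/video/analyze.

    modules=None requests full analysis (all modules);
    a list of module names requests selective analysis.
    """
    if not video_id:
        raise ValueError("video_id from the ingest step is required")
    if modules is None:
        return {"video_id": video_id, "analysis": "full"}
    unknown = sorted(set(modules) - KNOWN_MODULES)
    if unknown:
        raise ValueError(f"unknown modules: {unknown}")
    return {"video_id": video_id, "analysis": "selective", "modules": list(modules)}


full = build_analyze_payload("vid_123")
selective = build_analyze_payload("vid_123", ["crescendo_detection"])
print(full)
print(selective)
```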

Audience Testing APIs

Audience testing is optional and does not run automatically; it must be explicitly requested through the API.

Endpoint

POST /api/audience/test

| Parameter | Type | Description |
|---|---|---|
| video_id | string | Video identifier generated during ingest |
| participants | integer | Number of audience participants (maximum 10) |
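
Validating the two documented parameters on the client side avoids sending a request that is bound to be rejected. The sketch below uses the `video_id` and `participants` parameters and the maximum of 10 from the table above; the lower bound of 1 is an assumption, since the source states only the maximum.

```python
MAX_PARTICIPANTS = 10  # documented maximum
MIN_PARTICIPANTS = 1   # assumed; the source does not state a minimum


def build_audience_test_payload(video_id: str, participants: int) -> dict:
    """Build a body for POST /api/audience/test after basic validation."""
    if not isinstance(video_id, str) or not video_id:
        raise ValueError("video_id must be a non-empty string from the ingest step")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(participants, bool) or not isinstance(participants, int):
        raise TypeError("participants must be an integer")
    if not MIN_PARTICIPANTS <= participants <= MAX_PARTICIPANTS:
        raise ValueError(
            f"participants must be between {MIN_PARTICIPANTS} and {MAX_PARTICIPANTS}"
        )
    return {"video_id": video_id, "participants": participants}


audience_payload = build_audience_test_payload("vid_123", 8)
print(audience_payload)
```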

Audience Signals

| Signal | Description |
|---|---|
| Emotion Engagement | Viewer emotional reactions |
| Attention Zones | Areas viewers visually focus on |
| Attention Intensity | Strength of viewer attention |
| Engagement Zones | Highly engaging moments |
| Drop-off Zones | Moments where viewers lose interest |

Search APIs

Endpoint

POST /api/search/query

| Capability | Description |
|---|---|
| Metadata Search | Query structured metadata fields |
| Vector Similarity Search | Semantic search across embeddings |
| Structured Filtering | Filter results using metadata |
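
To make the combination of structured filtering and vector similarity search concrete, here is a small in-memory illustration. It is a conceptual sketch only: the source does not describe how the platform ranks results, and the cosine similarity measure, the toy index, and the `genre` metadata field are all assumptions chosen for the example.

```python
import math
from typing import Optional


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(index: list[dict], query_vector: list[float],
           filters: Optional[dict] = None, top_k: int = 3) -> list[str]:
    """Filter by exact metadata match, then rank the survivors by similarity."""
    filters = filters or {}
    candidates = [
        item for item in index
        if all(item["metadata"].get(key) == value for key, value in filters.items())
    ]
    ranked = sorted(
        candidates,
        key=lambda item: cosine_similarity(item["embedding"], query_vector),
        reverse=True,
    )
    return [item["video_id"] for item in ranked[:top_k]]


# Toy index with two-dimensional embeddings (real embeddings are much larger).
index = [
    {"video_id": "vid_a", "embedding": [1.0, 0.0], "metadata": {"genre": "drama"}},
    {"video_id": "vid_b", "embedding": [0.6, 0.8], "metadata": {"genre": "drama"}},
    {"video_id": "vid_c", "embedding": [0.0, 1.0], "metadata": {"genre": "comedy"}},
]
filtered = search(index, [0.0, 1.0], filters={"genre": "drama"})
unfiltered = search(index, [0.0, 1.0])
print(filtered, unfiltered)
```

Applying the metadata filter first narrows the candidate set before any similarity scores are computed, which is why `vid_c` never appears in the filtered result even though it is the closest match.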

Authentication

All API requests require authentication using API keys.

Example Header

Authorization: Bearer API_KEY

| Header | Required | Description |
|---|---|---|
| Authorization | Yes | API authentication token |
| Content-Type | Yes | JSON request format |
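
A minimal sketch of attaching both required headers, using only the Python standard library. The request is built but not sent. The host `https://api.example.com` is a placeholder, since the source does not give a base URL, and `build_request` is a hypothetical helper name.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com"  # placeholder host; not given in the source


def build_request(path: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) an authenticated JSON POST request."""
    return urllib.request.Request(
        url=BASE_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_request("/api/video/ingest", {}, api_key="API_KEY")
print(req.get_method(), req.full_url)
print(req.get_header("Authorization"))
```

In practice the key would be read from an environment variable or a secrets store rather than written into source code.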

Rate Limits

| Limit Type | Description |
|---|---|
| Concurrent Requests | Maximum simultaneous API calls |
| Daily Requests | Maximum API requests per day |
| Monthly Token Limits | Platform compute usage quota |
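
One way a client might stay under a concurrent-request limit is to cap the number of in-flight calls with a bounded thread pool. The sketch below is an assumption-laden illustration: the limit of 3 is invented (the source gives no numbers), and `fake_api_call` stands in for a real request so the example runs without network access.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

MAX_CONCURRENT = 3  # hypothetical; use the value for your usage tier


def run_with_concurrency_cap(tasks, max_concurrent: int = MAX_CONCURRENT) -> list:
    """Run callables with at most max_concurrent in flight; results keep task order."""
    with ThreadPoolExecutor(max_workers=max_concurrent) as pool:
        futures = [pool.submit(task) for task in tasks]
        return [future.result() for future in futures]


# Demo: a stand-in API call that records how many calls overlap.
_lock = threading.Lock()
stats = {"active": 0, "peak": 0}


def fake_api_call() -> str:
    with _lock:
        stats["active"] += 1
        stats["peak"] = max(stats["peak"], stats["active"])
    time.sleep(0.01)  # simulate network latency
    with _lock:
        stats["active"] -= 1
    return "ok"


results = run_with_concurrency_cap([fake_api_call] * 10)
print(len(results), "calls completed; peak concurrency:", stats["peak"])
```

Daily and monthly quotas cannot be enforced this way; they would need a persistent counter or usage information reported by the platform.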

Getting Started

| Step | Action |
|---|---|
| 1 | Obtain API credentials |
| 2 | Upload a video using the ingest API |
| 3 | Run intelligence analysis |
| 4 | Optionally request audience testing |
| 5 | Retrieve indexed results using search APIs |
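
Steps 2 through 5 can be sketched as a single function that takes the HTTP transport as a parameter, so the call order can be exercised without a live API. The four endpoint paths, `video_id`, and `participants` come from this document; the `source_url` and `query` fields, the response shapes, and the `fake_post` stand-in are assumptions for illustration. Because processing is asynchronous, a real integration would also need to wait for analysis to finish before searching.

```python
from typing import Callable, Optional

# A transport takes (path, JSON body) and returns the parsed JSON response.
Transport = Callable[[str, dict], dict]


def run_workflow(post: Transport, video_url: str, query: str,
                 participants: Optional[int] = None) -> dict:
    """Ingest, analyze, optionally audience-test, then search."""
    ingest = post("/api/video/ingest", {"source_url": video_url})
    video_id = ingest["video_id"]  # required by every downstream call

    post("/api/video/analyze", {"video_id": video_id})

    if participants is not None:  # audience testing is opt-in
        post("/api/audience/test", {"video_id": video_id, "participants": participants})

    return post("/api/search/query", {"query": query})


# Stand-in transport that records calls and returns canned responses.
calls = []


def fake_post(path: str, body: dict) -> dict:
    calls.append((path, body))
    if path == "/api/video/ingest":
        return {"video_id": "vid_demo"}
    if path == "/api/search/query":
        return {"results": []}
    return {"status": "accepted"}


search_results = run_workflow(
    fake_post, "https://example.com/clip.mp4", "tense dialogue", participants=5
)
print([path for path, _ in calls])
```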