Skip to main content

Media Metadata

Each uploaded Media asset is analyzed automatically and its Metadata is extracted. This starts with basic Metadata like file size, creation date etc. and goes far beyond. It includes descriptive Metadata like transcripts; speakers diarization and recognition; keywords and entities; and a scene recognition, which is obtained applying various machine learning techniques.

Transcription in multiple languages is supported, you can retrieve a list of available languages for your tenant and request multiple language transcripts for an asset via extract metadata. In particular, we also support the automatic cross-language transcription from any source language to English.

The speakers in videos and audio assetss are automatically assigned representations, that can be labeled. Then they are named automatically in the transcripts and Subtitles if they are recognized again in subsquently uploaded Media assets.

The Metadata is indexed and constitutes the basis for the deep-search functionality accross all Media types. The Metadata can be retrieved as structured-data for SEO enhancement of embedded Media assets.