Media Metadata
Each uploaded Media asset is analyzed automatically and its Metadata is extracted.
This starts with basic Metadata like file size, creation date etc. and goes far beyond.
It includes descriptive Metadata like transcripts; speakers diarization and recognition; keywords and entities; and a
scene recognition, which is obtained applying various machine learning techniques.
Transcription in multiple languages is supported, you can retrieve a list of available languages
for your tenant and request multiple language transcripts for an asset via extract metadata.
In particular, we also support the automatic cross-language transcription from any source language to English.
The speakers in videos and audio assetss are automatically assigned representations, that can be labeled.
Then they are named automatically in the transcripts and Subtitles if they are recognized
again in subsquently uploaded Media assets.
The Metadata is indexed and constitutes the basis for the deep-search functionality accross all
Media types.
The Metadata can be retrieved as structured-data for SEO enhancement of embedded Media assets.
๐๏ธ Get metadata
Returns all derived `Metadata` for a given `Media` asset.
๐๏ธ Extract metadata
Extracts metadata (with optional presets) from the `Media` asset.
๐๏ธ Delete metadata
Deletes metadata for a given asset.
๐๏ธ List transcripts
Returns a list of transcripts associated with the given `Media` asset.
๐๏ธ Get transcript
Retrieves a transcript of a `Media` asset by its identifier. An ID is required since there can be multiple `Transcript`s for a single `Media` asset.
๐๏ธ Update transcript
No longer maintained. Use [Update and time-align the transcript](#tag/Media-Metadata/operation/updateTranscriptWithTimeAlignment) instead.
๐๏ธ Delete transcript
Deletes a `Transcript` of a `Media` asset.
๐๏ธ Update and time-align the transcript
Updates a `Transcript` by adjusting word-level timestamps. Supports bulk editing of multiple transcript paragraphs in a single request.
๐๏ธ Activate a transcript
Adds the `isActive` attribute to the specified `Transcript` and removes it from all other `Transcript`s of the given `Media` asset.
๐๏ธ Update speaker in a transcript
Updates a speaker in a `Transcript` by assigning a different speaker to the `paragraph`. This action informs the system that the recognition was incorrect and that the representation of the speaker should be updated.
๐๏ธ Assign transcript as subtitle
Derives subtitles from the `Transcript` and assigns them. The ISO-639 `languageModel` code of the `Transcript` is automatically mapped to the language name.
๐๏ธ Download a transcript
Downloads a transcript in the specified format. Available formats are `JSON`, `TXT`, `SRT`, `WebVTT`, and `DOCX` (*Microsoft Word*). A [template can be created](#tag/Design-Settings/operation/createDocxExportDesignSettings) and applied to the `DOCX` format export.