Media Metadata
Each uploaded Media asset is analyzed automatically and its Metadata
is extracted.
This starts with basic Metadata
like file size, creation date etc. and goes far beyond.
It includes descriptive Metadata
like transcripts
; speakers
diarization and recognition; keywords
and entities
; and a
scene recognition, which is obtained applying various machine learning techniques.
Transcription in multiple languages is supported, you can retrieve a list of available languages
for your tenant and request multiple language transcripts
for an asset via extract metadata.
In particular, we also support the automatic cross-language transcription from any source language to English.
The speakers
in videos and audio assetss are automatically assigned representations, that can be labeled.
Then they are named automatically in the transcripts
and Subtitles if they are recognized
again in subsquently uploaded Media
assets.
The Metadata
is indexed and constitutes the basis for the deep-search functionality accross all
Media
types.
The Metadata
can be retrieved as structured-data for SEO enhancement of embedded Media
assets.
๐๏ธ Get metadata
Returns all derived `Metadata` for a given `Media` asset.
๐๏ธ Extract metadata
Extracts metadata (with optional presets) from the `Media` asset.
๐๏ธ Delete metadata
Deletes metadata for a given asset.
๐๏ธ List transcripts
Returns a list of transcripts associated with the given `Media` asset.
๐๏ธ Get transcript
Retrieves a transcript of a `Media` asset by its identifier. An ID is required since there can be multiple `Transcript`s for a single `Media` asset.
๐๏ธ Update transcript
No longer maintained. Use [Update and time-align the transcript](#tag/Media-Metadata/operation/updateTranscriptWithTimeAlignment) instead.
๐๏ธ Delete transcript
Deletes a `Transcript` of a `Media` asset.
๐๏ธ Update and time-align the transcript
Updates a `Transcript` by adjusting word-level timestamps. Supports bulk editing of multiple transcript paragraphs in a single request.
๐๏ธ Activate a transcript
Adds the `isActive` attribute to the specified `Transcript` and removes it from all other `Transcript`s of the given `Media` asset.
๐๏ธ Update speaker in a transcript
Updates a speaker in a `Transcript` by assigning a different speaker to the `paragraph`. This action informs the system that the recognition was incorrect and that the representation of the speaker should be updated.
๐๏ธ Assign transcript as subtitle
Derives subtitles from the `Transcript` and assigns them. The ISO-639 `languageModel` code of the `Transcript` is automatically mapped to the language name.
๐๏ธ Download a transcript
Downloads a transcript in the specified format. Available formats are `JSON`, `TXT`, `SRT`, `WebVTT`, and `DOCX` (*Microsoft Word*). A [template can be created](#tag/Design-Settings/operation/createDocxExportDesignSettings) and applied to the `DOCX` format export.