Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
requirements.txt		requirements.txt
transform.py		transform.py
unstructured-to-structured.ipynb		unstructured-to-structured.ipynb

README.md

Transform unstructured data to structured in real-time

Media companies want to extract key information from livestreamed events for subtitles, translations, and content summaries but doing this manually or with bactch processing causes delays. This project showcases how to use GlassFlow for real-time extraction, transformation, and translation of YouTube video data. The handler extracts key topics from the video transcript, generates meaningful insights, and translates the transcript into any specified language.

Features

Extract video transcript from YouTube.
Process the data to extract topics and other meaningful data (identifies key metrics such as the number of speakers and the total duration of the spoken content).
Translate the transcript into the user's preferred language (for example, from English to Spanish).
Return structured data and derived metrics.

Pre-requisites

Create your free GlassFlow account via the GlassFlow WebApp.
Get your Personal Access Token to authorize the Python SDK to interact with GlassFlow Cloud.
Get your OpenAI API Key https://platform.openai.com/.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

unstructured-to-structured

unstructured-to-structured

README.md

Transform unstructured data to structured in real-time

Features

Pre-requisites

Files

unstructured-to-structured

Directory actions

More options

Directory actions

More options

Latest commit

History

unstructured-to-structured

Folders and files

parent directory

README.md

Transform unstructured data to structured in real-time

Features

Pre-requisites