Deploy the Video speech-to-text integration to process data on your processing stack in no time. Koyeb integrations are preconfigured applications to ingest, process, and store your data.
Compose processing workflows with ready to use integrations and your own code if needed. Build your business logic with our SDKs in Golang, Node.js, and Python or using Docker images.
Koyeb all-in-one platform is backed by a highly available infrastructure. Build advanced operation pipelines for your data with zero server maintenance.
This integration allows you to automate video speech transcription to convert audio in a video to text. You can create automatically create subtitles of your video, enrich the metadata of your videos to make it easily searchable or do statistics on the content of your videos.
This integration currently only support English, if you need other languages you can combine the Video Convertion integration to extract the audio file and the Audio Speech To Text Integration.
A video file in a format decodable by ffmpeg. This includes common video formats, including files with a, , or extension.
A JSON file containing all the speech detected, when in the video it was detected and the possible alternative words.
Google Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products, such as Google Search and YouTube.Discover all Google Cloud integrations →
Seconds to deploy