Audio speech-to-text

Google Cloud

Audio speech-to-text

Transcript audio file to text.

Deploy now
Ready to go in minutes
Ready to go in minutes

Deploy the Audio speech-to-text integration to process data on your processing stack in no time. Koyeb integrations are preconfigured applications to ingest, process, and store your data.

Infinite combination
Infinite combination

Compose processing workflows with ready to use integrations and your own code if needed. Build your business logic with our SDKs in Golang, Node.js, and Python or using Docker images.

No infrastructure management
No infrastructure management

Koyeb all-in-one platform is backed by a highly available infrastructure. Build advanced operation pipelines for your data with zero server maintenance.

Integration overview

Audio speech-to-text integration enables automated speech recognition with the Google Cloud Platform Speech-to-text API. The integration supports 120 languages and variants to support your global user base.

The integration process 30 seconds of audio in 15 seconds on average. Depending on the audio quality, the processing can take significantly longer.

For each audio file to process, you must specify:

  • The encoding scheme of the supplied audio: MP3, FLAC, LINEAR16, MULAW, AMR, AMR_WB, OGG OPUS, SPEEX WITH HEADER BYTE.
  • The sample rate in Hertz - optional for FLAC and WAV file where the sample rate is included in the source file header.
  • The language code to use for speech recognition of the supplied audio.

How it works

Audio speech-to-text integration takes an audio file as input and returns:

  • A JSON file containing the speech recognition results.

For instance, if a file name audio1.mp3 is uploaded and processed by this integration, the result of the processing will be:

  • audio1.mp3.gcp-audio-speech-to-text.json

About the Google Cloud Service Provider

Google Cloud

Google Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products, such as Google Search and YouTube.

Discover all Google Cloud integrations →
20+

20+

Region

50+

50+

Integrations

60

60

Seconds to deploy


Ready to get started?

Request your invitation now and deploy your processing stack in minutes.

footer-frame