| Aug | SEP | Oct |
| 20 | ||
| 2019 | 2020 | 2021 |
COLLECTED BY
Collection: GDELT Project
Tekton
Kubernetes-native resources for declaring CI/CD pipelines.
Cost Management
Tools for monitoring, controlling, and optimizing your costs.
●Media and Gaming
Zync Render
Platform for 3D modeling and rendering on Google Cloud infrastructure.
Anvato
Media content platform for OTT services and video streaming.
OpenCue
Open source render manager for visual effects and animation.
| Global vocabulary | Support your global user base with Speech-to-Text’s extensive language support in over 125 languages and variants. |
| Streaming speech recognition | Receive real-time speech recognition results as the API processes the audio input streamed from your application’s microphone or sent from a prerecorded audio file (inline or through Cloud Storage). |
| Speech adaptation | Customize speech recognition to transcribe domain-specific terms and rare words by providing hints and boost your transcription accuracy of specific words or phrases. Automatically convert spoken numbers into addresses, years, currencies, and more using classes. |
| Speech-to-Text On-Prem | Have full control over your infrastructure and protected speech data while leveraging Google’s speech recognition technology on-premises, right in your own private data centers. Contact sales to get started. |
| Multichannel recognition | Speech-to-Text can recognize distinct channels in multichannel situations (e.g., video conference) and annotate the transcripts to preserve the order. |
| Noise robustness | Speech-to-Text can handle noisy audio from many environments without requiring additional noise cancellation. |
| Domain-specific models | Choose from a selection of trained models for voice control and phone call and video transcription optimized for domain-specific quality requirements. For example, our enhanced phone call model is tuned for audio originated from telephony, such as phone calls recorded at an 8khz sampling rate. |
| Content filtering | Profanity filter helps you detect inappropriate or unprofessional content in your audio data and filter out profane words in text results. |
| Auto-detect language (beta) | Specify up to four language codes and Speech-to-Text will identify the correct language spoken in multilingual scenarios. |
| Automatic punctuation (beta) | Speech-to-Text accurately punctuates transcriptions (e.g., commas, question marks, and periods). |
| Speaker diarization (beta) | Know who said what by receiving automatic predictions about which of the speakers in a conversation spoke each utterance. |