Aug SEP Oct
20
2019 2020 2021
success
fail

About this capture

COLLECTED BY

Collection: GDELT Project

TIMESTAMPS

The Wayback Machine - http://web.archive.org/web/20200920132220/https://cloud.google.com/speech-to-text
 












Docs   Support  











AI and machine learning products  


Contact Sales   Get started for free
 














Why Google  

More  



Solutions  

More  



Products  

More  



Pricing  

More  



Getting started  

More  



Docs  

Support  

Console  

Contact Sales  

Get started for free  





Groundbreaking solutions. Transformative know-how.  

Learn more  

Why Google Cloud  

Choosing Google Cloud  

Trust and security  

Open cloud  

Global infrastructure  

Analyst reports  

Customer stories  

Partners  

Google Cloud Blog  

Events  



Industry Solutions  

Retail  

Financial Services  

Healthcare and Life Sciences  

Media and Entertainment  

Telecommunications  

Gaming  

Manufacturing  

Energy  

Government  

Education  

Small and Medium Business  

Cloud Natives  

See all solutions  

Application Modernization  

Hybrid and Multi-cloud Application Platform  

Cloud-Native App Development  

Serverless solutions  

DevOps  

Configuration Management  

Continuous Delivery (CD)  

Continuous Integration (CI)  

Infrastructure as Code  

Secrets Management  

Mainframe Modernization  

Hosting  

Artificial Intelligence  

Build and Use AI  

Contact Center AI  

Document AI  

Cloud Talent Solution  

Business Application Platform  

New Business Channels Using APIs  

Unlocking Legacy Applications Using APIs  

Open Banking APIx  

Data Management  

Database Migration  

Database Modernization  

Google Cloud Databases  

Migrate Oracle workloads to Google Cloud  

Open Source Databases  

SQL Server on Google Cloud  

Digital Transformation  

Business Continuity  

Digital Innovation  

Operational Efficiency  

COVID-19 Solutions  

COVID-19 Solutions for the Healthcare Industry  

Infrastructure Modernization  

VM Migration  

SAP on Google Cloud  

High Performance Computing  

Windows on Google Cloud  

Data Center Migration  

Marketing Technology  

Active Assist  

Virtual Desktops  

Productivity and Collaboration  

G Suite  

G Suite Essentials  

Cloud Identity  

Chrome Enterprise  

Cloud Search  

Security  

Application Security  

Security Analytics and Operations  

BeyondCorp Remote Access  

Smart Analytics  

Data Warehouse Modernization  

Stream Analytics  

Marketing Analytics  

Data Lake Modernization  

Business Intelligence  



Featured Products  

Compute Engine  

Cloud Storage  

Cloud SDK  

Cloud SQL  

Google Kubernetes Engine  

BigQuery  

Cloud CDN  

Dataflow  

Operations  

Cloud Run  

Cloud Functions  

See all products (100+)  

AI and Machine Learning  

Speech-to-Text  

Vision AI  

Text-to-Speech  

Cloud Translation  

Cloud Natural Language  

AutoML  

AI Platform  

Video AI  

AI Infrastructure  

Dialogflow  

AutoML Tables  

See all AI and machine learning products  

API Management  

Apigee API Platform  

Analyze APIs  

Monetize APIs  

Apigee Hybrid  

Apigee Sense  

Cloud Endpoints  

Developer Portal  

Apigee Healthcare APIx  

Apigee Open Banking APIx  

Cloud Healthcare API  

AppSheet  

Compute  

Compute Engine  

App Engine  

Cloud GPUs  

Migrate for Compute Engine  

Preemptible VMs  

Shielded VMs  

Sole-Tenant Nodes  

Bare Metal  

Recommender  

VMware Engine  

Cloud Run  

See all compute products  

Containers  

Google Kubernetes Engine  

Container Registry  

Container Security  

Cloud Build  

Deep Learning Containers  

Kubernetes Applications  

Artifact Registry  

Knative  

Cloud Run  

Cloud Code  

Data Analytics  

BigQuery  

Looker  

Dataflow  

Pub/Sub  

Dataproc  

Cloud Data Fusion  

Cloud Composer  

Data Catalog  

Dataprep  

Google Data Studio  

Google Marketing Platform  

Cloud Life Sciences  

Databases  

Cloud Bigtable  

Firestore  

Memorystore  

Cloud Spanner  

Cloud SQL  

Firebase Realtime Database  

Developer Tools  

Cloud SDK  

Container Registry  

Cloud Build  

Cloud Source Repositories  

Cloud Scheduler  

Tekton  

Cloud Tasks  

Cloud Code  

Tools for Visual Studio  

Tools for Eclipse  

Cloud Code for IntelliJ  

See all developer tools  

Healthcare and Life Sciences  

Apigee Healthcare APIx  

Cloud Healthcare API  

Cloud Life Sciences  

Hybrid and Multi-cloud  

Anthos  

Cloud Run for Anthos  

Google Cloud Marketplace for Anthos  

Migrate for Anthos  

Operations  

Cloud Build  

Traffic Director  

Apigee API Management  

Internet of Things  

Cloud IoT Core  

Edge TPU  

Management Tools  

Cloud Shell  

Cloud Console  

Cloud Deployment Manager  

Cloud Mobile App  

Cloud APIs  

Private Catalog  

Cost Management  

Media and Gaming  

Game Servers  

Zync Render  

Anvato  

OpenCue  

Migration  

BigQuery Data Transfer Service  

Cloud Data Transfer  

Cloud Foundation Toolkit  

Transfer Service  

Migrate for Anthos  

Migrate for Compute Engine  

Transfer Appliance  

VM Migration  

Networking  

Cloud Armor  

Cloud CDN  

Cloud DNS  

Cloud Load Balancing  

Cloud NAT  

Hybrid Connectivity  

Network Intelligence Center  

Network Service Tiers  

Network Telemetry  

Traffic Director  

Virtual Private Cloud  

Service Directory  

Operations  

Cloud Logging  

Cloud Monitoring  

Error Reporting  

Kubernetes Engine Monitoring  

Service Monitoring  

Cloud Trace  

Cloud Profiler  

Cloud Debugger  

Transparent Service Level Indicators  

Security and Identity  

Cloud IAM  

Assured Workloads  

Cloud Key Management  

Confidential Computing  

Security Command Center  

Cloud Data Loss Prevention  

Managed Service for Microsoft Active Directory  

Access Transparency  

Titan Security Key  

Secret Manager  

See all security and identity products  

Serverless Computing  

Cloud Run  

Cloud Functions  

App Engine  

Workflows  

Storage  

Cloud Storage  

Filestore  

Persistent Disk  

Cloud Storage for Firebase  

Local SSD  

Archival Storage  

Cloud Data Transfer  

G Suite Essentials  



Do more for less with Google Cloud  

Contact sales  

Google Cloud Platform  

Overview  

Price list  

Calculators  

Free on Google Cloud  

More Cloud Products  

G Suite  

Google Maps Platform  

Cloud Identity  

Apigee  

Firebase  

Zync Render  



Get started with Google Cloud  

Try GCP Free  

Get Started  

Resources to Start on Your Own  

Quickstarts  

GCP Marketplace  

Training  

Certification  

Get Help from an Expert  

Consulting  

Technical Account Management  

Find a Partner  

Become a Partner  

More ways to get started  






Home  


Products  


AI and machine learning products  


Cloud Speech-to-Text  


 


Jump to  
Speech-to-Text 







Speech-to-Text 



Accurately convert  speech into text using an API powered by Googles AI  technologies.  
Try it for free  

action/check_circle_24px  Created with Sketch.  
Transcribe your content in real time or from stored files  

action/check_circle_24px  Created with Sketch.  
Deliver a better user experience in products through  voice commands
 

action/check_circle_24px  Created with Sketch.  
Gain insights from customer interactions to improve your  service
 





Gartner logo

Gartner names  Google Cloud a Leader in the 2020 Magic Quadrant for Cloud  AI Developer Services.
 

Learn more  







Benefits
 



State-of-the-art accuracy

 

Apply Googles most advanced deep learning neural network  algorithms for automatic speech recognition (ASR).
 


Global reach

 

Meet your users where they are, globally, with voice  recognition that supports more than  125 languages and variants.  


Flexible deployment

 

Deploy speech recognition wherever you need, whether in  the cloud with the API or on-premises with  Speech-to-Text On-Prem.  






Demo
 

Put Speech-to-Text into action 





Key features
 

Key features 





Speech adaptation
 
Customize speech recognition to transcribe  domain-specific terms and rare words by providing hints  and boost your transcription accuracy of specific words or  phrases. Automatically convert spoken numbers into  addresses, years, currencies, and more using classes.
 

Domain-specific models
 
Choose from a  selection of trained models  for voice control and phone call and video transcription  optimized for domain-specific quality requirements. For  example, our enhanced phone call model is tuned for audio  originated from telephony, such as phone calls recorded at  an 8khz sampling rate.
 

Streaming speech recognition
 
Receive real-time speech recognition results as the API  processes the audio input streamed from your applications  microphone or sent from a prerecorded audio file (inline  or through Cloud Storage).
 

Speech-to-Text On-Prem
 
Have full control over your infrastructure and protected  speech data while leveraging Googles speech recognition  technology  on-premises,  right in your own private data centers.  Contact sales to  get started.
 
View all features  




Blog image speech to text


BLOG  

Enhanced models and features now available in new languages
 







Customers
 

Customers 




Cast box logo




Castbox uses Speech-to-Text to  deliver its in-audio search service for podcasts.
  Read the story  




Story highlights

 




Enabling users to  search audio content for words or phrases
 




Audio-to-text  conversion accuracy rates of greater than 96%
 




Typical search  queries with latency of just 50 milliseconds
 



Industry

 




Technology
 
















Voximplant uses Speech-to-Text to help companies build voice solutions and boost the number of calls they can handle.
 








InteractiveTel uses Speech-to-Text to provide accurate analysis of voice communications and increased customer satisfaction to its clients.
 








With Speech-to-Text and Vision API, Ananda Development created a mobile application to automate and streamline condominium inspections.
 




See all customers  




What's new
 



What's new



Sign up  for Google Cloud newsletters to receive product updates,  event information, special offers, and more.
 











Video  

Next '20 OnAir: Measuring and improving Speech-to-Text accuracy   Watch video  






Video  

Automated Subtitles with AI   Watch video  



YouTube video image


Video  

Solving for accessible phone calls with Speech-to-Text and Text-to-Speech   Watch video  



Speech to text logo


Video  

Getting Started with Converting speech to text with Node.js   Watch video  

















Documentation  




Documentation 










Google Cloud Basics 
Speech-to-Text basics

Learn the fundamental  concepts in Speech-to-Text.
 


Learn more  





Quickstart 
Quickstart: Using the gcloud tool

Send an audio  transcription request to Speech-to-Text using the  gcloud tool from the command line.
 


Learn more  





Best Practice 
Best  practices

Review the best  practices for transcribing audio with  Speech-to-Text.
 


Learn more  





Google Cloud Basics 
Supported languages

Learn which languages  are available for Speech-to-Text, plus the features  and recognition models available for each.
 


Learn more  





Google Cloud Basics 
Speech-to-Text On-Prem

Learn more about  Speech-to-Text On-Prem, which enables easy integration  of Google speech recognition technology into your  on-premises solutions.
 


Learn more  





View all product documentation  









Explore more docs

 




Quickstarts

Get a quick intro to using this product.
 



How-to guides

Learn to complete specific tasks with this product.
 



Tutorials

Browse walkthroughs of common uses and scenarios for this product.
 



APIs & references

View APIs, references, and other resources for this product.
 








Release notes  

Read about the latest releases for Speech-to-Text
 







Use cases
 

Use cases 





Use case 

Improve  customer service  

Empower your customer service system by adding IVR  (interactive voice response) and agent conversations to your  call centers. Perform analytics on your conversation data to  gain more insights into the calls and your customers.  Speech-to-Text and its enhanced phone call models are  already powering Google Clouds powerful solution,  Contact Center AI.  

Using contact center AI with speech to text technology to improve customer service





Use case 

Enable voice  control  

Implement voice commands such as turn the volume up, and  voice search such as saying what is the temperature in  Paris? Combine this with the  Text-to-Speech API  to deliver voice-enabled experiences in IoT (Internet of  Things) applications.
 
Workflow of voice control using speech to text API





Use case 

Transcribe  multimedia content  

Transcribe your audio and video to include captions and  improve your audience reach and experience. Add subtitles to  your content real time to your streaming content. Our  video transcription model  is ideal for indexing or subtitling video and/or  multispeaker content and uses machine learning technology  that is similar to video captioning on YouTube.
 
Transcribe multimedia content workflow





View all technical guides  




All features
 


All features 




Global vocabulary Support your global user base with Speech-to-Text’s extensive language support in over 125 languages and variants.
Streaming speech recognition Receive real-time speech recognition results as the API processes the audio input streamed from your application’s microphone or sent from a prerecorded audio file (inline or through Cloud Storage).
Speech adaptation Customize speech recognition to transcribe domain-specific terms and rare words by providing hints and boost your transcription accuracy of specific words or phrases. Automatically convert spoken numbers into addresses, years, currencies, and more using classes.
Speech-to-Text On-Prem Have full control over your infrastructure and protected speech data while leveraging Google’s speech recognition technology on-premises, right in your own private data centers. Contact sales to get started.
Multichannel recognition Speech-to-Text can recognize distinct channels in multichannel situations (e.g., video conference) and annotate the transcripts to preserve the order.
Noise robustness Speech-to-Text can handle noisy audio from many environments without requiring additional noise cancellation.
Domain-specific models Choose from a selection of trained models for voice control and phone call and video transcription optimized for domain-specific quality requirements. For example, our enhanced phone call model is tuned for audio originated from telephony, such as phone calls recorded at an 8khz sampling rate.
Content filtering Profanity filter helps you detect inappropriate or unprofessional content in your audio data and filter out profane words in text results.
Auto-detect language (beta) Specify up to four language codes and Speech-to-Text will identify the correct language spoken in multilingual scenarios.
Automatic punctuation (beta) Speech-to-Text accurately punctuates transcriptions (e.g., commas, question marks, and periods).
Speaker diarization (beta) Know who said what by receiving automatic predictions about which of the speakers in a conversation spoke each utterance.







Pricing
 


Pricing 




The first 60 minutes of Speech-to-Text successfully  processed each month is free, then it is priced per 15  seconds of audio. Specific rates vary depending on the model  used, if there is data logging, and the number of audio  channels.
 


View pricing details  






Take the next  step 


Start  building on Google Cloud with $300 in free credits and 20+  always free products. 
Try it for free  




Need help getting started? 
Contact sales  


Work with a trusted partner 
Find a partner  


Continue browsing 
See all products  









Choosing Google Cloud  

Trust and security  

Open cloud  

Global infrastructure  

Customers and case studies  

Analyst reports  

Whitepapers  





GCP pricing  

G Suite pricing  

Maps Platform pricing  

See all products  





Infrastructure modernization  

Data management  

Application modernization  

Smart analytics  

Artificial Intelligence  

Security  

Productivity & work transformation  

Industry solutions  

DevOps solutions  

Small business solutions  

See all solutions  





GCP documentation  

GCP quickstarts  

Google Cloud Marketplace  

G Suite Marketplace  

Support  

Tutorials  

Training  

Certifications  

Google Developers  

Google Cloud for Startups  

System status  

Release Notes  





Contact sales  

Find a Partner  

Become a Partner  

Blog  

Events  

Podcast  

Community  

Press center  

Google Cloud on YouTube  

GCP on YouTube  

G Suite on YouTube  

Follow on Twitter  

Join User Research  

We're hiring. Join Google Cloud!  






About Google  

Privacy  

Site terms  

Google Cloud terms  

Sign up for the Google Cloud newsletter   Subscribe