information community file
speech recognition Video Introduction
It adopts the internationally leading streaming end-to-end voice language integration modeling algorithm to quickly and accurately recognize speech into text, and supports multiple scenarios such as voice interaction, voice content analysis, robot dialogue, etc. of mobile phone applications
API online debugging HOT
Quickly debug speech recognition effect
Privatization deployment
Support multiple configuration options, out of the box
Customer Stories
Speech recognition helps iQIYI optimize search experience
Heavy upgrade of voice subtitle service NEW
AI helps comprehensively improve production efficiency
Application scenarios
Mobile app voice input
Robot dialogue
Voice content analysis
Real time voice transcription
Mobile app voice input
Real time speech recognition into text, suitable for voice chat, voice input, voice search, voice order, voice command, voice question and answer and other scenarios
Cooperation cases
Characteristic advantages
Leading technology
Adopting the leading international streaming end-to-end voice language integration modeling method, integrating Baidu natural language processing technology, the near-field Mandarin Chinese recognition accuracy rate reached 98%
Self training exclusive model
Support the self-help training model on the voice self training platform, upload the vocabulary text to complete the training with zero code, accurately improve the vocabulary recognition rate in the business field by 5-20%, and can be used exclusively
Simple and fast
API and multiple SDKs are supported, which can be accessed quickly and simply based on Demo. The latest identification decoding technology is adopted, which greatly improves the identification speed
Efficient and stable
Proprietary service cluster, providing enterprise level stable services, flexible high concurrency bearing and high reliability guarantee
Product pricing
Short Speech Recognition Standard
Short Speech Recognition Extreme Edition
Real time speech recognition
Audio file transfer
Times package prepayment
Applicable to enterprises with predictable call volume
Free adjustment amount
2 million times/enterprise account
term of validity
1 year
Concurrency
50 (capacity expansion is supported)
technical support
7 * 24 hours
1 million times
two thousand and four hundred
element
Buy Now
Pay after call
Applicable to enterprises that are not convenient to estimate the adjustment amount
Free adjustment amount
2 million times/enterprise account
Concurrency
50 (capacity expansion is supported)
technical support
7 * 24-hour response
Adjustment amount ≤ 6 million times
zero point zero zero three four
RMB/time
Subscription payment
Custom Edition
Applicable to key customers who need special mode
Enjoy special price for key customers
Purchase more concurrent
Other payment mode purchase
Cooperation consulting
Pricing Instructions
The product is free of charge when it is launched. After use, you can choose two billing methods: prepaid or pay as you go. The generated billing calls have priority in consuming the number of times package quota, and the excess part is billed by the volume ladder
Charging standard
Get instant voice recognition
Register to receive the free product experience package
Use Now
Technical capability
Voice technology
Character recognition
Face and Human Body
Image technology
Language and knowledge
video technique