information community file
Audible Reading Solution
Provide highly anthropomorphic, natural and smooth text to speech services, open up human-computer interaction closed-loop, support multi role, emotional voice selection and personalized audio library customization, comprehensively solve the problems of high cost and low efficiency of traditional audio production, and meet the voice synthesis requirements of various scenarios such as extensive reading, intelligent broadcasting, human-computer interaction, etc
API online debugging HOT
Quickly debug speech synthesis effect
Privatization deployment HOT
Support multiple configuration options, out of the box
Customer Stories
Helping Dejian Novels Provide High quality Listening Experience
New generation AIGC customized sound library NEW
Rapid customization and efficient empowering media creation
Functional experience
Scheme introduction
From seeing to listening, experience upgrading
One stop audio reading solution, which supports multiple voice synthesis methods and various audio library customization needs, leaves the traditional "machine sound", and creates a more immersive and personalized voice experience
Short text online composition
Covering a variety of characteristic audio libraries, it supports polyphone tagging, and can be flexibly configured for speech speed, tone, volume, etc., which is widely applicable to business scenarios such as order broadcast, information news, human-computer interaction, etc View details
Long text online composition
Quickly convert ultra long text into stable, smooth, full and true audio. It supports the one-time synthesis of up to 100000 words of text without splitting and splicing, and provides a variety of high-quality sound banks and language choices View details
Offline speech synthesis
Offline service with real-time response, supporting multiple audio libraries and mixed reading in Chinese and English, meeting the broadcast needs of APP, intelligent hardware, etc. in the non network or weak network environment, and providing a stable, consistent, smooth and natural synthetic experience View details
Customize sound library
20-200 sentences can reproduce the highly restored, natural and lifelike exclusive sound library, which can be delivered in 1-4 weeks, to create personalized and unique sound for the business, to help improve product features and play personalized marketing View details
Application scenarios
Novel listening
Information broadcast
Accessible reading
AIGC video production
Fully automated audio book production
AI intelligent picture book, which automatically distinguishes roles and emotions according to the context, realizes the supernatural multi sound broadcasting of audio books and radio dramas, replacing the expensive and long cycle live production scheme
Characteristic advantages
Usage
Public cloud service
API interface and multilingual service end SDK are provided, which can be quickly integrated to PC end/mobile end/Pad end, and can provide private cluster public cloud services for customized models
Offline speech synthesis SDK
Support Android IOS and Linux platforms, millisecond response; It supports pure offline and offline online integration modes, and can automatically switch according to network conditions
Privatization deployment
Support single machine, multi machine, virtual machine, cluster deployment, container application services, and multiple authentication schemes to meet the requirements of enterprises for data security
Customer Stories
Technical capability
Voice technology
Character recognition
Face and Human Body
Image technology
Language and knowledge
video technique