Audible reading scenario solution _ novel listening _ news broadcasting _ barrier free reading

preferential Exclusive for first purchase, with voice synthesis as low as 5.6 yuan/10000 times , snap up now >

API online debugging HOT

Quickly debug speech synthesis effect

Privatization deployment HOT

Support multiple configuration options, out of the box

Customer Stories

Helping Dejian Novels Provide High quality Listening Experience

New generation AIGC customized sound library NEW

Rapid customization and efficient empowering media creation

Functional experience

Scheme introduction

Application scenarios

Characteristic advantages

Usage

Customer Stories

Functional experience

Scheme introduction

From seeing to listening, experience upgrading

One stop audio reading solution, which supports multiple voice synthesis methods and various audio library customization needs, leaves the traditional "machine sound", and creates a more immersive and personalized voice experience

Short text online composition

Covering a variety of characteristic audio libraries, it supports polyphone tagging, and can be flexibly configured for speech speed, tone, volume, etc., which is widely applicable to business scenarios such as order broadcast, information news, human-computer interaction, etc View details

Long text online composition

Quickly convert ultra long text into stable, smooth, full and true audio. It supports the one-time synthesis of up to 100000 words of text without splitting and splicing, and provides a variety of high-quality sound banks and language choices View details

Offline speech synthesis

Offline service with real-time response, supporting multiple audio libraries and mixed reading in Chinese and English, meeting the broadcast needs of APP, intelligent hardware, etc. in the non network or weak network environment, and providing a stable, consistent, smooth and natural synthetic experience View details

Customize sound library

20-200 sentences can reproduce the highly restored, natural and lifelike exclusive sound library, which can be delivered in 1-4 weeks, to create personalized and unique sound for the business, to help improve product features and play personalized marketing View details

Application scenarios

Novel listening

Information broadcast

Accessible reading

AIGC video production

Fully automated audio book production

AI intelligent picture book, which automatically distinguishes roles and emotions according to the context, realizes the supernatural multi sound broadcasting of audio books and radio dramas, replacing the expensive and long cycle live production scheme

Characteristic advantages

Realistic and vivid composite effect

High fidelity of text to speech, stable, clear and fluent sound quality, excellent emotional expression, The badcase rate is as low as 1% to create a full telepresence effect

Scene sound library with different styles

Rich audio library options, including the old, young and young age groups, 9 emotions such as joy, anger, sorrow, fear, etc., to meet the voice needs of various application scenarios and create a unique listening experience for users

Customized audio library comparable to vocals

At least 20-200 sentences can be recorded, and the delivery can be completed in 1-4 weeks at the earliest; Support bilingual synthesis in Chinese and English to create a dedicated sound library with high restore, high definition and high stability

High cost-effective intelligent services

AI whole process automatic production, input text to obtain real-time multi role, multi emotional effect; High level of intelligent and large-scale technology, no need for too much manual intervention, greatly saving business use time and economic costs

Usage

Public cloud service

API interface and multilingual service end SDK are provided, which can be quickly integrated to PC end/mobile end/Pad end, and can provide private cluster public cloud services for customized models

Use Now

Technical Documentation

Offline speech synthesis SDK

Support Android IOS and Linux platforms, millisecond response; It supports pure offline and offline online integration modes, and can automatically switch according to network conditions

Use Now

SDK download

Privatization deployment

Support single machine, multi machine, virtual machine, cluster deployment, container application services, and multiple authentication schemes to meet the requirements of enterprises for data security

Business consulting

Customer Stories

Fast entry

AI Competency Experience Center

Develop resources

QQ Support Group

Ecology and market

common problem

Pre sales consultation

After sales intelligent assistant

Technical work order

Feedback

customer service telephone numbers
400-920-8999

Talent recruitment

Experience AI capabilities immediately Open Baidu APP "Scan"

Get the latest AI information Follow "Baidu AI" WeChat official account

QQ Support Group

Baidu Voice: five hundred and eighty-eight million three hundred and sixty-nine thousand two hundred and thirty-six

Text recognition: one billion fifty-five million six hundred and twenty-three thousand eight hundred and twenty-seven

Custom template OCR: one billion fifty-five million four hundred and two thousand seven hundred and twenty-one

Face recognition: six hundred and ninety-two million four hundred and fifty thousand eight hundred and fifty-two

Human body analysis: eight hundred and sixty million three hundred and thirty-seven thousand eight hundred and forty-eight

Content review: three hundred and seventy-five million seven hundred and sixty-five thousand one hundred and ninety-four

PaddlePaddle: seven hundred and seventy-eight million two hundred and sixty thousand eight hundred and thirty

Image recognition: three hundred and twelve million one hundred and fifty-six thousand seven hundred and eighty-two

EasyDL： six hundred and fourteen million nine hundred and fifty-one thousand two hundred and thirty-nine

Image search: one billion sixty-seven million two hundred and seventy-six thousand one hundred and fifty-four

Video analysis: six hundred and thirty-two million four hundred and seventy-three thousand one hundred and fifty-eight

Baidu AR: four hundred and seventy-two million eighty-one thousand one hundred and nineteen

Natural language: one billion fifty-one million four hundred and thirty-six thousand five hundred and fourteen

UNIT： one billion seventy-four million four hundred and ten thousand one hundred and eighty-nine

Baidu Translate: two hundred and fourteen million eight hundred and fifty-seven thousand seven hundred and six

Image effect enhancement: one billion ninety-two million three hundred and thirty-eight thousand eight hundred and twenty-nine

Data intelligence: six hundred and fifty million five hundred and ninety-six thousand eight hundred and twenty-nine

Knowledge map: six hundred and fifty-five million eight hundred and fifty-four thousand seven hundred and eighty-six

DuerOS： six hundred and four million five hundred and ninety-two thousand and twenty-three

Baidu AI open platform: two hundred and twenty-four million nine hundred and ninety-four thousand three hundred and forty

Intelligent writing: seven hundred and forty-three million nine hundred and twenty-six thousand five hundred and twenty-three

EdgeBoard： one billion sixty million six hundred and twenty-three thousand three hundred and fifty-two

Voice self training platform: six hundred and eighty-six million two hundred and sixty-seven thousand five hundred and twenty-one

Far field voice development kit: two hundred and ten million ninety-three thousand two hundred and four

Cooperation consulting

Pre sales consultation

Fill in your business needs, and the exclusive account manager will contact you as soon as possible to provide one-on-one consulting services

After sales intelligent assistant

Intelligent diagnosis to quickly solve the use problems

Contact Sales

For more information, please call 400-920-8999 to 1

Experience AI

Web end to AI Competency Experience Center

The mobile terminal opens Baidu APP "Scan"