Audio and video processing

Multimedia Cloud Processing (MCP) provides efficient, intelligent, and stable audio and video processing for large media libraries, including standard transcoding, intelligent ultra-clear transcoding, AI video processing, intelligent frame extraction, video quality inspection, and digital watermarking, delivering smooth HD playback across devices.

Featured on the best-seller list for "live streaming" products

Popular Transcoding Package

Entry (limited-time special)

Suitable for individuals and small or micro businesses with very few videos.

  • Transcoding package duration: 1,000 minutes
  • Encoding specification: H.264 LD standard transcoding
  • Validity: 12 months
  • Price: ¥17 (regular price ¥21), limited-time 25% off

Basic (limited-time special)

Suitable for start-ups with up to 2 hours of video per day.

  • Transcoding package duration: 20,000 minutes
  • Encoding specification: H.264 LD standard transcoding
  • Validity: 12 months
  • Price: ¥280 (regular price ¥412), limited-time 32% off

Standard

Suitable for growing businesses with up to 50 hours of video per day.

  • Transcoding package duration: 500,000 minutes
  • Encoding specification: H.264 LD standard transcoding
  • Validity: 12 months
  • Price: ¥5,150 (regular price ¥10,300), limited-time 50% off

Advanced

Suitable for large-scale video applications, online education, e-commerce, and similar businesses.

  • Transcoding package duration: 2,000,000 minutes
  • Encoding specification: H.264 LD standard transcoding
  • Validity: 12 months
  • Price: ¥12,360 (regular price ¥41,200), limited-time discounts of up to 70% off
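
To compare the four packages above, it can help to work out the effective price per transcoding minute. The sketch below only divides each promotional price by the package duration listed on this page. The tier keys ("entry", "basic", "standard", "highest") are shorthand labels chosen for this example, not official product identifiers.

```python
# Package duration (minutes) and promotional price (yuan) as listed on this page.
# The dictionary keys are illustrative labels, not official SKU names.
PACKAGES = {
    "entry": (1_000, 17),
    "basic": (20_000, 280),
    "standard": (500_000, 5_150),
    "highest": (2_000_000, 12_360),
}


def price_per_minute(minutes, price_yuan):
    """Return the effective price in yuan for one minute of transcoding."""
    if minutes <= 0:
        raise ValueError("minutes must be positive")
    return price_yuan / minutes


def per_minute_table(packages):
    """Map each tier label to its effective per-minute price."""
    return {name: price_per_minute(m, p) for name, (m, p) in packages.items()}


if __name__ == "__main__":
    for name, unit_price in per_minute_table(PACKAGES).items():
        print(f"{name}: {unit_price:.5f} yuan per minute")
```

Under these listed figures, the per-minute price falls as the package size grows, so larger packages are cheaper per minute if the minutes are actually used within the validity period.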

Product advantages

High quality

Using deep learning models, the service dynamically selects optimal encoding parameters according to video complexity and applies ROI-based enhancement tuned to human visual perception, greatly improving picture clarity.

High efficiency

Tasks are scheduled intelligently by user level, queue level, video duration, and complexity so that high-priority tasks are processed first. Long files are sliced and processed in parallel, which greatly improves transcoding speed.
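
The scheduling idea above can be illustrated with a small priority queue. This is a minimal sketch under stated assumptions, not the service's actual scheduler: the ordering rule (higher user level first, then higher queue level, then less estimated work, taken as duration times complexity), the field names, and the fixed slice length are all hypothetical choices made for the example.

```python
import heapq


def priority_key(task):
    """Smaller keys are served first (assumed rule, for illustration only)."""
    estimated_work = task["duration_s"] * task["complexity"]
    return (-task["user_level"], -task["queue_level"], estimated_work)


def schedule(tasks):
    """Return task names in the order a priority queue would serve them."""
    # The index breaks ties so that task dictionaries are never compared.
    heap = [(priority_key(t), i, t) for i, t in enumerate(tasks)]
    heapq.heapify(heap)
    order = []
    while heap:
        _, _, task = heapq.heappop(heap)
        order.append(task["name"])
    return order


def slice_ranges(duration_s, slice_s):
    """Split a long file into (start, end) ranges that can be transcoded in parallel."""
    if slice_s <= 0:
        raise ValueError("slice_s must be positive")
    total = float(duration_s)
    ranges = []
    start = 0.0
    while start < total:
        end = min(start + slice_s, total)
        ranges.append((start, end))
        start = end
    return ranges
```

For example, `slice_ranges(250, 100)` yields three ranges covering 0 to 100, 100 to 200, and 200 to 250 seconds, each of which could be handed to a separate worker.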

High stability

Dedicated clusters provide a robust, stable transcoding environment. Distributed deployment scales dynamically to absorb sudden surges in business volume, transcoding exceptions are monitored and alerted on in real time, and technical support is available 24/7.

Low cost

Baidu Smart Cloud's transcoding service offers the lowest price on the market, so you get the best service at the lowest cost. In addition, intelligent ultra-clear transcoding can substantially reduce bandwidth and storage costs.

Application scenarios

Short-drama apps
Broadcast and media
Live streaming

A high-traffic short-drama platform in China

The platform hosts a very large volume of online short dramas and wants to cut video storage and distribution costs, while still giving users the highest possible definition and smoother playback.

What we provide

  • Intelligent ultra-clear perceptual encoding

    Using a self-developed encoder and perceptual encoding technology, it gives customers a transcoding solution that reduces bitrate while improving image quality and clarity.

Product Functions

  • Digital watermarking for copyright protection

    Watermark carriers
    Supports embedding and extracting watermarks in images and videos.
    Watermark types
    Supports embedding image and text watermarks.
    Attack resistance for video watermark extraction
    Resists, to a certain extent, attacks such as bitrate changes, image scaling, image occlusion, and brightness/contrast changes. Digital watermarks can also be extracted from videos that have been re-recorded.
    Attack resistance for image watermark extraction
    Resists, to a certain extent, cropping, occlusion, scaling, and screenshot attacks.
  • Standard audio and video transcoding

    Fast transcoding
    An AI model predicts the slicing strategy, and transcoding can run at up to 50x speed.
    Instant transcoding
    Supports conversion between H.264 and H.265, meeting users' demand for smooth playback while saving storage costs when recording and replaying live streams.
    Perceptual coding
    Delivers on-demand and live streaming services at a lower bitrate while giving users a higher-quality experience.
  • AI video processing

    Intelligent green screen matting
    For video recorded against a green screen, automatically extracts the portrait and generates a transparent-background video in WebP format, so that any background can be added to the generated video.
    Black border detection and cropping
    Removes the redundant black borders that appear when a video is redistributed to devices with different screen sizes. This improves the viewing experience, saves bitrate, and reduces file size.
    Intelligent subtitle and watermark removal
    Intelligently removes visible watermarks and subtitles from video, for use in video reposting, re-editing, and redistribution scenarios.
    Intelligent landscape-to-portrait conversion
    Uses an object detection algorithm to identify the key people and highlight regions in the frame, and dynamically adjusts the crop window to convert the video from landscape (16:9) to portrait (9:16).
  • Intelligent frame extraction

    Brightness detection
    Detects video clips that are too bright or too dark (including fully black and fully white screens), beyond the comfortable range for human eyes.
    Noise detection
    Detects video segments whose images contain periodic superimposed noise such as bands, ripples, or mesh patterns.
    Blocking/interlacing artifact detection
    Detects segments where, as the video bitrate drops, discontinuities appear at block boundaries and form visible defects in the reconstructed image. Also detects segments with interlacing artifacts in moving images caused by compression during video post-processing.
    Volume detection
    Detects audio clips whose volume is too high or too low, beyond the comfortable range for human ears. Also detects intermittent audio caused by an unstable signal source during recording.
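
As a rough illustration of the volume detection described above, the sketch below measures the RMS level of fixed-size windows of audio samples and flags windows that fall outside a comfort range. It is not the service's algorithm: the function names, the window-based approach, and the thresholds (-40 dBFS and -3 dBFS) are assumptions chosen only for the example, and samples are assumed to be floats in the range -1.0 to 1.0.

```python
import math


def rms_dbfs(samples):
    """Return the RMS level of float samples in [-1.0, 1.0], in dBFS."""
    if not samples:
        return float("-inf")
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square == 0:
        return float("-inf")  # digital silence
    return 10 * math.log10(mean_square)


def flag_volume(samples, window, low_db=-40.0, high_db=-3.0):
    """Return (start_index, label) for each window outside [low_db, high_db].

    The thresholds are illustrative assumptions, not values from the product.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    flagged = []
    for start in range(0, len(samples), window):
        level = rms_dbfs(samples[start:start + window])
        if level < low_db:
            flagged.append((start, "too quiet"))
        elif level > high_db:
            flagged.append((start, "too loud"))
    return flagged
```

A run of consecutive "too quiet" windows in the middle of otherwise normal audio could be one simple signal for the intermittent-audio case, though a real detector would need more context than this sketch uses.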

Customer Stories

Improved image quality, earning praise for the user experience

The Duwu app is a new-generation, trend-focused online shopping community. Short videos uploaded by its PGC/UGC creators had poor image quality, which hurt the playback experience. The service provides super-resolution processing to greatly improve image quality.

Image compression rate more than 30% higher than open source

The platform hosts a huge volume of PUGC images, resulting in high storage and distribution costs. Baidu Smart Cloud provides it with intelligent ultra-clear image compression, replacing open-source compression at a very low delivery cost and helping the customer cut storage and distribution costs by nearly 40%.

Supporting smooth online viewing for 180 million primary and secondary school students

As the lead provider, Baidu Smart Cloud supplies the Cloud School of the Ministry of Education with intelligent ultra-clear transcoding, helping the platform reduce bandwidth consumption by more than 30%, and by as much as 70% for some videos, while delivering clearer picture quality.
