WebRTC Series: Video Auxiliary Stream

Author: Tao Jinliang, Senior Client Development Engineer at NetEase Yunxin

Real-time audio and video has become increasingly popular in recent years, and many audio/video engines in the industry are built on WebRTC. This article introduces the requirements behind WebRTC's video auxiliary stream and how the related technology is implemented.

SDP in WebRTC supports two schemes: Plan B and Unified Plan. Our early Plan B scheme, built on multiple PeerConnections, supported sending only one video stream, which we call the "mainstream". We now use the Unified Plan scheme with a single PeerConnection to add a video auxiliary stream. What is the video "auxiliary stream"? The auxiliary stream (substream) is a second video stream, generally used for screen sharing.

Requirement background

As the business has evolved, a single video stream can no longer meet the needs of many real scenarios. For example, in multi-person video chat, NetEase Meeting, and other online education scenarios, two video streams need to be sent at the same time: one camera stream and one screen-sharing stream.

However, when the SDK is currently used to share the screen, the screen is shared through the camera capture channel. In this scheme, the sharer has only one uplink video stream, so the uplink camera image and the uplink screen image are mutually exclusive.

One workaround is to create a second SDK instance dedicated to capturing and sending the screen image, but a two-instance solution is cumbersome and raises many problems at the business layer, such as how to manage the relationship between the two streams.

WebRTC therefore also offers a scheme that opens a separate uplink video stream just for screen sharing, which we call the "substream". Substream sharing means that the sharer publishes the camera image and the screen image at the same time.

In addition, with this auxiliary stream channel in place, the groundwork is laid for sending video from the front and rear cameras simultaneously on newer iPhones, which are able to open both cameras at once.

Technical background

The early SDK architecture was a multi-PeerConnection model: one PeerConnection corresponds to one audio/video stream. With the introduction and support of the new SDP (Session Description Protocol) format, Unified Plan, one PeerConnection can correspond to multiple audio and video streams. This is the single-PeerConnection model, i.e. the single-PC architecture, which allows multiple transceivers to be created for sending multiple video streams.
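
For illustration, here is a minimal C++ sketch (assuming a WebRTC native build; the exact factory call varies between WebRTC revisions) of creating a PeerConnection in Unified Plan mode so that several transceivers can live on one connection:

```cpp
// Sketch: create a single PeerConnection in Unified Plan mode, so that audio,
// camera video, and screen-share video can each get their own transceiver on one PC.
// `factory` and `observer` are assumed to be created elsewhere.
#include "api/peer_connection_interface.h"

rtc::scoped_refptr<webrtc::PeerConnectionInterface> CreateUnifiedPlanPc(
    webrtc::PeerConnectionFactoryInterface* factory,
    webrtc::PeerConnectionObserver* observer) {
  webrtc::PeerConnectionInterface::RTCConfiguration config;
  // Unified Plan lets one PeerConnection carry multiple m= sections,
  // i.e. multiple audio/video streams.
  config.sdp_semantics = webrtc::SdpSemantics::kUnifiedPlan;
  webrtc::PeerConnectionDependencies deps(observer);
  auto result = factory->CreatePeerConnectionOrError(config, std::move(deps));
  return result.ok() ? result.MoveValue() : nullptr;
}
```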

Figure: multiple audio and video streams

Technical implementation

At present, video streams fall into three categories: camera streams, screen-sharing streams, and custom-input video streams, each with different attributes:

  1. The camera stream is a mainstream and supports Simulcast;
  2. Custom video input (non-screen-sharing) is a mainstream and does not support Simulcast;
  3. Screen sharing is treated as the substream; it does not support Simulcast and has its own screen-sharing encoding strategy.

Because of the particularities of screen sharing on iOS, screen video data has to be obtained through the custom video input, which leads to the flow shown below:

Figure: flow chart

To sum up, the iOS custom input can use either the mainstream channel to send video (non-screen-sharing) or the substream channel to send video (screen sharing).

Other platforms, such as Mac, Windows, and Android (AOS), are simpler: camera data and screen-sharing data come from inside the SDK, while external custom video input data comes from outside.

Key Class Diagram

The single-PC architecture mentioned above currently has two RtpTransceivers: an AudioTransceiver and a VideoTransceiver. Screen sharing over the substream adds a new RtpTransceiver. Each VideoRtpSender contains a VideoMediaChannel.
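
A minimal sketch of how the substream transceiver might be added alongside the existing ones, assuming `pc`, `camera_track`, and `screen_track` already exist (the stream ids are illustrative only):

```cpp
// Camera video: the mainstream transceiver.
webrtc::RtpTransceiverInit camera_init;
camera_init.stream_ids = {"camera_stream"};
auto camera_result = pc->AddTransceiver(camera_track, camera_init);

// Screen-share video: the substream transceiver added alongside it.
webrtc::RtpTransceiverInit screen_init;
screen_init.stream_ids = {"screen_stream"};
auto screen_result = pc->AddTransceiver(screen_track, screen_init);

// Each returned transceiver owns a VideoRtpSender, which in turn contains a VideoMediaChannel.
if (!camera_result.ok() || !screen_result.ok()) {
  // Handle the error (e.g. log and fall back to a single video stream).
}
```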

Figure: single-PC architecture

Figure: substream architecture

Auxiliary stream changes

Implementing the auxiliary stream requires adjustments and refactoring at several layers, as follows:

  1. The signaling layer needs to support multiple video streams; mediaType is used to distinguish the camera video stream, ScreenShare, and the external video input stream;
  2. Refactor the management of Capture and Source across the platform layers;
  3. Refactor the management of users and rendering canvases: instead of one UID mapping to one renderer, each sourceId of a UID maps to one renderer, and one UID may contain two sourceIds (see the sketch after this list);
  4. Server-side streaming and recording for interactive live broadcast must support combined recording of the mainstream and the substream;
  5. Implement the congestion control scheme for the mainstream and the substream;
  6. Implement the bitrate allocation scheme for the mainstream and the substream;
  7. Optimize encoder performance for the mainstream and the substream;
  8. Adjust the PacedSender sending strategy, audio/video synchronization, and related schemes;
  9. Adjust the server's downlink QoS bitrate allocation scheme;
  10. Aggregate the statistics related to the substream.
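
The change in item 3 can be pictured with a small, purely illustrative sketch (none of these names are the SDK's real API): the renderer lookup key becomes the pair (uid, sourceId) rather than uid alone, since one user may now publish both a camera source and a screen-share source.

```cpp
#include <cstdint>
#include <map>
#include <utility>

// Hypothetical source identifier: one user may publish both sources at once.
enum class VideoSourceId { kCamera = 0, kScreenShare = 1 };

struct Renderer { /* platform-specific canvas/view */ };

class RenderManager {
 public:
  // Bind a renderer to a specific (uid, sourceId) pair.
  void SetRenderer(uint64_t uid, VideoSourceId source, Renderer* r) {
    renderers_[{uid, source}] = r;
  }
  // Look up the renderer for a user's camera or screen-share stream.
  Renderer* Find(uint64_t uid, VideoSourceId source) {
    auto it = renderers_.find({uid, source});
    return it == renderers_.end() ? nullptr : it->second;
  }

 private:
  std::map<std::pair<uint64_t, VideoSourceId>, Renderer*> renderers_;
};
```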

The following describes the implementation of several important technical points in the whole process.

Bandwidth allocation

Under weak network conditions, when the video substream is needed, bitrate is allocated to the audio stream first, then to the substream, and finally to the mainstream; the overall strategy is to protect the auxiliary stream (a simplified allocation sketch is given after the figure below).

The main process of bandwidth allocation is as follows:

  1. The total bandwidth estimated by WebRTC's congestion control algorithm GCC (hereinafter referred to as CC) is distributed among the audio stream, the mainstream, and the substream;
  2. Within the mainstream, the Simulcast module allocates the bitrate between the large and small streams; if Simulcast is not enabled, the entire bitrate goes directly to the large stream.

The specific process is shown in the figure:

Figure: specific process

The substream adds another VideoSendStream on top of the structure shown in the figure above.
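
A minimal sketch of the allocation order described above, with hypothetical names and no claim to match the SDK's internal code: audio is satisfied first, then the substream, and the mainstream takes whatever remains.

```cpp
#include <algorithm>
#include <cstdint>

struct StreamBudget {
  int64_t audio_bps = 0;
  int64_t substream_bps = 0;
  int64_t mainstream_bps = 0;
};

// Distribute the total bitrate estimated by CC among the three streams,
// protecting audio first and the screen-share substream second.
StreamBudget AllocateTotalBitrate(int64_t total_bps,
                                  int64_t audio_target_bps,
                                  int64_t substream_target_bps) {
  StreamBudget b;
  b.audio_bps = std::min(total_bps, audio_target_bps);
  int64_t rest = total_bps - b.audio_bps;
  b.substream_bps = std::min(rest, substream_target_bps);  // substream is protected next
  b.mainstream_bps = rest - b.substream_bps;               // mainstream takes the remainder
  return b;
}
```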

Bitrate allocation

The current process of bitrate allocation is shown in the figure below and can be summarized as follows:

  1. The CC bitrate is passed to Call through the transport controller;
  2. BitrateAllocator then distributes it to each registered stream (currently the video module);
  3. The video module takes the allocated bitrate, assigns part of it to FEC and retransmission, and gives the rest to the video encoder;
  4. The video encoder module takes the encoder bitrate and splits it between the large and small streams according to our policy (see the sketch after the figure below).

Figure: bitrate allocation
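
A simplified sketch of steps 3 and 4 above; the overhead ratio and layer split here are made-up example numbers, not the SDK's actual policy:

```cpp
#include <cstdint>

struct VideoBitrate {
  int64_t fec_and_rtx_bps = 0;
  int64_t large_layer_bps = 0;
  int64_t small_layer_bps = 0;
};

// Split the bitrate handed to the video module: reserve FEC/retransmission overhead,
// then divide the remaining encoder bitrate between the large and small simulcast layers.
VideoBitrate SplitVideoBitrate(int64_t allocated_bps, bool simulcast_enabled) {
  VideoBitrate v;
  v.fec_and_rtx_bps = allocated_bps / 10;                  // assume ~10% protection overhead
  int64_t encoder_bps = allocated_bps - v.fec_and_rtx_bps;
  if (simulcast_enabled) {
    v.small_layer_bps = encoder_bps / 4;                   // e.g. 1/4 to the small stream
    v.large_layer_bps = encoder_bps - v.small_layer_bps;
  } else {
    v.large_layer_bps = encoder_bps;                       // all of it goes to the large stream
  }
  return v;
}
```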

Congestion control

To implement the video auxiliary stream, congestion control needs related changes, mainly the following four:

SDP signaling change

According to RFC 2327, "b=<modifier>:<bandwidth value>" is used to specify the recommended bandwidth. There are two modifiers:

  • AS: single-media bandwidth;
  • CT: total session bandwidth, i.e. the total bandwidth of all media;

At present, the SDK uses the b=AS: method to specify the recommended bandwidth of the camera stream or the screen-sharing stream, and uses this value as the upper bound of the CC module's estimate.

The new requirement is that the camera stream and the screen-sharing stream can be sent simultaneously in the same session. The recommended bandwidth of the whole session should therefore be the sum of the recommended bandwidths of the two media streams, used as the upper bound of the CC module's estimate.

WebRTC supports the b=AS: mode (per-media bandwidth), and the requirement can be met by summing over the multiple media inside WebRTC. WebRTC currently does not support the b=CT: mode, so the b=AS: mode is recommended since it requires relatively few changes.
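
A sketch of this approach, assuming the b=AS: lines shown in the comment (the values and the helper function are illustrative only): the per-media AS values are summed and the sum is used as the CC upper bound.

```cpp
// Illustrative SDP fragment with one b=AS: line per video m= section:
//
//   m=video 9 UDP/TLS/RTP/SAVPF 96 ...   (camera / mainstream)
//   b=AS:1500
//   m=video 9 UDP/TLS/RTP/SAVPF 96 ...   (screen share / substream)
//   b=AS:2000
#include <cstdint>
#include <vector>

// Sum the per-media AS values and convert to bit/s for the congestion controller cap.
int64_t SessionBitrateCapBps(const std::vector<int64_t>& per_media_as_kbps) {
  int64_t total_kbps = 0;
  for (int64_t kbps : per_media_as_kbps) total_kbps += kbps;
  return total_kbps * 1000;  // b=AS is expressed in kbit/s
}
```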

CC total bitrate update strategy

When the capability of the published (Pub) streams is updated, the "maximum bandwidth" is synchronously set into the CC module through SDP (b=AS:). When a new media stream is added, probing is started to quickly detect the available bandwidth:

Figure: CC total bitrate update strategy

Fast bandwidth assessment

When a media stream is suddenly added, the real bandwidth needs to be detected quickly; the probe-based fast detection algorithm achieves this:

  • If probing succeeds, the CC estimate converges quickly: to the CC upper bound when bandwidth is sufficient, or to the real bandwidth when bandwidth is constrained;
  • If probing fails (for example, in a high-packet-loss scenario), the CC estimate converges slowly, eventually reaching the CC upper bound when bandwidth is sufficient, or the real bandwidth when bandwidth is constrained;

Paced Sender Processing

  • The substream is sent with the same priority as the mainstream video streams; all video media data is paced smoothly using the budget and packing-multiplier mechanisms;
  • A new video stream type, kVideoSubStream = 3, is added to distinguish substream data from the mainstream large- and small-stream video data (see the enum sketch after this list);
  • During probing, if the encoded data is insufficient, padding data is sent to make up the difference, so that the transmitted bitrate meets the probing requirement;
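
An illustrative enum for the stream-type tag mentioned above; only kVideoSubStream = 3 comes from the text, the other names and values are assumptions for the example. The pacer can use this tag to tell substream packets apart from the mainstream large and small streams while still pacing them with the same priority.

```cpp
enum class VideoStreamType {
  kVideoMainStreamHigh = 1,  // mainstream, large (high-resolution) stream - assumed value
  kVideoMainStreamLow  = 2,  // mainstream, small (low-resolution) stream  - assumed value
  kVideoSubStream      = 3,  // screen-share substream (value from the text)
};
```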

The following figure shows the actual rate allocation test results:

Figure: bitrate allocation test results

Statistical reporting

Bandwidth statistics reporting consists of two parts, obtained from MediaInfo and from BweInfo respectively.

1. MediaInfo acquisition at sender and receiver

The current SDK obtains its bandwidth statistics from MediaInfo; the logic is:

  • Traverse all current transceivers to obtain each transceiver's video_channel and voice_channel, and from them the video_media_channel and voice_media_channel;
  • Call GetStats on each media_channel to obtain that channel's MediaInfo;
  • Put the collected MediaInfo into a vector media_info for reporting (see the sketch after the figure below);

When the mainstream and the substream are sent at the same time, only one more transceiver is added, so this logic also applies to that scenario, as shown in the following figure:

Figure: MediaInfo acquisition when the mainstream and substream are sent simultaneously
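
The loop described above can be sketched as follows; the types and method names are stand-ins that mirror the description, not exact WebRTC or SDK signatures:

```cpp
#include <vector>

// Stand-in types that mirror the description above; the real SDK classes differ.
struct MediaInfo { /* per-channel send/receive statistics */ };

struct MediaChannel {
  bool GetStats(MediaInfo* info) { return info != nullptr; }  // stub: real code fills *info
};

struct Transceiver {
  MediaChannel* voice_media_channel() { return nullptr; }  // stub: null if not an audio transceiver
  MediaChannel* video_media_channel() { return nullptr; }  // stub: null if not a video transceiver
};

// Visit every transceiver, ask its voice/video channel for stats, and collect the
// MediaInfo entries into one vector for reporting. Adding the substream only adds
// one more video transceiver; the loop itself is unchanged.
std::vector<MediaInfo> CollectMediaInfo(const std::vector<Transceiver*>& transceivers) {
  std::vector<MediaInfo> media_infos;
  for (Transceiver* t : transceivers) {
    MediaInfo info;
    if (auto* voice = t->voice_media_channel(); voice && voice->GetStats(&info))
      media_infos.push_back(info);
    if (auto* video = t->video_media_channel(); video && video->GetStats(&info))
      media_infos.push_back(info);
  }
  return media_infos;
}
```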

2. Bandwidth estimation information acquisition

The current SDK obtains its bandwidth estimation information from BweInfo; the logic is:

  • Obtain the overall bandwidth information, such as the GCC estimate and the probe results;
  • Obtain the bandwidth estimation information associated with each transceiver's voiceChannel and videoChannel (similar to the MediaInfo acquisition);

The scenario where the mainstream and the substream are sent at the same time only adds transceivers, so this logic also applies to it, as shown in the following figure:

Figure: bandwidth estimation information acquisition

Summary

That concludes our sharing on the video auxiliary stream in WebRTC: starting from the business requirements, the technical background, and the key class diagrams, we have described the technical implementation in detail. You are welcome to leave a message and discuss WebRTC and other audio/video technologies with us.
