From zero to one: building a Zoom-like web conference app with a real-time audio and video SDK


Zoom (zoom.us) is a widely used online conferencing product that you have probably used at some point, whether at work, in meetings, or just to chat. As a mature commercial product, Zoom provides stable real-time audio and video call quality along with common features such as whiteboards, chat, screen sharing, and PPT presentation. But why should real-time audio and video lag behind in an era when the browser has become mainstream? Compared with Zoom, which requires installing a client, a similar conference product that runs directly in a web page is bound to attract more attention: when a meeting is needed, people simply open a link to join. With the Qiniu real-time audio and video Web SDK, you can easily turn that idea into reality.

First, let's sort out the key problems a Web online conference product needs to solve:

  • Good browser compatibility: most mainstream desktop browsers need to be supported.

Qiniu real-time audio and video is built on WebRTC, the protocol Google promoted through Chrome. WebRTC has since been officially written into the Web standard, and all modern browsers support it well.

  • Good call quality: low latency and high definition.

Unlike traditional WebRTC, where users communicate with each other directly over P2P, we use nodes deployed around the world to form a low-latency real-time network: each client talks to a nearby node rather than connecting peer to peer, which guarantees both latency and call quality.

  • Rich conference features, including PPT presentation, whiteboard, screen sharing, and so on.

Our SDK provides a rich set of features that covers the needs of most conference scenarios. In theory, you could use the SDK to fully reproduce a Web version of Zoom.

  • With all that said, is it difficult to integrate? Are there examples and documentation?

Of course! You can try our Web demo at https://demo-rtc.qnsdk.com (open it in a desktop browser). The demo's source code is also open source on GitHub for reference: https://github.com/pili-engineering/QNRTC-Web

This demo implements most of the features the SDK provides directly. A demo integrating whiteboard, PPT sharing, chat, and other scenarios is being prepared and will go online soon. Below we briefly walk through the integration process; for detailed instructions and references, see our documentation site: https://developer.qiniu.com/rtn/sdk/4412/description-of-web-sdk

Development process

A simple conference product usually goes through the following process:

  • User registration/login (developers handle this themselves; the SDK only needs a userID to distinguish users)
  • Create a conference room/join a conference room
  • Collect your own camera/microphone data
  • Publish the collected media data to the room
  • Subscribe to other people's media data in the room and play it in real time
  • Handling users joining/leaving and publishing/unpublishing

Mapped onto the SDK, this boils down to: joining a room, collecting local media streams, publishing media streams, subscribing to media streams, and handling events. The SDK encapsulates each of these steps so that each one takes only a few lines of code.

Importing the SDK

We recommend installing the SDK via npm (npm i pili-rtc-web); alternatively, you can directly include the pre-built JS file: https://github.com/pili-engineering/QNRTC-Web/blob/master/Release/pili-rtc-web.js
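As a minimal sketch of both import styles (the exact module shape is an assumption on our part; the snippets in this article only rely on a QNRTC namespace object, which the pre-built file exposes as window.QNRTC):

 // Option 1: installed via npm (namespace import is an assumption; check the SDK docs)
 // npm i pili-rtc-web
 import * as QNRTC from 'pili-rtc-web';

 // Option 2: include the pre-built file with a script tag instead,
 // which exposes the same namespace as a global:
 // <script src="pili-rtc-web.js"></script>
 // const QNRTC = window.QNRTC;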

Asynchronous processing

Real-time audio and video is a heavily asynchronous scenario: every operation that touches the network is asynchronous. To help developers keep this asynchronous logic under control, the SDK does not use cumbersome callbacks; instead it relies on the async/await and Promise features of modern JavaScript, which avoids callback hell during development (all the await code below is assumed to be wrapped in an async function).
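To make that assumption concrete, here is a minimal, illustrative wrapper (main and the error handling are our own, not part of the SDK) that the await snippets below can be imagined to run inside:

 // Illustrative wrapper: every await snippet in this article runs inside an async function
 async function main() {
   try {
     // ... SDK calls such as joinRoomWithToken, getLocalStream, publish, subscribe
   } catch (err) {
     // Network-related operations can fail, so surface or handle the error here
     console.error('RTC operation failed:', err);
   }
 }

 main();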

Join Room

With the preparation done, the first step is to join a room. "Joining a room" can be expressed abstractly as "which user joins which room with which identity", which involves three unknowns: the user ID, the identity (permission), and the room ID. In practice there are even more unknowns in the whole process, such as which app the room belongs to (rooms are independent across applications) and which Qiniu account the app belongs to. We encode and sign all of these values into a single roomToken that is handed to the front end; the client can join the room with just this token. (The token can be generated on the Qiniu console, or generated dynamically with the server-side SDK as needed.)

 // Initialize the session
 const myRTC = new QNRTC.QNRTCSession();
 // Join the room with the roomToken obtained from the server
 await myRTC.joinRoomWithToken(ROOM_TOKEN);

Collect local media stream

Usually audio and video are collected at the same time, that is, both the microphone and the camera. However, the SDK also supports audio-only or video-only collection as needed; the call is the same, you just change the options.

 // The DOM element on the page prepared to play the stream
 const DOM_ELEMENT = ...;
 // Collect the local stream
 const localStream = await QNRTC.deviceManager.getLocalStream({
   video: { enabled: true },
   audio: { enabled: true },
 });
 // Play the collected stream
 localStream.play(DOM_ELEMENT);

Publish Media Stream

Simply take the stream object we just obtained as the argument and call the publish method.

 await myRTC.publish(localStream);

Subscribe to media streams

After joining the room successfully, you can check the current state of the room's users at any time through the users member. If a user other than yourself is publishing, you can initiate a subscription.

 const users = myRTC.users;
 users.forEach(async (user) => {
   if (user.published && user.userId !== myRTC.userId) {
     // Subscribe to the stream data published by other users in the room
     const remoteStream = await myRTC.subscribe(user.userId);
     // As before, calling play is enough to play the stream
     remoteStream.play(DOM_ELEMENT);
   }
 });

Event processing

The SDK exposes a rich list of events to cover the needs of most scenarios. Handling events is also very simple; take the "another user published" event as an example:

 // Listen for the event
 myRTC.on('user-publish', handleUserPublish);
 // Listen only once
 myRTC.once('user-publish', handleUserPublish);
 // Remove a specific listener
 myRTC.off('user-publish', handleUserPublish);
 // Remove all listeners for the event
 myRTC.removeAllListeners('user-publish');
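For completeness, here is a rough sketch of what handleUserPublish could look like; the parameter shape and the logic are our own illustration (based on the subscribe snippet above), not the SDK's prescribed handler:

 // Illustrative handler: subscribe to a user's stream as soon as they publish
 async function handleUserPublish(user) {
   // DOM_ELEMENT is the element prepared to play the remote stream
   const remoteStream = await myRTC.subscribe(user.userId);
   remoteStream.play(DOM_ELEMENT);
 }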

The full event list is available at https://developer.qiniu.com/rtn/sdk/4423/the-event-list-web

Advanced features

In addition to these basic features, the SDK provides many powerful advanced features to further meet the needs of different industries.

Screen sharing

Besides capturing the camera, the SDK also supports capturing the screen (or a single window) so that you can share your screen in a meeting. It also supports seamless switching between screen sharing and camera capture to keep the user experience smooth.

 // Screen sharing
 await QNRTC.deviceManager.getLocalStream({
   screen: { enabled: true },
   audio: { enabled: true },
 });
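One straightforward way to switch from the camera to the screen is simply to republish: stop publishing the camera stream, collect a screen stream, and publish that instead. The sketch below assumes the session exposes an unpublish method; check the API reference for the SDK's own switching mechanism.

 // Illustrative switch from camera to screen sharing by republishing
 await myRTC.unpublish(); // assumption: stops publishing the current stream
 const screenStream = await QNRTC.deviceManager.getLocalStream({
   screen: { enabled: true },
   audio: { enabled: true },
 });
 await myRTC.publish(screenStream);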

Forwarding to live streaming

In an online meeting, perhaps only a dozen or so people actually take part in the discussion, while most people just need to watch it in real time without joining in. This is where real-time audio and video meets live streaming: the small number of users who need real-time interaction are served by the real-time audio and video cloud with ultra-low latency (about 200 ms), while the majority who only need to watch are served by the live streaming cloud with low latency (2-3 s), which meets the requirements at minimal cost. Once the real-time stream has been forwarded to the live streaming cloud, you can also use the Qiniu live cloud API to persist the stream data to storage for long-term archiving. That completes a full business flow from real-time interaction to live viewing to file storage (for video on demand and so on).

First, on the console page of the real-time audio and video cloud, associate the corresponding live streaming space and turn on the merge-and-forward switch.

If you want to forward to a custom RTMP address (instead of using Qiniu's live streaming cloud), you can also configure this through the real-time audio and video cloud's backend API (see the documentation for details).

The next step is done with the SDK. Enabling live forwarding through the SDK is also very simple: after joining a room, a single line of code is enough.

 // WIDTH and HEIGHT correspond to the merged output size configured above
 myRTC.setDefaultMergeStream(WIDTH, HEIGHT);

With this call, the SDK will by default lay out all the streams in the room evenly and push the merged result to the target RTMP address. If you want to customize the layout, you can use the following API:

 myRTC.setMergeStreamLayout("target user ID", {
   w: 100, h: 100, x: 0, y: 0,
   muted: false, hidden: false,
 });

In addition, we also provide a real-time whiteboard service for web pages. Just like in Zoom, users can share a whiteboard on the page with the other users in the room as an aid to presentation; the whiteboard also supports PPT and PDF presentations. A demo of this feature is available in our online classroom experience: https://edu-demo.qnsdk.com

The above only lists the common feature scenarios of a conference product. By combining these basic capabilities with your own scenario, a simple conference product can be built with little effort. If you want to try it out, visiting our demo (linked above) is a good starting point. If you plan to integrate real-time audio and video into your own product, there is a more detailed development walkthrough for real-time audio and video applications at https://developer.qiniu.com/rtn/sdk/5043/rtc-application-development-process , with detailed explanations and examples covering everything from HTML/CSS to JS, from each line of code to each feature, to help you get started quickly.
