Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

This studio is a touch of tenderness in the passion of the World Cup.

2025-03-28 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > IT Information >

Share

Shulou(Shulou.com)11/24 Report--

Unconsciously, the World Cup in full swing has come to an end.

The night of the final was full of twists and turns, and finally Argentina won the third championship in team history.

Especially when I saw Messi finally holding the Hercules Cup and listened to the narrator's sentimental story, I believe many people were moved.

Speaking of commentators, which platform did you watch this year's World Cup?

Is it the solemn and elegant CCTV, or the relaxed and lively Migu? Or are you playing the starry Douyin with a lot of tricks?

No matter which platform, I believe everyone can harvest their own moved.

However, when the editor watched the game on Douyin this year, he received another emotion. That is the live studio of "accessibility subtitles" on Douyin.

It enables friends with hearing impairment to better grasp the dynamics of the field through subtitles, and feel the passion of the commentator when analyzing and commenting.

And behind such a "barrier-free subtitle" live broadcast room, there is another contribution that may be ignored by us, that is, the volcanic engine.

What exactly is going on? Next, CTOnews.com will talk to you in detail.

Barrier-free subtitle studio, a touch of tenderness under passion, we know that watching sports games is a process that requires sound and picture cooperation, especially combined with the analysis and comments of the commentator, in order to have a more in-depth understanding of the situation on the field.

But for a long time, such an experience has become a luxury for people with hearing impairment.

However, for this year's World Cup, if you are watching the match live on Douyin, you can see the live studio with "barrier-free subtitles" in different studio options.

This studio can not only provide the sound and picture of the live broadcast of the game, but also translate what the host and commentator said into text subtitles in real time.

You know, this is a live broadcast, so it is obviously not realistic for the transcript to type the output subtitles manually.

We need to use the power of AI.

On the other hand, the volcano engine adopts the innovative AI subtitle scheme of volcanic simultaneous interpretation.

The process of its work is:

First receive the live stream, then identify and transcribe the voice signal in real time, and output AI streaming subtitles. Then, the translator can manually proofread the AI subtitles within 30 seconds of the delay, and finally put out the subtitles of the whole sentence.

In this process, the accurate and fast speech recognition of AI is the foundation. Therefore, Volcano Voice team specially optimized the terms such as football proper nouns, team and player names, so as to improve the accuracy of AI model recognition.

Moreover, they also analyzed the audio characteristics of a large number of football commentary scenes and tuned the model to ensure that the human voice can be clearly identified in the case of background sound.

As a result, the studio of "barrier-free subtitles" can output "more accurate" subtitles in the case of "low delay", thus bringing viewers a better viewing experience.

In addition, in the presentation of subtitles, volcanic simultaneous interpretation has also been optimized, with well-designed and clearer double-line subtitles, so that people will not feel tired when watching the subtitles for a long time.

In short, they are not only doing the function of real-time subtitles, but also doing a good job in all aspects of subtitle presentation and viewing as far as possible, so that hearing-impaired users can really enjoy the quadrennial football feast more comfortably.

Statistics show that as of December 6, the World Cup's accessible subtitle studio has been watched more than 1800 times.

By the way, it may not only be used by hearing impaired people, but also convenient for ordinary users when they are in a quiet environment and are inconvenient to listen to and explain.

But more importantly, it must be what it means to the hearing impaired.

According to the statistics of the World Health Organization, there are more than 400 million people with hearing impairment in the world, and there are nearly 30 million people in China, which is the largest. It is believed that most of them, whether they are fans or not, will pay attention to the World Cup, which is discussed by the whole people. However, the obstacles in listening greatly affect their access to and understanding of information in the process of watching the game.

Moreover, in addition to the World Cup, they also have a need to watch other entertainment programs and listen to huge amounts of information in their daily life. Hearing impairment, however, has become a gap.

From this point of view, the technical solution of the volcano engine can not only help them solve the problem of watching the World Cup, but also has social significance worth promoting in more scenes.

It is like a touch of tenderness under the passion of this World Cup, which not only injects humanistic care into the live broadcast of sports events, but also makes many people feel the warmth of attention in this winter.

Isn't this the embodiment of the "people-oriented" of science and technology?

In addition to the barrier-free live broadcast, the volcano engine also makes more people feel better. In fact, in addition to the barrier-free live broadcast, this World Cup, the volcano engine has done more.

They are the main technical service providers for the live broadcast of the World Cup on Douyin platform. Not only that, the volcano engine also provides ultra-high-definition, low-delay live broadcast technical support for CCTV.

In addition, the PICO end of the World Cup ultra-high-definition low-delay immersive live broadcast is also supported by the volcano engine.

In short, its shadow can be seen in many places.

What it brings to us, first of all, is the better live broadcast quality.

In this World Cup, you can enjoy higher-definition pictures of the game, better HDR effects, more realistic colors, fuller stadium and commentary sound quality.

Along the way, the friends should be happy.

How do volcanic engines do that?

First, in terms of clarity. This year's volcano engine supported the live broadcast of the Douyin World Cup, realizing the industry's first large-scale provision of ultra-high-definition images to public mobile devices.

They deeply restore the details of the game through self-developed BVC encoders and high-definition and low-code algorithms, allowing fans to experience the picture quality of watching the game on a large screen on their mobile phones.

The second is the fluency and stability of the live broadcast of the game.

Take the World Cup final as an example, the highest number of simultaneous viewers in the Douyin studio reached an all-time peak of more than 37 million, while the volcano engine relied on global coverage of marginal cloud resources, efficiently connected collaborative networks, and massive computing resources. Douyin successfully passed all the traffic tests of the live broadcast of the World Cup.

According to the data given by the volcano engine, the peak bandwidth of their supporting platforms is close to 50Tbps, a new high.

Can have such a safe response ability, on the one hand, the volcano engine is well prepared, they built a "second monitoring, 1-minute response, 3-minute stop loss" safeguard SOP system.

On the other hand, it is also based on the strong response of the volcano engine edge cloud, which has global coverage of edge computing nodes, provides 1-40ms network access and data unloading capabilities, as well as 100 T-level edge resource reserves and over 100 million-level QPS concurrency capabilities. These are enough to support users to watch the game steadily and smoothly at the peak and complete the interaction.

In addition, the delay of live streaming is also an important factor affecting the experience.

For the live broadcast of this year's World Cup, the volcano engine uses RTM low-delay live broadcast technology for the first time. This technology can not only provide large-scale distribution capability, but also reduce the end-to-end delay of live images to about 1 second.

Therefore, the picture of the game that you see in front of the screen is almost synchronized with the scene of the Qatar match, and it is more immersive.

While you get a more immersive live audio and picture quality, you will further pursue the sense of participation and interaction when watching the live broadcast, which is a higher-level experience of watching the game.

And the volcano engine is also given to everyone through the corresponding technical solutions.

For example, the "watching while talking" function, which many friends like very much when watching the game on Douyin, can create an exclusive "friend chat area" in the studio and organize games with friends from all over the country to watch the game, not only by text chat, but also by voice chat. The wonderful moments of the game can also be shared with friends with one click, which is really full of the sense of interaction.

Behind this function, thanks to the volcano engine RTC (real-time communication) technology, it can always provide users with a high-quality audio experience under the concurrency of millions of traffic.

When everyone is chatting with audio while watching the game, the volcano engine RTC also uses audio hosting combined with self-developed intelligent 3A algorithm to ensure that everyone speaks without echo in the external scene, and provides adaptive human voice volume balance, intelligent audio evasion and other technologies, so as to ensure the best sound quality of the event, but also provide a clearer and fluent voice communication experience.

The volcano engine also helps Douyin, providing a virtual broadcast platform for football talents on the platform, and anchors can start a live commentary of World Cup matches anytime, anywhere, using the virtual broadcast platform. It breaks the restrictions on venue, equipment and time in traditional activities. The VJ sits in the green screen environment and can load his own virtual commentary hall scene through the virtual broadcast platform, which can not only save the time and manpower cost of setting up the scene offline, but also build high-quality and diversified scenes that can not be reproduced in the real space. Through various algorithms developed by the volcano engine, the software can achieve pixel-level matting, real-time rendering of high-precision 3D scenes, mapping characters and the spatial relationship between virtual scenes. Enhance the sense of reality, so that anchors and viewers seem to be there.

It can be said that the functions and immersion experiences brought by these technologies have accompanied us through the entire World Cup. It is because of this experience that we can see every wonderful moment as if we were on the spot, celebrate the joy of supporting the team's victory with the audience, and be able to watch the game, chat and have fun with our friends.

With the advent of the super video era, the volcano engine video cloud shows muscle World Cup, like an excellent proving ground, it shows us how important the wonderful presentation of video content is in the new era.

And this new era, we can call it the "super video era". According to the 48th Statistical report on the Development of China's Internet released by the China Internet Information Center, by June 2021, the number of short video users in China has reached 888 million, and the average daily use time of short video applications has exceeded 120 minutes.

At the same time, IDC also pointed out in its white paper "Video Cloud Evolution trend in the Super Video era" that the current era has gone through the stages of long video, short video and live streaming applications, and entered the super video era.

Whether it is the proliferation of live streaming or short video information flow, to mobile conferencing, office, distance education, medical care and other industries, various scenarios and videos are becoming a new generation of expressways for information transmission. at the same time, it is also the productivity of the new era that thousands of industries are tapping.

Consumers also have new requirements for video content, and high-definition, interactive, immersive experiences all bring new challenges to video content providers.

Under this background, video cloud construction has become the trend of the times. Video cloud capabilities have been applied to thousands of industries and become a new race track for business and technology.

And Volcano engine, the cloud service platform under byte beating, has built a complete video cloud product matrix through its own natural accumulation and innovation in the field of video applications.

According to the latest video cloud product matrix released by Volcano engine in February this year, they have formed a complete solution including pan-Internet, games, finance, radio and television scenarios, video-on-demand, veImageX, real-time audio and video and other core products. The lowest core platform is the technical capability accumulated and precipitated by the volcano engine in serving excellent applications such as Douyin and watermelon video, covering network transmission, intelligent production and intelligent processing of the whole link.

And subdivided into each capability, the volcano engine video cloud also has corresponding technical advantages.

For example, in terms of coding, Volcano engine's BVC series encoders have won 17 championships in the world's top video encoder MSU2020; in the video playback experience, Volcano engine has an original "zero first frame" optimization, so that the first frame of short video is less than 100ms, and the first frame of long video is less than 400ms, resulting in an imperceptible and smooth playback experience.

There are also the various leading technical capabilities that we focused on showing when we introduced the volcano engine to provide the World Cup live broadcast service.

Based on these advantages, the volcano engine video cloud is also providing the ultimate video experience for different industries, and has realized many excellent commercial landing cases.

For example, in July this year, Douyin, watermelon video, Jinri Toutiao, fresh time TV's "Beyond Live 1991 Life contact Concert" and memorial concert featured reruns, causing 140 million fans to be nostalgic.

Behind this is a successful application of the video cloud capability of the volcano engine. They show great strength in various aspects, such as damaged picture quality repair, color restoration, portrait reconstruction, motion compensation, sound quality restoration, and so on. While amazing the outside world, it also makes people see the broad application prospects of this technology.

Another example is the cooperation between the volcano engine and the quick look at the comics. Now many "Super New Generation Z" like to watch "comic drama" on the fast watch, and this new form of video content is backed by the volcano engine video cloud. Volcano engine Video Cloud provides powerful video editing for Quick View's comic drama creation and promotes the prosperity of UGC content. At the same time, it also provides zero first frame optimization experience for Quick View, which enhances the user viewing experience.

For example, watching the World Cup on the PICO VR all-in-one computer, behind it is the ultra-high definition live broadcast of the game through the volcano engine video cloud, and with the help of the volcano engine RTC scheme, it brings users spatial audio that can change with the position and head posture, so as to achieve a better interactive effect of VR watching the game.

In short, the successful commercial landing and the excellent industrial empowerment all make the future performance of the volcano engine in the video cloud track more anticipated.

Conclusion: the World Cup is over. But the era of video cloud has just begun.

The ancients said that man is "multiplied by dawn to see, sent to heaven to hear". "Audio-visual" is the most basic way for human beings to feel the world, and this way is bound to usher in a new subversion in the new era.

Volcano engine video cloud, is to let people see, hear, in an unprecedented in-depth and direct way, feel the world.

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

IT Information

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report