Consider: does a truly distributed database make the concept of the "data lake" a thing of the past?

2025-03-29 Update From: SLTechnology News&Howtos

Shulou(Shulou.com)06/03 Report--

Original address: http://www.fromgeek.com/ai/152830.html

Recently, Wu Ningchuan published the article "Impressive, Ant Financial Services Group! Creating China's own database, OceanBase", which recounts the causes and consequences of OceanBase's birth. The content is very detailed and worth sharing. Here I also share a few thoughts of my own:

First, overcharging regular customers ("killing the familiar") is not only a product of the big data era.

There was a widely discussed case of a certain online platform charging its regular customers more to book a hotel or hail a taxi. It shows that in the era of big data, any of us can be treated as a familiar customer to be overcharged at any time.

In fact, this phenomenon exists in every field. Technical barriers, for example, are one of the conditions that make such overcharging possible. As the article mentions, Wang Jian proposed "de-IOE" (moving away from IBM minicomputers, Oracle databases, and EMC storage) at Alibaba in 2008, precisely because of the overcharging that those technical barriers made possible. Normally, IT procurement is a tool for improving an enterprise's efficiency. But when procurement consists of minicomputers, high-end storage, and databases, the more the enterprise buys, the more the cost grows geometrically. IT procurement then stops being a driver of the business and can even seriously hinder its development.

The cost of IOE-style equipment kept rising as the Ali Cloud business grew at scale. For Alibaba, technology had lost its role as a driver of productivity. Under such circumstances, Ant Financial Services Group independently developed the OceanBase database.

Second, the birth of a truly distributed database breaks the traditional concept of the "data lake".

What is the traditional "data lake" concept? It treats multiple physical disks as a single virtual storage unit. Chen Mengmeng, head of SQL development on the OceanBase team, said that in this design all database instances see the same data disk and share data access. This ensures that all data can be reached, but it places high demands on the hardware: the underlying hardware itself must be stable and reliable. This concept has been accepted by the vast majority of traditional enterprises and even by Internet companies.

Alibaba broke with this idea. Only two enterprises in the world have done so: one is Alibaba, the other is Google.

Chen Mengmeng believes that there are only two truly distributed databases in the world: Alibaba's OceanBase and Spanner, the fully self-developed distributed database that Google released as a cloud service in February 2017.

Even the Aurora database launched by AWS follows a design principle that is closer to the shared-disk design of traditional databases.

Specifically, when OceanBase handles data access, it is as if the original minicomputer or storage device were "sliced" vertically into many machines, with the data then distributed across those machines. My personal understanding is that this divides one overall "data lake" into many small "data pools".
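To make the "data pools" picture concrete, here is a minimal Python sketch of spreading records across several machines by hashing their keys. It is only an illustration of the general idea of partitioning; the function names, machine names, and the hash-modulo rule are assumptions made for this example, not a description of how OceanBase actually places data.

```python
import hashlib


def pick_machine(key: str, machines: list[str]) -> str:
    """Map a record key to one machine (hypothetical placement rule)."""
    # Hash the key so that keys spread roughly evenly over the machines.
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    index = int.from_bytes(digest[:8], "big") % len(machines)
    return machines[index]


def distribute(keys: list[str], machines: list[str]) -> dict[str, list[str]]:
    """Split one big set of keys (the "lake") into per-machine "pools"."""
    pools: dict[str, list[str]] = {m: [] for m in machines}
    for key in keys:
        pools[pick_machine(key, machines)].append(key)
    return pools


# Illustrative names only: four ordinary servers and 1000 account records.
machines = ["server-a", "server-b", "server-c", "server-d"]
keys = [f"account-{i}" for i in range(1000)]
pools = distribute(keys, machines)

for machine, pool in pools.items():
    print(machine, len(pool))
```

Each record lives in exactly one pool, so no machine needs to see a shared disk; a request for a given key is routed to the machine that `pick_machine` returns.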

One of the basic design ideas of OceanBase is to store each piece of data on three different machines. If the probability of a single PC server failing is 1/1000, then, assuming the failures are independent, the probability that two servers fail at the same time is 1/1,000,000, and the probability that all three fail at the same time is 1/1,000,000,000.
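The arithmetic behind these figures can be checked with a short Python sketch. It assumes that server failures are independent, which the article's numbers imply but which real deployments only approximate; the function name is made up for this example.

```python
from fractions import Fraction


def all_replicas_fail(p_single: Fraction, replicas: int) -> Fraction:
    """Probability that every replica fails at once, assuming independence."""
    # Independent events multiply, so n replicas give p_single ** n.
    return p_single**replicas


p = Fraction(1, 1000)  # failure probability of one PC server, from the article
for n in (1, 2, 3):
    print(n, all_replicas_fail(p, n))
# prints:
# 1 1/1000
# 2 1/1000000
# 3 1/1000000000
```

Exact fractions are used instead of floating point so the results match the article's figures digit for digit.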

Third, can the OceanBase distributed database be combined with blockchain technology?

First of all, Wang Jian proposed that Alibaba build a distributed database at about the same time that Satoshi Nakamoto put forward the Bitcoin white paper. Since 2009, then, Wang Jian has been thinking about a distributed database that truly suits future Internet business. Looked at another way, in the same period Satoshi Nakamoto proposed a peer-to-peer electronic cash system built on blockchain technology (jokingly known as "the slowest distributed database ever").

The difference is that OceanBase is a commercial project. After several years of continuous development, the database itself not only stores data in a distributed way but also optimizes database queries. In real application scenarios, customers at a traditional bank counter spend a lot of time waiting at manual service windows, whereas Ant Financial Services Group gives users a high-quality Internet service experience through Internet financial applications built on OceanBase.

Blockchains, with their slow distributed database technology, could draw on the database technology of Alibaba's OceanBase or Google's Spanner. Doing so would play a positive role in promoting blockchain technology.

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
