Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

How to solve the problem of consistency between redis cache and database in practical application of the project

2025-01-19 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Development >

Share

Shulou(Shulou.com)06/02 Report--

This article will explain in detail how to solve the problem of consistency between redis cache and database in the practical application of the project. The editor thinks it is very practical, so I share it with you as a reference. I hope you can get something after reading this article.

First, the way to solve the problem

Is it possible to make sure that the first request must be executed first? The common idea is "serialization"

The figure above shows the upstream and downstream of a service service and the detailed deployment within the service. The details are as follows:

(1) upstream of service are multiple business applications. Upstream initiates requests for concurrent read and write operations on the same data. In the above example, a uid=1 balance modification (write) operation and a uid=1 balance query (read) operation are performed concurrently.

(2) downstream of service is the database DB. It is assumed that only one DB is read and written.

(3) in the middle is the service layer service, which is divided into several parts.

(3.1) the top layer is the task queue.

(3.2) in the middle is the worker thread, each worker thread completes the actual work task, and the typical work task is to read and write the database through the database connection pool.

The lowest layer is the database connection pool, and all SQL statements are sent to the database for execution through the database connection pool.

The typical workflow of a worker thread is as follows:

Void work_thread_routine () {

Task t = TaskQueue.pop (); / / get the task

/ / logical processing of tasks to generate sql statements

DBConnection c = CPool.GetDBConnection (); / / get a DB connection from the DB connection pool

C.execSQL (sql); / / execute sql statements over a DB connection

CPool.PutDBConnection (c); / / put the DB connection back into the DB connection pool

}

Question: the task queue has actually done the task serialization, can you ensure that the task will not be executed concurrently?

A: no, because

(1) A service has multiple worker threads, and tasks that pop up serially will be executed in parallel.

(2) A service has multiple database connections. Each worker thread acquires different database connections and executes concurrently at the DB level.

Question: assuming that only one service is deployed, can you ensure that the tasks will not be executed concurrently?

A: no, the reason is the same as above

Question: assuming that there is only one database connection for a service, can you ensure that the task will not be executed concurrently?

A: no, because

(1) A service has only one database connection, which can only guarantee that requests on one server are executed serially at the database level.

(2) because services are distributed, requests on multiple services may still be executed concurrently at the database level.

Question: assuming that only one service is deployed and there is only one connection for one service, can you ensure that the tasks will not be executed concurrently?

A: yes, globally, the request is executed serially, the throughput is very low, and the service cannot guarantee availability.

It's over. It seems hopeless.

1) Task queue cannot guarantee serialization

2) serialization is not guaranteed for single-service multi-database connections.

3) the serialization of multi-service single database connection is not guaranteed.

4) single service single database connection may guarantee serialization, but the throughput level is low, and the availability of the service can not be guaranteed, which is almost not feasible, so is there a solution?

To take a step back, there is no need to serialize global requests, but only to "serialize access to the same data".

In a service, how to "serialize access to the same data" only needs to "allow access to the same data to be performed through the same DB connection".

How to "allow the access of the same data to be performed through the same DB connection" only needs to be "slightly modified at the DB connection pool level, and the connection can be obtained by data"

Get the CPool.GetDBConnection () of the DB connection [return any available DB connection] to

CPool.GetDBConnection (longid) [returns the DB connection associated with id fetch]

The advantages of this amendment are:

(1) simple, you only need to modify the DB connection pool implementation and the DB connection acquisition.

(2) the modification of connection pool does not need to pay attention to business. What is the meaning of the passed id? connection pool is not concerned. You can simply return the DB connection according to id mode.

(3) it can be applied to a variety of business scenarios. The user data service is passed into user-id for connection, and the order data service is passed into order-id for connection.

In this way, we can ensure that the execution of the same data, such as uid, at the database level must be serial.

Wait a minute, many copies of the service are deployed. The above solution can only guarantee the access of the same data on one service. The execution at the DB level is serialized. In fact, the service is distributed and the global access is still parallel. How to solve this problem? Is it possible that the access to the same data must fall to the same service?

Is it possible to access the same data on the same service?

Above, we analyze the upstream and downstream and internal structure of the service layer service, and then take a look at the upstream and downstream and internal structure of the application layer.

The figure above shows the upstream and downstream of a business application and the internal deployment of the service. The details are as follows:

(1) the upstream uncertainty of a business application may be a direct http request or an upstream call to a service.

(2) downstream of business applications are multiple service service

(3) the middle is the business application, which is divided into several parts.

(3.1) at the top is the task queue [maybe web-server, such as tomcat, did it for you]

(3.2) in the middle is the worker thread (maybe the web-server worker thread or the cgi worker thread does this for you). Each worker thread completes the actual business task, and the typical work task is to make RPC calls through the service connection pool.

The lowest layer is the service connection pool, and all RPC calls are sent to downstream services through the service connection pool.

The typical workflow of a worker thread is as follows:

Voidwork_thread_routine () {

Task t = TaskQueue.pop (); / / get the task

/ / Task logic processing, which forms a network packet packet, and calls the downstream RPC API

ServiceConnection c = CPool.GetServiceConnection (); / / get a Service connection from the Service connection pool

C.Send (packet); / / send a message over the Service connection to execute the RPC request

CPool.PutServiceConnection (c); / / put the Service connection back into the Service connection pool

}

Deja vu? Yes, just make a few changes to the service connection pool:

Get the CPool.GetServiceConnection () of the Service connection [return any available Service connection] to

CPool.GetServiceConnection (longid) [returns the Service connection associated with id fetch]

In this way, requests for the same data, such as uid, can be guaranteed to fall on the same service Service.

Due to the concurrency of read and write at the database level, the problem of inconsistency between the database and the cached data (in essence, the later read request is returned first) may be solved by two small changes:

(1) modify the service Service connection pool, and select the service connection by using id to ensure that the reading and writing of the same data fall on the same back-end service.

(2) modify the database DB connection pool and select the DB connection in id mode, which can ensure that the read and write of the same data is serial at the database level.

This is the end of this article on "how to solve the problem of consistency between redis cache and database in the practical application of the project". I hope the above content can be of some help to you, so that you can learn more knowledge. if you think the article is good, please share it out for more people to see.

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Development

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report