In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-01-19 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Database >
Share
Shulou(Shulou.com)05/31 Report--
This article shows you how to optimize the performance of batch SQL insertion in MySQL, the content is concise and easy to understand, it will definitely brighten your eyes. I hope you can get something through the detailed introduction of this article.
1. Insert multiple pieces of data with one SQL statement
Commonly used insert statements such as:
INSERT INTO `insert_ table` (`datetime`, `uid`, `content`, `type`) VALUES ('0times,' userid_0', 'content_0', 0); INSERT INTO `insert_ table` (`datetime`, `uid`, `content`, `type`) VALUES (' 1percent, 'userid_1',' Table1)
Modified to:
INSERT INTO `datetime`, `uid`, `content`, `type`) VALUES ('0times,' userid_0', 'content_0', 0), (' 1times, 'userid_1',' content_1', 1)
The modified insertion operation can improve the insertion efficiency of the program. The main reason for the high efficiency of the second kind of SQL is that the log volume (MySQL's binlog and innodb's transactions let the log) be reduced after the merger, and the data amount and frequency of log flushing are reduced, thus improving the efficiency. By merging SQL statements, it can also reduce the number of parsing of SQL statements and reduce the IO of network transmission.
Some test comparison data are provided here, which are to import a single piece of data and convert it into a SQL statement for import, testing 100,1000 and 10,000 data records respectively.
2. Insert in the transaction
Modify the insert to:
START TRANSACTION; INSERT INTO `insert_ table` (`datetime`, `uid`, `content`, `type`) VALUES ('0times,' userid_0', 'content_0', 0); INSERT INTO `insert_ table` (`datetime`, `uid`, `content`, `type`) VALUES (' 1times, 'userid_1',' Table1); COMMIT
Using transactions can improve the efficiency of data insertion, because when you perform an INSERT operation, a transaction is established within the MySQL, and the actual insert processing is performed within the transaction. By using transactions, you can reduce the consumption of creating transactions, and all inserts are committed after execution.
A test comparison is also provided here, where the number of records is 100, 1000, and 10, 000, respectively, when no transactions are used and transactions are used.
3. Orderly data insertion
Orderly insertion of data means that the inserted records are arranged in order on the primary key, for example, datetime is the primary key of the record:
INSERT INTO `insert_ table` (`datetime`, `uid`, `content`, `type`) VALUES ('1times,' userid_1', 'content_1', 1); INSERT INTO `insert_ table` (`datetime`, `uid`, `content`, `type`) VALUES (' 0dates, 'userid_0',' content_0', 0); INSERT INTO `insert_ table` (`datetime`, `uid`, `content`, `type`) VALUES ('2dates,' type`)
Modified to:
INSERT INTO `insert_ table` (`datetime`, `uid`, `content`, `type`) VALUES ('0dates,' userid_0', 'content_0', 0); INSERT INTO `insert_ table` (`datetime`, `uid`, `content`, `type`) VALUES (' 1dates, 'userid_1',' content_1', 1); INSERT INTO `insert_ table` (`datetime`, `uid`, `content`, `type`) VALUES ('2dates,' type`)
Because the index data needs to be maintained when the database is inserted, unordered records will increase the cost of maintaining the index. We can refer to the B+tree index used by InnoDB. If each inserted record is at the back of the index, the positioning efficiency of the index is very high, and the adjustment to the index is small; if the inserted record is in the middle of the index, B+tree is required to split and merge, which will consume more computing resources, and the index positioning efficiency of inserting records will decline, and there will be frequent disk operations when the amount of data is large.
The performance comparison between random data and sequential data is provided below, which is recorded as 100,1000,10,000, 100000, and 1 million, respectively.
From the test results, the performance of the optimization method has been improved, but the improvement is not very obvious.
4. Comprehensive performance test
Here is a test of INSERT efficiency optimization using the above three methods at the same time.
From the test results, we can see that the performance improvement of the method of merging data + transactions is obvious when the amount of data is small, and the performance will decline sharply when the amount of data is large (more than 10 million). This is because the amount of data exceeds the capacity of innodb_buffer, and each positioning index involves more disk read and write operations, resulting in a rapid decline in performance. However, the performance of merging data + transaction + ordered data is still good when the amount of data reaches tens of millions of levels. when the amount of data is large, the location of ordered data index is more convenient, and there is no need to read and write to the disk frequently. so you can maintain high performance.
Note:
The SQL statement has a length limit, which must not exceed the SQL length limit in the same SQL for data merging. It can be modified through max_allowed_packet configuration. The default is 1m, and it is changed to 8m during testing.
Transactions need to be controlled in size, and too large transactions may affect the efficiency of execution. MySQL has an innodb_log_buffer_size configuration item. If you exceed this value, the data of innodb will be brushed to disk, and the efficiency will be reduced. So it is better to commit the transaction before the data reaches this value.
The above is how to optimize the performance of bulk SQL insertion in MySQL. Have you learned any knowledge or skills? If you want to learn more skills or enrich your knowledge reserve, you are welcome to follow the industry information channel.
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
"Every 5-10 years, there's a rare product, a really special, very unusual product that's the most un
© 2024 shulou.com SLNews company. All rights reserved.