Discussion on MYSQL MRR NLJ BNL BKA from the principle of Sequential Random Imax O 07/19 Update SLTechnology News&Howtos

Discussion on MYSQL MRR NLJ BNL BKA from the principle of Sequential Random Imax O

2025-07-19 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Database >

Shulou(Shulou.com)06/01 Report--

Discussion on MYSQL MRR NLJ BNL BKA from the principle of Sequential Random Imax O

This article only discusses the innodb storage engine, and some of the views are from the author's point of view, please point out if there is any mistake.

I. the principle of mechanical disk

The mechanical disk is composed of a moving arm, a disk, a read-write head and a spindle. The head is fixed and cannot be moved, and the corresponding sector can only be read through the disk.

Spin.

Each disk is double-sided, each side is distributed with concentric circles of the track, the track is divided into sectors generally 512 BYTES, modern disk

Generally speaking, the outer edge track has more sectors than the inner track, so the speed of reading and writing the outer edge track is faster because the rotational speed is fixed.

At the same time, the tracks with the same radius on different disks form a cylinder.

The following figure shows a typical disk organization (extracted data structure (C language version))

If we count ts (seek time) as the seek time, and tl (latency time) as the time to wait for the disk to rotate to a specific sector after the seek is completed.

Tw (transmission time) is the transfer time, so the time to read a sector is:

T (Imax 0) = ts+tl+tw

Obviously, when reading data is certain, the time of ts and tl becomes the decisive factor, but in fact, ts seek time takes longer than others.

The seek time is in the order of 10 milliseconds, and the tl time of 7200 rpm is 1000Universe 7200. it is about the order of 100 microseconds, and the transmission time is even shorter.

A large number of random Imax O will cause frequent track changes, resulting in too long a time, and it is likely that after reading a few sectors, you will soon jump to another track.

On the other hand, the sequential I _ map O can read more sectors at a time, thus minimizing the reading time.

2. Simulation of random and sequential Icantho.

The simulation is completed by calling LINUX API in C language, mainly in the following ways:

Reading a large file program is limited to 900m, while the program sequence and random reading of 20000 4096-size data, and CPY to other files

The file of cpy is 81920000 bytes

In order to reduce the impact of write operations, and magnify the impact of read operations, use the

O_CREAT | O_WRONLY | O_EXCL opens the write file. Enable OS BUFFER,write operation to write to OS kernel buffer, but cannot be opened at the same time

O_SYNC, start O_SYNC every time wirte calls fsync (), the impact of writing will be magnified.

O_RDONLY | O_DIRECT opens the read file and opens it with O_DIRECT. The purpose of opening it with O_DIRECT is to disable OS CACHE and, of course, disable pre-reading of OS to read files directly.

A picture taken from this aspect is easy to understand. In fact, I read this file after O_DIRECT, but the kernel caches it.

Of course, this program is a little complementary, I should use the sorting algorithm to sort the data in the random array and then read it, instead of taking a continuous array.

This is more telling, but it doesn't matter because random reading is ridiculously slow. The following is the result of my program.

. / a.out p10404530_112030_Linux-x86-64_1of7.zip

Fisrt sca array: 134709

Fisrt sca array: 198155

Fisrt sca array: 25305

Fisrt sca array: 46515

Fisrt sca array: 91550

Fisrt sca array: 137262

Fisrt sca array: 46134

Fisrt sca array: 10208

Fisrt sca array: 142115

Sequential cpy begin Time: Fri Dec 201: 36:55 2016

Begin cpy use sequential read buffer is 4k:

Per 25%, Time:Fri Dec 201: 36:56 2016

Per 50%, Time:Fri Dec 201: 36:57 2016

Per 75%, Time:Fri Dec 201: 36:57 2016

Per 100%, Time:Fri Dec 201: 36:58 2016

Scattered cpy begin Time: Fri Dec 201: 36:58 2016

Begin cpy use scattered read read buffer is 4k:

Per 25%, Time:Fri Dec 201: 37:51 2016

Per 50%, Time:Fri Dec 201: 38:40 2016

Per 75%, Time:Fri Dec 201: 39:29 2016

Per 100%, Time:Fri Dec 201: 40:20 2016

First output the random values in the partial array, and you can see that the reading position is random. To simulate random reading.

Then the output sequence reads and writes, and then random reads and writes are performed. You can see that there is a great difference in the use of

Iostat vmstat can see that the read speed is very slow. A comparison is given below:

-- order