This article explains the Stream module of NodeJS. The content is simple and clear, and it is easy to learn and understand; please follow along to study the Stream module of NodeJS.
First, the opening analysis
A stream is an abstract interface implemented by many objects in Node. For example, a request to an HTTP server is a stream, and stdout is also a stream. Streams are readable, writable, or both.
The idea of Stream comes from early Unix, and decades of practice have proved that it makes it easy to build large systems.
In Unix, streams are composed with the pipe operator "|". In Node, the built-in stream module is used by many core modules and third-party modules.
As in Unix, the main operation on a Node stream is .pipe(), and users can rely on the backpressure mechanism to keep reading and writing in balance.
Stream gives developers a unified, reusable interface, and this abstract interface is what controls the read-write balance between streams.
A TCP connection is both a readable stream and a writable stream, whereas HTTP is different: an HTTP request object is a readable stream, while an HTTP response object is a writable stream.
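To make the duplex point concrete, here is a minimal sketch using the built-in net module (the port number is an arbitrary choice for the example): a TCP socket is both readable and writable, so piping it into itself produces an echo server.
var net = require('net');
// Each incoming socket is a duplex stream: its readable side can be
// piped straight back into its writable side to echo client input.
var server = net.createServer(function (socket) {
    socket.pipe(socket);
});
server.listen(9999); // arbitrary port chosen for this sketch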
By default, a stream transmits data in the form of Buffers, unless you set another encoding for it. Here is an example:
var http = require('http');
var server = http.createServer(function (req, res) {
    res.writeHead(200, { 'Content-Type': 'text/plain' });
    res.end("Hello, big bear!");
});
server.listen(8888);
console.log("http server running on port 8888...");
After running, the output may be garbled because no character set was specified, such as "utf-8". Just modify it:
var http = require('http');
var server = http.createServer(function (req, res) {
    res.writeHead(200, {
        'Content-Type': 'text/plain;charset=utf-8' // add charset=utf-8
    });
    res.end("Hello, big bear!");
});
server.listen(8888);
console.log("http server running on port 8888...");
Running result: the response is now displayed correctly, without garbled characters.
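Relatedly, here is a minimal sketch of the Buffer-by-default behavior mentioned above (the data.txt path is a placeholder): without setEncoding(), 'data' events deliver Buffer objects; after setEncoding('utf8'), they deliver decoded strings.
var fs = require('fs');
var rs = fs.createReadStream(__dirname + '/data.txt'); // placeholder file
rs.setEncoding('utf8'); // chunks now arrive as utf-8 strings, not Buffers
rs.on('data', function (chunk) {
    console.log(typeof chunk); // 'string'; without setEncoding it logs 'object'
});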
Why use Stream
I/O in Node is asynchronous, so reading from and writing to disk or the network requires a callback function to receive the data. Below is a file download example:
var http = require('http');
var fs = require('fs');
var server = http.createServer(function (req, res) {
    fs.readFile(__dirname + '/data.txt', function (err, data) {
        res.end(data);
    });
});
server.listen(8888);
This code achieves the desired function, but the server must cache the entire file in memory before sending any of it. If "data.txt" is very large and concurrency is high, a lot of memory is wasted, and users must wait until the whole file has been buffered before they start receiving data, which makes for a poor experience. Fortunately, both arguments (req, res) are Streams, so we can use fs.createReadStream() instead of fs.readFile(). As follows:
var http = require('http');
var fs = require('fs');
var server = http.createServer(function (req, res) {
    var stream = fs.createReadStream(__dirname + '/data.txt');
    stream.pipe(res);
});
server.listen(8888);
The .pipe() method listens for the 'data' and 'end' events of fs.createReadStream(), so "data.txt" no longer needs to be fully cached: a data chunk can be sent to the client as soon as the connection is ready. Another benefit of .pipe() is that it automatically handles the read-write imbalance caused by a very slow client.
There are five basic kinds of Stream: Readable, Writable, Transform, Duplex, and "classic" (for specific usage, please refer to the API).
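As a sketch of one of these types (using only the built-in stream module; the upper-casing behavior is just an illustrative choice), here is a minimal Transform wired between a readable source (stdin) and a writable target (stdout):
var stream = require('stream');
// A Transform is both writable and readable: data written in comes
// back out transformed on the readable side.
var upper = new stream.Transform();
upper._transform = function (chunk, encoding, done) {
    this.push(chunk.toString().toUpperCase()); // emit the transformed chunk
    done();                                    // signal this chunk is handled
};
process.stdin.pipe(upper).pipe(process.stdout);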
Second, the introduction of examples
We need data streams when the data to be processed cannot be held in memory all at once, or when it is more efficient to read and process at the same time. NodeJS provides stream operations through its various Stream classes.
Taking a large file copy program as an example, we can create a read-only stream for the data source, as shown below:
var rs = fs.createReadStream(pathname);
rs.on('data', function (chunk) {
    doSomething(chunk); // play with the details at will
});
rs.on('end', function () {
    cleanUp();
});
In this code, 'data' events keep firing whether or not doSomething can keep up with them. The code can be modified as follows to solve this problem.
var rs = fs.createReadStream(src);
rs.on('data', function (chunk) {
    rs.pause();
    doSomething(chunk, function () {
        rs.resume();
    });
});
rs.on('end', function () {
    cleanUp();
});
A callback is added to doSomething, so we can pause reading before processing each chunk and resume reading once processing is done, as sketched below.
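For illustration, such a doSomething might look like the following; its body and the simulated delay are assumptions, not from the original, and any function that takes a chunk plus a completion callback fits the pattern.
// Hypothetical doSomething: does asynchronous work on a chunk, then
// invokes the callback so the paused read stream can be resumed.
function doSomething(chunk, callback) {
    setTimeout(function () {
        console.log('processed ' + chunk.length + ' bytes'); // stand-in for real work
        callback();
    }, 10); // simulated asynchronous delay (an assumption for this sketch)
}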
In addition, we can also create a write-only stream for the data target, as follows:
var rs = fs.createReadStream(src);
var ws = fs.createWriteStream(dst);
rs.on('data', function (chunk) {
    ws.write(chunk);
});
rs.on('end', function () {
    ws.end();
});
When writing to the write-only stream replaces doSomething, the code above becomes a file copy program. However, it still has the problem described earlier: if the write speed cannot keep up with the read speed, the write stream's internal cache will grow without bound. The return value of .write() tells us whether the incoming data was written to the target or temporarily placed in the cache, and the 'drain' event tells us when the write stream has flushed its cache and can accept the next chunk. So the code becomes:
var rs = fs.createReadStream(src);
var ws = fs.createWriteStream(dst);
rs.on('data', function (chunk) {
    if (ws.write(chunk) === false) {
        rs.pause();
    }
});
rs.on('end', function () {
    ws.end();
});
ws.on('drain', function () {
    rs.resume();
});
This moves data from the read-only stream to the write-only stream and includes backpressure control so the write cache cannot overflow. Because this usage pattern is so common (the large file copy above, for example), NodeJS provides the .pipe() method to do it directly, and its internal implementation is similar to the code above.
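Accordingly, the whole copy loop above collapses to a single line (a sketch; src and dst are the same placeholder paths used above):
var fs = require('fs');
// .pipe() performs the pause/resume and 'drain' bookkeeping internally.
fs.createReadStream(src).pipe(fs.createWriteStream(dst));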
Here is a more complete file copy program:
var fs = require('fs');
var path = require('path');
var out = process.stdout;

var filePath = '/bb/bigbear.mkv';

var readStream = fs.createReadStream(filePath);
var writeStream = fs.createWriteStream('file.mkv');

var stat = fs.statSync(filePath);
var totalSize = stat.size;
var passedLength = 0;
var lastSize = 0;
var startTime = Date.now();

readStream.on('data', function (chunk) {
    passedLength += chunk.length;
    if (writeStream.write(chunk) === false) {
        readStream.pause(); // write cache is full, stop reading
    }
});

readStream.on('end', function () {
    writeStream.end();
});

writeStream.on('drain', function () {
    readStream.resume(); // cache flushed, keep reading
});

setTimeout(function show() {
    var percent = Math.ceil((passedLength / totalSize) * 100);
    var size = Math.ceil(passedLength / 1000000); // MB copied so far
    var diff = size - lastSize;                   // MB copied in the last 500ms
    lastSize = size;
    out.clearLine();
    out.cursorTo(0);
    out.write('completed ' + size + 'MB, ' + percent + '%, speed: ' +
        diff * 2 + 'MB/s'); // *2 because we sample every 500ms
    if (passedLength < totalSize) {
        setTimeout(show, 500);
    } else {
        var endTime = Date.now();
        console.log();
        console.log('Total time: ' + (endTime - startTime) / 1000 + ' seconds.');
    }
}, 500);
You can save the above code as "copy.js" and run it. A recursive setTimeout (you could also just use setInterval) acts as a bystander: every 500ms it observes the progress, writes the completed size, percentage, and copy speed to the console, and computes the total elapsed time when the copy finishes.
Thank you for reading. The above is the content of "analyzing the Stream module of NodeJS". After studying this article, I believe you have a deeper understanding of the Stream module of NodeJS; specific usage still needs to be verified in practice.