Streams are variables used to interact with large amounts of data without needing to keep it all loaded in RAM.
You can think of a ReadStream
as like a function that returns a string/Buffer, but only a little at a time, and a WriteStream
as like a function that takes a string/Buffer as input, but only a little at a time. An ObjectReadStream
is like a function that takes an array as input, but only a little at a time, and an ObjectWriteStream
is like a function that returns an array, but only a little at a time.
Node.js comes with built-in support for streams, and there is also a WHATWG Streams spec (which are incompatible, of course). Both APIs are hard to use and have unnecessary amounts of boilerplate; the Node version more so. This API can wrap Node’s API, or it can be used independently, and is a lot easier to use.
An overview:
WriteStream
is a string/Buffer write stream. Write to it with writeStream.write(data)
.ReadStream
is a string/Buffer read stream. Read inputs by line with readStream.readLine()
or by chunk with readStream.read()
, or pipe inputs to a WriteStream
with readStream.pipe(writeStream)
.ReadWriteStream
is a string/Buffer read/write stream.
ObjectWriteStream
is a write stream for arbitrary objects. Write to it with writeStream.write(data)
.ObjectReadStream
is a read stream for arbitrary objects. Read inputs by line with readStream.readLine()
or by chunk with readStream.read()
, or pipe inputs to a WriteStream
with readStream.pipe(writeStream)
.ObjectReadWriteStream
is a read/write stream for arbitrary objects.These streams are not API-compatible with Node streams, but can wrap them.
Buffer stream methods often take an “override encoding” parameter. Normally, you should never use it: streams will automatically convert between strings and Buffers using their built-in encoding, which defaults to utf8
, and can be changed by setting stream.encoding
.
However, if for some reason you need to change which encoding you use on a per-read or per-write basis, you can pass the relevant encoding to the individual methods.
A WriteStream
can be written to. You can think of it like a function taking a string/Buffer as an argument, except you can give it the string/Buffer one chunk at a time, instead of all at once.
So you can think of these as being the same thing:
// option 1: do it normally
await FS('file.txt').write("Here are some words.\n");
// option 2: do it as a stream
const stream = FS('file.txt').createWriteStream();
await stream.write("Here a");
await stream.write("re some ");
await stream.write("words.\n"); // OR: await stream.writeLine("words.");
await stream.writeEnd();
The stream version lets you do it a bit at a time instead of all at once, so you use less memory.
chunk {string |
Buffer | null} data to write |
encoding
override encodingWrites to the stream. writeStream.write(null)
is equivalent to writeStream.writeEnd()
.
chunk
{string} dataencoding
override encodingWrites a line to the stream. Equivalent to writeStream.write(chunk + '\n')
.
Ends the write stream.
This tells the write stream that you’re done writing to it. In the Buffer/string analogy, it means you’ve reached the end of the Buffer/string. Certain write streams require this.
A ReadStream
can be read from. You can think of it like a function that returns a string/Buffer, except you can read it out one chunk at a time, instead of all at once.
So you can think of these as being the same thing:
// option 1: do it normally
const contents1 = await FS('file.txt').read();
console.log(contents1);
// option 3: do it as a stream
const stream = FS('file.txt').createReadStream();
let contents2 = '';
let chunk;
while ((chunk = await stream.read()) !== null) {
contents2 += chunk;
}
console.log(contents2);
// option 2: do it as a stream, by line (this can add an ending \n where there wasn't one before)
const stream = FS('file.txt').createReadStream();
let contents3 = '';
for await (const line of stream.byLine()) {
contents3 += line + '\n';
}
console.log(contents3);
The stream version lets you do it a bit at a time instead of all at once, so you use less memory (this example doesn’t use less memory, since we’re just building the full string ourselves, but you can imagine doing something else that doesn’t just build the string back).
encoding
override encodingReturns: {Promise<string | null>} the data read. |
Reads data from the read stream as fast as possible.
Specifically, this waits for the internal buffer to become non-empty, and then reads the entire contents of the internal buffer.
The Promise resolves to null
if there is no more data to be read (you have already read to the end).
There’s rarely a need to use this function directly; you either know how many bytes you’re looking for (and should be using read(byteCount)
), or you’re looking for a delimiter (and should be using readLine()
or readDelimitedBy(delimiter)
), or you just want the entire stream contents (and should be using readAll()
).
byteCount
number of bytes to readencoding
override encodingReturns: {Promise<string | null>} the data read. |
Reads byteCount
bytes from the read stream.
As readStream.read([encoding])
above, but waits for at least byteCount
bytes to become available, and then reads and returns exactly byteCount
bytes. Only if you are at the end of the read stream will fewer than byteCount
bytes be returned.
This does mean that if the stream has new data, but less than byteCount
bytes of data, the Promise will not resolve unless the stream has ended.
You may also set byteCount
to null
to make this behave like readStream.read([encoding])
above.
encoding
override encodingReturns: {Promise<string | null>} the data read. |
Reads a line (a string delimited by \n
or \r\n
) from the stream.
The equivalent of readDelimitedBy('\n')
, but chopping off any trailing \r
from the result.
encoding
override encodingReturns: {Promise<string | null>} the data read. |
Reads a line delimited by delimiter
from the stream.
Specifically: waits for either delimiter
to appear in the stream, or for the stream to end.
If the stream has reached the end with no (more) appearances of delimiter
, read everything.
Otherwise, read up to the delimiter, then discard the delimiter.
readDelimitedBy
does not distinguish between a delimiter and a terminator. If the stream ends in delimiter
, it will be treated as a terminator, otherwise it will be treated as a delimiter.
const stream = new ReadStream('a,,b,c,');
await stream.readDelimitedBy(','); // Promise<'a'>
// stream is now ',b,c'
await stream.readDelimitedBy(','); // Promise<''>
// stream is now 'b,c'
await stream.readDelimitedBy(','); // Promise<'b'>
// stream is now 'c'
await stream.readDelimitedBy(','); // Promise<'c'>
// stream is now ''
await stream.readDelimitedBy(','); // Promise<null> - NOT Promise<''>
Like readStream.read
, but doesn’t remove the read data from the read queue.
Can return synchronously. Use await
or wrap the return value in Promise.resolve()
if you need a Promise.
Like readStream.read
and readStream.peek
, but returns a Buffer instead of a string.
There are many ways of creating a ReadStream.
You can convert a string or Buffer directly into a ReadStream. These all do the same thing:
new ReadStream('abc')
new ReadStream(new Buffer('abc'))
new ReadStream({buffer: 'abc'})
new ReadStream({buffer: new Buffer('abc')})
You can convert a Node.js ReadableStream into a ReadStream. These all do the same thing:
new ReadStream(process.stdin)
new ReadStream({nodeStream: process.stdin})
Streams.stdin()
You can set up your own read
functions. These all do the same thing:
new ReadStream('abc');
class MyReadStream extends ReadStream {
values = ['a', 'bc'];
_read() {
const next = values.shift();
if (next) {
this.push(next);
} else {
this.pushEnd();
}
}
}
new MyReadStream()
const values = ['a', 'bc'];
new ReadStream({
read() {
const next = values.shift();
if (next) {
this.push(next);
} else {
this.pushEnd();
}
}
})
In general, your read
function should call either push
or pushEnd
at least once. If it plans to call push
more than once with a delay in between, it should return a Promise
that resolves after all push
es are called. Call pushEnd
to end the stream, and remember that calling push
after that will be treated as a bug and throw an error.
Remember there’s no pushLine
function - you’ll need to manually call push(line + '\n')
.