[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <213343dc-3911-45de-8195-469da9dd1a91@wiesinger.com>
Date: Tue, 21 Jan 2025 19:47:24 +0100
From: Gerhard Wiesinger <lists@...singer.com>
To: "Theodore Ts'o" <tytso@....edu>
Cc: linux-ext4@...r.kernel.org
Subject: Re: Transparent compression with ext4 - especially with zstd
On 21.01.2025 05:01, Theodore Ts'o wrote:
> On Sun, Jan 19, 2025 at 03:37:27PM +0100, Gerhard Wiesinger wrote:
>> Are there any plans to include transparent compression with ext4 (especially
>> with zstd)?
> I'm not aware of anyone in the ext4 deveopment commuity working on
> something like this. Fully transparent compression is challenging,
> since supporting random writes into a compressed file is tricky.
> There are solutions (for example, the Stac patent which resulted in
> Microsoft to pay $120 million dollars), but even ignoring the
> intellectual property issues, they tend to compromise the efficiency
> of the compression.
>
> More to the point, given how cheap byte storage tends to be (dollars
> per IOPS tend to be far more of a constraint than dollars per GB),
> it's unclear what the business case would be for any company to fund
> development work in this area, when the cost of a slightly large HDD
> or SSD is going to be far cheaper than the necessary software
> engineering investrment needed, even for a hyperscaler cloud company
> (and even there, it's unclear that transparent compression is really
> needed).
>
> What is the business and/or technical problem which you are trying to
> solve?
>
Regarding necessity:
We are talking in some scenarios about some factors of diskspace. E.g.
in my database scenario with PostgreSQL around 85% of disk space can be
saved (e.g. around factor 7).
In cloud usage scenarios you can easily reduce that amount of allocated
diskspace by around a factor 7 and reduce cost therefore.
You might also get a performance boost by using caching mechanism more
efficient (e.g. using less RAM).
Also with precompressed files (e.g. photo, videos) you can safe around
5-10% overall disk space which sounds less but in the area of several
hundred Gigabytes or even some Petabytes this is a lot of storage.
On evenly distributed data store you can save even more.
The technical topic is that IMHO no stable and practical usable Linux
filesystem which is included in the default kernel exists.
- ZFS works but is not included in the default kernel
- BTRFS has stability and repair issues (see mailing lists) and bugs
with compression (does not compress on the fly in some scenarios)
- bcachefs is experimental
Regarding patents: IMHO at least the STAC patents are all no longer
valid anymore.
Thnx.
Ciao,
Gerhard
Powered by blists - more mailing lists