
Conversation

@yaauie yaauie (Member) commented Sep 4, 2025

Release notes

Adds support for event compression in the persisted queue, controlled by the per-pipeline queue.compression setting, which defaults to none.

What does this PR do?

Adds non-breaking support for event compression to the persisted queue, as
configured by a new per-pipeline setting queue.compression, which supports:

  • none (default): no compression is performed, but if compressed events are encountered in the queue they will be decompressed
  • speed: compression optimized for speed (minimal overhead, but less compression)
  • balanced: compression balancing speed against result size
  • size: compression optimized for maximum reduction of size (minimal size, but more resource-intensive)
  • disabled: compression support entirely disabled; if a pipeline is run in this configuration against a PQ that already contains unacked compressed events, the pipeline WILL crash.

This PR performs the necessary refactors as no-op stand-alone commits to make reviewing more straightforward. It is best reviewed in commit order.

Why is it important/What is the impact to the user?

Disk IO is often a performance bottleneck when using the PQ. This feature allows users to spend available CPU resources to reduce the size of events on disk, and therefore also the disk IO.

Checklist

  • My code follows the style guidelines of this project
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • I have made corresponding changes to the default configuration files (and/or docker env variables)
  • I have added tests that prove my fix is effective or that my feature works

Author's Checklist

  • [ ]

How to test this PR locally

  • Add an ndjson file named example-input.ndjson containing newline-delimited JSON events
  • Run Logstash with trace-logging enabled, using -S to set queue.type=persisted, queue.drain=true, and queue.compression=size:
    bin/logstash --log.level=trace \
    -Squeue.type=persisted \
    -Squeue.drain=true \
    -Squeue.compression=size \
    --config.string 'input { stdin { codec => json_lines } } output { sink {} }' < example-input.ndjson
    
  • Observe trace logs showing compression and decompression:
    [2025-09-04T22:36:07,055][TRACE][org.logstash.ackedqueue.Queue][main][454a7f73ae57dfec89e89329c9e0eba182f7780fd885c7bf2f17d8afba2bba67] serialized: 9000->3645
    [2025-09-04T22:36:07,056][TRACE][org.logstash.ackedqueue.Queue][main][454a7f73ae57dfec89e89329c9e0eba182f7780fd885c7bf2f17d8afba2bba67] serialized: 9448->3838
    [2025-09-04T22:36:07,057][TRACE][org.logstash.ackedqueue.Queue][main][454a7f73ae57dfec89e89329c9e0eba182f7780fd885c7bf2f17d8afba2bba67] serialized: 7160->2666
    [2025-09-04T22:36:07,059][TRACE][org.logstash.ackedqueue.Queue][main][454a7f73ae57dfec89e89329c9e0eba182f7780fd885c7bf2f17d8afba2bba67] serialized: 8642->3572
    [2025-09-04T22:36:07,060][TRACE][org.logstash.ackedqueue.Queue][main][454a7f73ae57dfec89e89329c9e0eba182f7780fd885c7bf2f17d8afba2bba67] serialized: 8714->3739
    [2025-09-04T22:36:07,061][TRACE][org.logstash.ackedqueue.Queue][main][454a7f73ae57dfec89e89329c9e0eba182f7780fd885c7bf2f17d8afba2bba67] serialized: 8048->3500
    [2025-09-04T22:36:07,063][TRACE][org.logstash.ackedqueue.Queue][main][454a7f73ae57dfec89e89329c9e0eba182f7780fd885c7bf2f17d8afba2bba67] serialized: 10007->3871
    

    ...

    [2025-09-04T22:36:09,428][TRACE][org.logstash.ackedqueue.Queue][main] deserialized: 3594->8060
    [2025-09-04T22:36:09,428][TRACE][org.logstash.ackedqueue.Queue][main] deserialized: 3571->8622
    [2025-09-04T22:36:09,428][TRACE][org.logstash.ackedqueue.Queue][main] deserialized: 3673->9051
    [2025-09-04T22:36:09,428][TRACE][org.logstash.ackedqueue.Queue][main] deserialized: 3538->8343
    [2025-09-04T22:36:09,428][TRACE][org.logstash.ackedqueue.Queue][main] deserialized: 3729->8943
    [2025-09-04T22:36:09,428][TRACE][org.logstash.ackedqueue.Queue][main] deserialized: 2683->7460
    [2025-09-04T22:36:09,428][TRACE][org.logstash.ackedqueue.Queue][main] deserialized: 3928->10059
    [2025-09-04T22:36:09,428][TRACE][org.logstash.ackedqueue.Queue][main] deserialized: 3563->8329
    [2025-09-04T22:36:09,428][TRACE][org.logstash.ackedqueue.Queue][main] deserialized: 2708->7285
    
  • Inspect the page(s) left behind with lsq-pagedump:
    2099    3851    0EB311A1        page.0  ZSTD(9552)
    2100    3961    D497496F        page.0  ZSTD(10416)
    2101    2667    59F1903D        page.0  ZSTD(6978)
    2102    3677    B442D62D        page.0  ZSTD(9006)
    2103    3748    3EEF1737        page.0  ZSTD(8791)
    2104    2746    DE697FE9        page.0  ZSTD(7903)
    

Related issues

Use cases

  • Constrained or metered disk IO
  • Limited disk capacity

The `ackedqueue.SettingsImpl` uses an _immutable_ builder, which makes
adding options cumbersome; each additional property requires modifying the
code for all existing options.

By introducing an API-internal temporary mutable builder, we can simplify
the process of creating an immutable copy that has a single component
modified.
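
A minimal sketch of the pattern (names are illustrative, not the actual `SettingsImpl` fields):

```java
// Illustrative copy-builder sketch; the class and field names here are hypothetical
// and do not mirror the actual ackedqueue.SettingsImpl API.
final class QueueSettingsSketch {
    private final long maxBytes;
    private final int checkpointMaxAcks;

    private QueueSettingsSketch(Builder builder) {
        this.maxBytes = builder.maxBytes;
        this.checkpointMaxAcks = builder.checkpointMaxAcks;
    }

    // Temporary mutable builder seeded from an existing immutable instance, so a caller
    // can change a single component without re-listing every other option.
    Builder toBuilder() {
        return new Builder(this);
    }

    static final class Builder {
        private long maxBytes = 1024L * 1024L * 1024L;
        private int checkpointMaxAcks = 1024;

        Builder() { }

        Builder(QueueSettingsSketch existing) {
            this.maxBytes = existing.maxBytes;
            this.checkpointMaxAcks = existing.checkpointMaxAcks;
        }

        Builder maxBytes(long maxBytes) { this.maxBytes = maxBytes; return this; }
        Builder checkpointMaxAcks(int acks) { this.checkpointMaxAcks = acks; return this; }

        QueueSettingsSketch build() { return new QueueSettingsSketch(this); }
    }
}

// Usage: copy everything from an existing instance, override only the piece that changes.
// QueueSettingsSketch updated = existing.toBuilder().checkpointMaxAcks(2048).build();
```
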
Adds non-breaking support for event compression to the persisted queue, as
configured by a new per-pipeline setting `queue.compression`, which supports:

 - `none` (default): no compression is performed, but if compressed events
                     are encountered in the queue they will be decompressed
 - `speed`: compression optimized for speed
 - `balanced`: compression balancing speed against result size
 - `size`: compression optimized for maximum reduction of size
 - `disabled`: compression support entirely disabled; if a pipeline is run
               in this configuration against a PQ that already contains
               unacked compressed events, the pipeline WILL crash.

To accomplish this, we then provide an abstract base implementation of the
CompressionCodec whose decode method is capable of _detecting_ and decoding
zstd-encoded payload while letting other payloads through unmodified.
The detection is done with an operation on the first four bytes of the
payload, so no additional context is needed.
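
As a concrete illustration of that detection (a sketch, not the PR's exact code; `zstdDecompress` below is a hypothetical stand-in for the real decompression call):

```java
// Every Zstandard frame begins with the magic number 0xFD2FB528 stored little-endian,
// i.e. the bytes 0x28 0xB5 0x2F 0xFD, so compressed payloads can be told apart from
// plain serialized events without any extra framing or metadata.
final class ZstdDetectionSketch {
    static boolean isZstdFrame(final byte[] payload) {
        return payload.length >= 4
                && payload[0] == (byte) 0x28
                && payload[1] == (byte) 0xB5
                && payload[2] == (byte) 0x2F
                && payload[3] == (byte) 0xFD;
    }

    // Non-zstd payloads pass through unmodified; only detected frames are decompressed.
    static byte[] decode(final byte[] payload) {
        return isZstdFrame(payload) ? zstdDecompress(payload) : payload;
    }

    // Hypothetical stand-in for the actual zstd decompression call.
    private static byte[] zstdDecompress(final byte[] payload) {
        throw new UnsupportedOperationException("stand-in for real zstd decompression");
    }
}
```
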

An instance of this zstd-aware compression codec is provided with a
pass-through encode operation when configured with `queue.compression: none`,
which is the default, ensuring that by default logstash is able to decode any
event that had previously been written.

We provide an additional implementation that is capable of _encoding_ events
with a configurable goal: speed, size, or a balance of the two.
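
One way such a goal could map to a zstd compression level (illustrative only; the concrete levels chosen by the PR are not shown in this conversation, so the numbers below are assumptions):

```java
import com.github.luben.zstd.Zstd;

final class GoalBasedEncoderSketch {
    enum Goal { SPEED, BALANCED, SIZE }

    static byte[] encode(final byte[] serialized, final Goal goal) {
        final int level = switch (goal) {
            case SPEED    -> 1;   // fastest, least reduction (assumed)
            case BALANCED -> 3;   // zstd's default level (assumed mapping)
            case SIZE     -> 19;  // near-maximum reduction, most CPU (assumed)
        };
        return Zstd.compress(serialized, level); // zstd-jni static API
    }
}
```
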
@yaauie yaauie added the enhancement, persistent queues, and backport-skip (Skip automated backport with mergify) labels on Sep 4, 2025
github-actions bot (Contributor) commented Sep 4, 2025

🤖 GitHub comments

Just comment with:

  • run docs-build : Re-trigger the docs validation. (use unformatted text in the comment!)

github-actions bot (Contributor) commented Sep 4, 2025

🔍 Preview links for changed docs

@jsvd jsvd (Member) left a comment

first pass, minor annotations, going to test this manually now.

settings.getQueueMaxBytes(), settings.getMaxUnread(), settings.getCheckpointMaxAcks(),
settings.getCheckpointMaxWrites(), settings.getCheckpointRetry()
);
return new BuilderImpl(settings);
Member

Just a suggestion, this Builder refactoring could have been a separate PR as it doesn't require the compression settings at all and is still a significant part of the changeset in this PR.

Member

Another reason to introduce this change ASAP: yet another parameter is coming in https://github.com/elastic/logstash/pull/18000/files

Member Author

pared off as #18180

byte[] serializedBytes = element.serialize();
byte[] data = compressionCodec.encode(serializedBytes);

logger.trace("serialized: {}->{}", serializedBytes.length, data.length);
Member

I'd suggest this to be moved to the zstd aware codec, as we don't want a flood of "serialized X -> X" in the trace logs when the noop is used.

import java.util.function.Supplier;

/**
* A {@link CleanerThreadLocal} is semantically the same as a {@link ThreadLocal}, except that a clean action
Member

Should we get this utility out of the PR into a separate one if we still want it? Right now it's only used by its own tests.
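
For context, a rough sketch of what such a utility could look like, inferred from the truncated javadoc above (a guess at the idea, not the PR's implementation):

```java
import java.lang.ref.Cleaner;
import java.util.function.Consumer;
import java.util.function.Supplier;

// A ThreadLocal whose per-thread values get a clean action registered with
// java.lang.ref.Cleaner, so native resources (e.g. zstd contexts) are released
// once the owning thread becomes unreachable.
final class CleanerThreadLocalSketch<T> {
    private static final Cleaner CLEANER = Cleaner.create();
    private final ThreadLocal<T> values;

    CleanerThreadLocalSketch(final Supplier<T> supplier, final Consumer<T> cleanAction) {
        this.values = ThreadLocal.withInitial(() -> {
            final T value = supplier.get();
            // Register against the owning thread; the runnable captures only the value,
            // so the clean action runs once the thread itself has been collected.
            CLEANER.register(Thread.currentThread(), () -> cleanAction.accept(value));
            return value;
        });
    }

    T get() {
        return values.get();
    }
}
```
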

Comment on lines 4 to 5
import org.logstash.util.CleanerThreadLocal;
import org.logstash.util.SetOnceReference;
Member

These are not used in the abstract class

Suggested change
import org.logstash.util.CleanerThreadLocal;
import org.logstash.util.SetOnceReference;

Comment on lines 7 to 11
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.lang.ref.Cleaner;
import java.util.zip.DataFormatException;
import java.util.zip.Inflater;
Member

Some more cleanup:

Suggested change
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.lang.ref.Cleaner;
import java.util.zip.DataFormatException;
import java.util.zip.Inflater;

@jsvd jsvd (Member) commented Sep 5, 2025

Looking at profiling, it seems that with Zstd.compress/decompress the instance spends nearly 9% of the time doing context initializations:

Screenshot 2025-09-05 at 18 49 12

profile: profile.html

Something worth investigating is the use of thread locals for the contexts.
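
A rough sketch of that idea, assuming zstd-jni's `ZstdCompressCtx`/`ZstdDecompressCtx` API (not the PR's final code): each thread pays the context-initialization cost once instead of on every compress/decompress call.

```java
import com.github.luben.zstd.ZstdCompressCtx;
import com.github.luben.zstd.ZstdDecompressCtx;

final class ReusableZstdContextsSketch {
    private static final ThreadLocal<ZstdCompressCtx> COMPRESS_CTX =
            ThreadLocal.withInitial(() -> {
                final ZstdCompressCtx ctx = new ZstdCompressCtx();
                ctx.setLevel(3); // level is an assumption; would come from the configured goal
                return ctx;
            });

    private static final ThreadLocal<ZstdDecompressCtx> DECOMPRESS_CTX =
            ThreadLocal.withInitial(ZstdDecompressCtx::new);

    static byte[] compress(final byte[] data) {
        return COMPRESS_CTX.get().compress(data);
    }

    static byte[] decompress(final byte[] data, final int originalSize) {
        return DECOMPRESS_CTX.get().decompress(data, originalSize);
    }
}
```
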

Co-authored-by: João Duarte <jsvd@users.noreply.github.com>
@elasticmachine (Collaborator)

💚 Build Succeeded

History

cc @yaauie
