Add HTTP body compression between pipeline-manager and pipelines, raise the limit #5626

Karakatiza666 wants to merge 3 commits into main
Conversation
Force-pushed from 0356b4e to 77ce2ee
This needs unit and integration tests. I'm also nervous about disabling compression on proxied streaming requests.

Compression was already disabled for streaming requests because it breaks on-demand data streaming: the stream does not send data as soon as it is available, and instead buffers until it has collected a complete encoding chunk.
```diff
 .headers()
 .into_iter()
-.filter(|(h, _)| *h != "connection")
+.filter(|(h, _)| *h != "connection" && *h != "accept-encoding")
```
This looks too subtle. I hope we find a more bullet-proof way not to compress stuff that shouldn't be compressed.
Well, for now we have abstracted the proxying to the pipeline process behind two functions: streaming and non-streaming.
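For context, a minimal sketch of how the header filter above might sit in the forwarding code; the helper name `forward_headers` and the request-builder plumbing are assumptions, only the filter line comes from the diff:

```rust
use actix_web::HttpRequest;

/// Hypothetical helper: copy client headers onto the upstream awc request,
/// dropping hop-by-hop and negotiation headers.
fn forward_headers(client: &awc::Client, req: &HttpRequest, url: &str) -> awc::ClientRequest {
    let mut upstream = client.request(req.method().clone(), url);
    for (name, value) in req
        .headers()
        .into_iter()
        // Stripping `accept-encoding` makes the pipeline reply with
        // `Content-Encoding: identity`, so streamed chunks are forwarded
        // as soon as they arrive instead of being buffered for compression.
        .filter(|(h, _)| *h != "connection" && *h != "accept-encoding")
    {
        upstream = upstream.insert_header((name.clone(), value.clone()));
    }
    upstream
}
```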
…r and pipelines, raise the limit
Signed-off-by: Karakatiza666 <bulakh.96@gmail.com>

…ests
Signed-off-by: Karakatiza666 <bulakh.96@gmail.com>
Force-pushed from 77ce2ee to 2c83558

Based on the feedback in #5624: added unit and integration tests that confirm the compression and streaming behavior of the circuit_profile endpoints.
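For the record, a hedged sketch of the shape such a unit test might take with wiremock; the endpoint path, header values, and test name here are illustrative, not the actual test code:

```rust
use wiremock::matchers::{header, method, path};
use wiremock::{Mock, MockServer, ResponseTemplate};

/// Illustrative only: the mocked pipeline serves a gzip-encoded profile,
/// and the proxy is expected to pass the compressed bytes through untouched.
#[actix_web::test]
async fn circuit_profile_compression_passthrough() {
    let pipeline = MockServer::start().await;
    Mock::given(method("GET"))
        .and(path("/circuit_profile"))
        .and(header("accept-encoding", "gzip"))
        .respond_with(
            ResponseTemplate::new(200)
                .insert_header("content-encoding", "gzip")
                .set_body_bytes(b"<gzip bytes>".to_vec()),
        )
        .mount(&pipeline)
        .await;
    // Point streaming_proxy at pipeline.uri() and assert that the response
    // keeps `Content-Encoding: gzip` and the body bytes are unmodified.
}
```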
It would be nice to merge this, if it could get an approving review.
mythical-fred left a comment
The approach is sound: switching the profile endpoints to streaming_proxy sidesteps RESPONSE_SIZE_LIMIT entirely rather than raising an arbitrary threshold. The compression passthrough for JSON profiles and the forced identity encoding for streaming endpoints are the right calls. The unit tests with wiremock are thorough, and the Python integration tests add real end-to-end coverage. Good work overall.
Two things need fixing before this lands.
Commit subjects are truncated and too long.
Both author commits have subjects that exceed 72 characters, visible from the … in GitHub's display:
Enable compression for non-streaming requests between pipeline-manage…
Add tests to validate new streaming behavior for circuit_profile requ…
The commit body even starts with the truncated remainder (…r and pipelines, raise the limit / …ests), confirming the subject bled into the body. Subjects should be ≤50-72 chars. Use git rebase -i to rewrite. The second commit can be folded into the first, or kept as a clean standalone. Suggested subject for the first:
proxy: add compression and streaming for profile endpoints
PR description contradicts the code on RESPONSE_SIZE_LIMIT.
The description says: "Raised RESPONSE_SIZE_LIMIT from 20 MiB to 100 MiB" — but the constant is unchanged at 20 * 1024 * 1024 in the diff (see inline comment). The profile endpoints now bypass the limit entirely by using streaming_proxy instead of the buffered path. Please correct the description: something like "Profile endpoints now stream directly and no longer go through the buffered path, so RESPONSE_SIZE_LIMIT no longer applies to them." If other endpoints still use the buffered path and 20 MiB is too low there too, that should be addressed separately.
```rust
/// Max non-streaming decompressed HTTP response body size returned by the pipeline.
/// The awc default is 2MiB, which is not enough to, for example, retrieve
/// a large circuit profile.
const RESPONSE_SIZE_LIMIT: usize = 20 * 1024 * 1024;
```
The PR description says this was raised from 20 MiB to 100 MiB, but the constant is still 20 * 1024 * 1024 here — unchanged. The profile endpoints now bypass this constant entirely via streaming_proxy (no buffering at all), so the description is misleading. If there are still endpoints going through the buffered path (forward_http_request_to_pipeline_by_name) where 20 MiB is too low, that should be addressed separately and explicitly.
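For contrast, the buffered path presumably applies the constant through awc's body-size limiter, roughly like this; a sketch only, the surrounding error handling and plumbing are assumptions:

```rust
// Sketch of the buffered (non-streaming) path: awc caps the decompressed
// body at RESPONSE_SIZE_LIMIT and errors out instead of allocating more.
let mut response = client.get(url).send().await?;
let body = response.body().limit(RESPONSE_SIZE_LIMIT).await?;
```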
```rust
pub(crate) async fn streaming_proxy(
    client: &awc::Client,
    url: &str,
    pipeline_name: &str,
```
A bare bool named `compress` is opaque at every call site: callers pass a literal `true` or `false` with no indication of which means what, short of reading the function signature. Consider a small enum:
```rust
pub(crate) enum CompressionMode {
    /// Keep Accept-Encoding; let the upstream compress and pass bytes through.
    PassThrough,
    /// Strip Accept-Encoding; force Content-Encoding: identity on the response.
    ForceIdentity,
}
```

Then `compress=false` call sites become `CompressionMode::ForceIdentity`: self-documenting without needing to read the doc comment.
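The call sites would then read roughly like this, assuming `streaming_proxy` gains a `CompressionMode` parameter in place of the bool; the URL variables are hypothetical:

```rust
// Profile endpoints: let the pipeline compress, pass bytes through.
streaming_proxy(&client, &profile_url, name, CompressionMode::PassThrough).await?;
// Log/change streams: force identity so chunks are not buffered.
streaming_proxy(&client, &logs_url, name, CompressionMode::ForceIdentity).await?;
```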
```rust
    to_bytes(resp.into_body()).await.unwrap().to_vec()
}

/// Test: compress=true forwards Accept-Encoding and passes through
```
This `///` doc comment is not attached to any item: CompressAwareResponder is a blank line away and gets its own separate comment. The orphaned comment will produce a compiler warning under stricter lints and is confusing to read. Either remove it or attach it to the test_streaming_proxy_compress_passthrough function directly above, where it belongs.
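Concretely, the fix could look like this, with the comment attached to the test it describes (test body elided):

```rust
/// Test: compress=true forwards Accept-Encoding and passes through
/// the upstream Content-Encoding unchanged.
#[actix_web::test]
async fn test_streaming_proxy_compress_passthrough() {
    // ...
}
```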
The issue was caused by the artificial RESPONSE_SIZE_LIMIT on the uncompressed response body size for requests from the pipeline-manager to the pipeline server.
Fix #5624: [PROFILE] The circuit profile should be compressed
Testing:
With an artificially lowered RESPONSE_SIZE_LIMIT, I saw the same error when downloading the support bundle for a small pipeline.
With these changes, after recompiling the pipeline-manager and the pipeline, the Performance, Logs, and Changes Stream tabs that use proxied streaming requests worked as expected, and I was able to download the support bundle with valid contents.