Add set_pledged_input_size to ZstdCompressor #134938

# Feature or enhancement

### Proposal:

pyzstd's `ZstdCompressor` class had a method `_set_pledged_input_size`, which allowed users to declare how much data they were going to write into a frame so that the size would be written into the frame header. We should support this use case in `compression.zstd`.
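For context on what "written into the frame header" means, here is a minimal sketch that reads the optional Frame_Content_Size field back out of a frame header, following the Zstandard frame format (RFC 8878). The function name `frame_content_size` is hypothetical and exists only for this illustration; it is not part of pyzstd or `compression.zstd`.

```python
ZSTD_MAGIC = b"\x28\xb5\x2f\xfd"  # 0xFD2FB528 stored little-endian


def frame_content_size(header: bytes):
    """Return the content size recorded in a zstd frame header, or None if absent."""
    if len(header) < 5 or header[:4] != ZSTD_MAGIC:
        raise ValueError("not a zstd frame header")
    descriptor = header[4]
    fcs_flag = descriptor >> 6              # bits 7-6: Frame_Content_Size_flag
    single_segment = (descriptor >> 5) & 1  # bit 5: Single_Segment_flag
    dict_id_flag = descriptor & 3           # bits 1-0: Dictionary_ID_flag
    # Flag 0 means "no size field" unless Single_Segment_flag is set (1 byte).
    fcs_len = (1 if single_segment else 0, 2, 4, 8)[fcs_flag]
    if fcs_len == 0:
        return None
    # Skip Window_Descriptor (absent when single-segment) and Dictionary_ID.
    offset = 5 + (0 if single_segment else 1) + (0, 1, 2, 4)[dict_id_flag]
    if len(header) < offset + fcs_len:
        raise ValueError("truncated frame header")
    value = int.from_bytes(header[offset:offset + fcs_len], "little")
    # The 2-byte encoding stores the size minus 256.
    return value + 256 if fcs_len == 2 else value
```

Without a pledged size, a streaming compressor generally cannot know the total up front, so a reader like this would return `None` for such frames.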

I don't want to add a private API that is unsafe or only for advanced users, so I want to sketch out an implementation that can be used generally and catches incorrect usage:

1) Update `ZstdCompressor`'s struct to include two `unsigned long long` members, `current_frame_size` and `pledged_size`, both initialized to `ZSTD_CONTENTSIZE_UNKNOWN`.
2) Add `set_pledged_size`. The main difference from the pyzstd implementation is that it will update `pledged_size`.
3) Modify `ZstdCompressor`'s `compress()` and `flush()` to track how much data is written to the compressor, accumulating the count in `current_frame_size`. If the mode is `FLUSH_FRAME`, then after writing, check that `current_frame_size == pledged_size` and raise a `ZstdError` if the check fails. Then reset `pledged_size` and `current_frame_size`.
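
To make the bookkeeping in the steps above concrete, here is a rough pure-Python model of it. This is only a sketch under assumptions, not the C implementation: `PledgeTracker` and its `write()` method are hypothetical names, no compression happens, `ZstdError` is a local stand-in, and the counter starts at 0 instead of `ZSTD_CONTENTSIZE_UNKNOWN` to keep the arithmetic simple. It also assumes the check is skipped when no size was pledged.

```python
# Mirrors ZSTD_CONTENTSIZE_UNKNOWN from zstd.h, defined there as (0ULL - 1).
CONTENTSIZE_UNKNOWN = 2**64 - 1

# Local mirrors of the CONTINUE / FLUSH_BLOCK / FLUSH_FRAME mode constants.
CONTINUE, FLUSH_BLOCK, FLUSH_FRAME = 0, 1, 2


class ZstdError(Exception):
    """Local stand-in for the module's ZstdError, so the sketch is self-contained."""


class PledgeTracker:
    """Hypothetical model of the proposed size tracking (performs no compression)."""

    def __init__(self):
        self.pledged_size = CONTENTSIZE_UNKNOWN
        # Assumption: count from 0 rather than starting at the sentinel value.
        self.current_frame_size = 0

    def set_pledged_size(self, size):
        # Step 2: remember the pledge so it can be verified at the end of the frame.
        self.pledged_size = size

    def write(self, data, mode=CONTINUE):
        # Step 3: account for every byte fed to the compressor.
        self.current_frame_size += len(data)
        if mode != FLUSH_FRAME:
            return
        pledged, written = self.pledged_size, self.current_frame_size
        # Reset both fields so the next frame starts with no pledge.
        self.pledged_size = CONTENTSIZE_UNKNOWN
        self.current_frame_size = 0
        if pledged != CONTENTSIZE_UNKNOWN and written != pledged:
            raise ZstdError(f"pledged {pledged} bytes but the frame received {written}")
```

In this model both fields are reset before the error is raised, so a failed frame does not leak its pledge into the next one. Whether the real implementation should reset on failure is a design choice this sketch does not settle.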

I think the one drawback of the above is that it notifies the user when something goes wrong, but if they are streaming compressed data elsewhere, they could already have sent garbage by using the API incorrectly. That's inherently not something we can really fix.

An open question: should we also check `current_frame_size <= pledged_size` at the end of writing when the mode isn't `FLUSH_FRAME`? I think probably yes.
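
If the answer is yes, the two checks could be combined roughly as follows. This is a hedged sketch only: `check_pledge` is a hypothetical helper, `ZstdError` is a local stand-in, and the constants mirror the values described earlier in this issue.

```python
# Mirrors ZSTD_CONTENTSIZE_UNKNOWN from zstd.h, defined there as (0ULL - 1).
CONTENTSIZE_UNKNOWN = 2**64 - 1

# Local mirrors of the CONTINUE / FLUSH_BLOCK / FLUSH_FRAME mode constants.
CONTINUE, FLUSH_BLOCK, FLUSH_FRAME = 0, 1, 2


class ZstdError(Exception):
    """Local stand-in for the module's ZstdError."""


def check_pledge(current_frame_size, pledged_size, mode):
    """Raise ZstdError if the bytes written so far contradict the pledge."""
    if pledged_size == CONTENTSIZE_UNKNOWN:
        return  # nothing was pledged, so there is nothing to verify
    if mode == FLUSH_FRAME:
        # The frame is ending: the totals must match exactly.
        if current_frame_size != pledged_size:
            raise ZstdError("frame ended with a size different from the pledge")
    elif current_frame_size > pledged_size:
        # Mid-frame: writing fewer bytes is fine so far, exceeding the pledge is not.
        raise ZstdError("more data written than pledged")
```

The mid-frame check would surface an overrun on the write that causes it instead of waiting until the frame is closed.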

cc @Rogdham, I'd be interested in your thoughts.

### Has this already been discussed elsewhere?

I have already discussed this feature proposal on Discourse

### Links to previous discussion of this feature:

https://discuss.python.org/t/pep-784-adding-zstandard-to-the-standard-library/87377/143


### Linked PRs
* gh-135010
* gh-135173
