Allow seeking back to read skipped chunks by fintelia · Pull Request #657 · image-rs/image-png

fintelia · 2025-11-28T20:03:23Z

With #630 in place, we now have most of the pieces to load PNG metadata without consuming an unbounded amount of space.

The key here is that we track just the start position of each unbounded-size chunk we might care about. Which means that we can track thousands of text chunks with even a tiny space limit. When it comes time to read the chunks, we can seek back to them and provide the desired info. But if the chunk info is never requested then we don't pay the overhead of storing them. When in recording-mode, I believe that the only non-constant memory usage should be recording text chunk positions and reserving two rows of space for the unfiltering buffer.

This is particularly useful for the image crate which needs to know whether there's a tRNS chunk (to determine whether the color type is RGB or RGBA) before it knows whether the EXIF or ICC profile will be requested. By doing things this way, it neatly sidesteps the question of setting decoding limits before reading metadata. The main thing decoding limits were being used for was prevent zip-bombs in the compressed iCCP chunks and to a lesser degree preventing unreasonably sized EXIF or text chunks.

197g

I like it in principle, seeking is a capability that will be highly in the streaming decoder. Since all the seeks are relation we should make sure that intermittent interrupts work correctly with the position tracking. I don't forsee many problems with this but a few tests would still be added.

197g · 2025-11-28T23:02:56Z

src/decoder/stream.rs

+                if self.exif_position.is_none() {
+                    self.exif_position = Some(chunk_start);
+                }


The error treatment here differs from parsing the eXIf and iCCP chunk. In the latter case we error first with a Format and then later, probably, downgrade that to BadAncillaryChunk in most cases. Here however we do nothing.

Also do we consider the option changing through set_ignore_exif_chunk while it is read, i.e. first reading one fully and then only recording a position would still be a duplicate chunk. That may not happen but without the interface really assuring it we should know if that is in-scope of the implementation or not.

197g · 2025-11-28T23:09:44Z

src/decoder/stream.rs

+        let chunk_start = stream_position - 8 - self.current_chunk.raw_bytes.len() as u64;
+
+        match self.current_chunk.type_ {
+            chunk::tEXt | chunk::zTXt | chunk::iTXt => {


Wouldn't we still want to know the exact kind of text chunk? zTXt is handled very different from the other chunks. We can of course parse it back when we seek back and re-read the chunk anew but it's fixed-size metadata—so we might as well.

Yeah, I wasn't sure whether to save the chunk size/type or just re-read it when requested. Easy enough to track it alongside the chunk start position.

Allow recording chunk locations while skipping them

342680e

197g reviewed Nov 28, 2025

View reviewed changes

fintelia mentioned this pull request Dec 2, 2025

Set chunk action for IDAT and fdAT #658

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow seeking back to read skipped chunks#657

Allow seeking back to read skipped chunks#657
fintelia wants to merge 1 commit intoimage-rs:masterfrom
fintelia:seek-chunks

fintelia commented Nov 28, 2025

Uh oh!

197g left a comment

Uh oh!

197g Nov 28, 2025

Uh oh!

197g Nov 28, 2025

Uh oh!

197g Nov 28, 2025

Uh oh!

fintelia Dec 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

fintelia commented Nov 28, 2025

Uh oh!

197g left a comment

Choose a reason for hiding this comment

Uh oh!

197g Nov 28, 2025

Choose a reason for hiding this comment

Uh oh!

197g Nov 28, 2025

Choose a reason for hiding this comment

Uh oh!

197g Nov 28, 2025

Choose a reason for hiding this comment

Uh oh!

fintelia Dec 7, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants