Better `Debug` instances and `__repr__` methods. #891

paraseba · 2025-04-03T17:38:00Z

Python's `SnapshotInfo` now also includes the Manifests info.

dcherian · 2025-04-03T17:56:48Z

icechunk-python/src/repository.rs

            written_at: val.flushed_at,
            message: val.message,
            metadata: val.metadata.into(),
+            manifests: val.manifests.into_iter().map(|v| v.into()).collect(),


Isn't this potentially a really big list that will now be listed out with ancestry?

Maybe we should have `SnapshotInfo.list_manifests()` instead?

Yes, but ancestry is very slow anyway, since it fetches each snapshot from object store. Gathering the few manifests cannot add much.

Do you think the list of manifests could get very long with split? Don't forget this is only done when you try to print a snapshot, not something you would do a lot, or would expect to be fast.

Scratch my last comment about "print". What I should have said is: these objects get populated only by ancestry, which is very slow, and they are mostly used and discarded. I don't imagine gathering a few hundred extra objects will become a problem.

Ah, I understand now. My goal with this was to give us a bit more insight for debugging, using the `lookup_snapshot` method. I like the idea of having a dedicated ancestry representation that could be text in the console, HTML in a notebook, etc.

An updating weather forecast repo might have O(100) manifests in each snapshot, one per array, for example.

But we are not fetching those manifests, only surfacing the data that is already in the snapshot, maybe 30 bytes per manifest or so.

But this definitely has an impact on the repr for snapshots. Do you want me to skip manifests there? We can do that.

Yes, I'm only concerned about the repr, because we use it for ancestry right now.

Yes, let's just comment it out from the repr for now.

This will help for my update to test_can_read_old.py

Done in f9baf05
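The outcome of this thread (keep `manifests` on `SnapshotInfo`, but leave it out of the repr) can be illustrated with a minimal pure-Python sketch. This is not the actual implementation: the real class is defined in Rust in `icechunk-python/src/repository.rs`, and both the `ManifestFileInfo` name and its fields below are hypothetical stand-ins chosen for illustration. The sketch only shows the general technique of keeping an attribute accessible while excluding it from `__repr__`, here via the standard library's `dataclasses.field(repr=False)`.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class ManifestFileInfo:
    # Hypothetical stand-in for the small per-manifest record stored in a snapshot.
    id: str
    size_bytes: int


@dataclass
class SnapshotInfo:
    # Hypothetical pure-Python mirror of the class discussed in this PR.
    id: str
    written_at: datetime
    message: str
    metadata: dict = field(default_factory=dict)
    # repr=False keeps the attribute on the object but omits it from the
    # generated __repr__, so printing many snapshots stays compact.
    manifests: list = field(default_factory=list, repr=False)


snap = SnapshotInfo(
    id="SNAPSHOT-1",
    written_at=datetime(2025, 4, 3, tzinfo=timezone.utc),
    message="update forecast",
    # Roughly the O(100)-manifests case mentioned in the review.
    manifests=[ManifestFileInfo(id=f"M{i}", size_bytes=30) for i in range(100)],
)

print(repr(snap))           # compact: the manifests field is not printed
print(len(snap.manifests))  # the data is still available on the object
```

With this shape, code that prints each entry of an ancestry listing is unaffected by the number of manifests, while debugging code can still inspect `snap.manifests` directly.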

paraseba requested review from dcherian and mpiannucci April 3, 2025 17:38

paraseba force-pushed the push-xzkplxxuwkrr branch from 88eee6a to c2b8a4a Compare April 3, 2025 17:46

Better Debug instances and __repr__ methods.

2a04921

Python's `SnapshotInfo` now also includes the Manifests info.

paraseba force-pushed the push-xzkplxxuwkrr branch from c2b8a4a to 2a04921 Compare April 3, 2025 17:48

dcherian reviewed Apr 3, 2025

Remove manifests from SnapshotInfo __repr__

f9baf05

paraseba requested a review from dcherian April 3, 2025 20:23

dcherian approved these changes Apr 3, 2025

Merge branch 'main' into push-xzkplxxuwkrr

5f26319

dcherian enabled auto-merge (squash) April 3, 2025 20:30

dcherian merged commit c094590 into main Apr 3, 2025
7 of 8 checks passed

dcherian deleted the push-xzkplxxuwkrr branch April 3, 2025 20:36

dcherian added a commit that referenced this pull request Apr 3, 2025

Merge branch 'main' into split-manifests

2ea8e2a

* main:
  Better `Debug` instances and __repr__ methods. (#891)
  Add chunk container repr, fix test dataset (#893)
