Skip to content

Devise a mechanism for determining whether a file already exists at a destination. #66

@jeff-cohere

Description

@jeff-cohere

We've talked here and there about how to minimize unnecessary data transfers, and discussed the merits and drawbacks of various approaches. In particular, I'm not crazy about using a log to figure out where a file should or shouldn't be--I'd rather ask the source of truth itself!

In this connection, I'm considering an additional endpoint for the Database specification that searches for files by their MD5 checksums specifically, instead of using search queries. This endpoint would accept an array of checksums and return their corresponding file IDs (or null in the case that they aren't found).

Obviously this is a very complicated problem to solve, and the above approach doesn't begin to handle all of the nastiness to do with files that have been transferred but don't yet have IDs, etc. But I think it would at least give us a solid point of departure. I think I can probably stand up a JDP file checksum search endpoint that uses JAMO.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions