-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Flagging file loss/corruption #14
Comments
F2F 2018.09.05: Clarification of 'Corruption' -- the first is that the bits changed on disk, i.e. 'bitrot', leading to files not matching their checksum. The second is format corruption: A file was provided with a correct checksum, but it did not conform to its own spec, e.g., an empty 'zip' file or a badly-encoded JPEG2000. Issue moved to #30. We should not allow broken invalid OCFL objects to exist. In spec this is a "MUST" that objects must be valid, and checksums should match. Interventions can include fixing an object to be valid, or, if this is not possible, deleting the object. |
NB: Changed original description to clarify parts of the Use Case that were determined not be in scope. |
I would say checksum failure is an OCFL concern. File format validation is not - it is too wooly to define what valid is. Try and define what a valid doc file is. |
Editors' meeting 2023-09-22: This might be handled by the same tombstone/flagging mechanism as #42 |
Feedback on Use CasesIn advance of version 2 of the OCFL, we are soliciting feedback on use cases. Please feel free to add your thoughts on this use case via the comments. Polling on Use CasesIn addition to reviewing comments, we are doing an informal poll for each use case that has been tagged as
The poll will remain open through the end of February 2024. |
Does "no longer matching it's checksum" also encompass when the file is simply missing or is that another case? In the event that a file has changed/gone missing and there isn't a way to repair the on-storage ocfl object (for example a replacement file can't be found), it would be good to be able to acknowledge that the file is broken in the object and make the object correct by indicating that the file should have been there but isn't. That way in the future you can either search for objects which contain this information for remediation. |
at the time of this comment the vote tallied to +2 we are marking this as in-scope for version 2 because we suspect it will be related to #42 which is already in scope for version 2 of the specification |
2024-09-20 Editors agree that this use case will be combined with the solution for File Deletion #42, see comment thread there |
A periodic audit of a filesystem has revealed that a PDF file no longer matches its checksum, and will no longer open in a PDF reader. Checksums should be used to flag this as a problem and alert a validation client that the object is no longer valid.
The text was updated successfully, but these errors were encountered: