-
Notifications
You must be signed in to change notification settings - Fork 690
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Fast registration removes datetime metadata from files #4711
Comments
🤖 Hey there @deadlycoconuts! I'm Dosu, your friendly code enthusiast bot. Whether it's squashing bugs, answering queries, or assisting you become a contributor, I'm here to help. While the human maintainers are off refueling on coffee, I'm your go-to bot! Let's debug this together, shall we? 🐞💻 Thank you for bringing this issue to our attention. It seems like the The Unfortunately, without more information, it's difficult to determine the exact cause of this issue. It would be helpful if you could provide more details about your environment and the exact steps you're taking when you encounter this issue. This could include information such as:
This additional information will help us better understand the issue and work towards a resolution. Thank you for your cooperation. Dosu SourcesThis response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research. Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant. If you want to continue the conversation, start your reply with @dosu-bot. |
This was originally added to ensure that the pre-signed URL used to upload the compressed file used during fast registration was deterministic, otherwise, simply opening a file might change its metadata and consequently change the hash used during fast registration. One of the original intents behind this feature is to allow clients to leverage the case where they are about to overwrite an existing file in the blob store and use the (local) hash to decide not to upload the file (since it's guaranteed that the metadata matches). Before we decide to go one way or the other, can you talk about the problem that not having the original metadata in the files is causing you, @deadlycoconuts , @ChungYujoyce? |
Thanks @ChungYujoyce for the PR and @eapolinario for taking a look at this. The reason why we brought this up was because one of our users was attempting to zip certain files in his/her Flyte task but the package
While it's a library-specific error on the metadata, this is quite an unexpected difference in behaviour for regular registration vs fast registration from the user's perspective - one approach leads to an error but not the other. |
@eapolinario Can you hash the files without using any time data? In this case, hashing should be time-agnostic, only the changes matter. We don't need to include the time and then strip out. pls correct me if I'm wrong. |
The hashing was hashing the tarfile and tarfile with timestamp in the metadata affects this. Not sure where the zone is used? |
@ChungYujoyce , I noticed that you shared an excerpt of a python exception, correct? From that it looks like you're using
Can you try that? |
Describe the bug
Using
pyflyte register
to fast register a workflow strips the datetime metadata from all the files that are packaged into the.tar.gz
file. This doesn't happen when usingpyflyte package
orflytectl register
.Flyte version: 1.10.6
Flytekit version: 1.10.1
Python version: 3.10.13
Expected behavior
Using
pyflyte register
to fast register a workflow should preserve the datetime metadata from all the files that are packaged into the.tar.gz
file.Additional context to reproduce
No response
Screenshots
Files that are zipped using fast registration have their datetime metadata stripped (all the datetime fields default to 1 Jan 1970 7.30 AM):
Files that are packaged using
pyflyte register
do not encounter this issue:Are you sure this issue hasn't been raised already?
Have you read the Code of Conduct?
The text was updated successfully, but these errors were encountered: