You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
RDT is supposed to be setting the random seed based on the first 5 values in the column. For a lot of cases, this is sufficiently allows users to:
(a) transform/reverse transform in a reproducible way while also
(b) creating different data for each different column
However, this would break down if you have 2 different columns and the first 5 values of those columns are exactly the same. In such a case, the random seed for those columns would be exactly the same, so whatever randomness they have will be in-sync.
This is a general issue that we can use to track the problem. If it ends up affecting a lot of users, we may want to consider finding a different way to set the seed.
Additional context
This issue assumes that #906 has already been fixed. (#906 identifies an bug where the first five column names are being used rather than the first 5 data values.)
The text was updated successfully, but these errors were encountered:
Problem Description
RDT is supposed to be setting the random seed based on the first 5 values in the column. For a lot of cases, this is sufficiently allows users to:
(a) transform/reverse transform in a reproducible way while also
(b) creating different data for each different column
However, this would break down if you have 2 different columns and the first 5 values of those columns are exactly the same. In such a case, the random seed for those columns would be exactly the same, so whatever randomness they have will be in-sync.
This is a general issue that we can use to track the problem. If it ends up affecting a lot of users, we may want to consider finding a different way to set the seed.
Additional context
This issue assumes that #906 has already been fixed. (#906 identifies an bug where the first five column names are being used rather than the first 5 data values.)
The text was updated successfully, but these errors were encountered: