-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problem:MillionAID dataset #18
Comments
@wangjiawen-123 我们把原始训练集(1w张)和原始测试集(99w张)共同拿来做预训练。实际操作时,为了保证和先前的RSP监督预训练的可比较性,我们仍然是将100w张图片分成两部分,首先挑出51000张,然后拿剩下的95w张做预训练,见Section III-A-4), |
请问51000张是以什么规则挑出的呢,是否可以提供原始的txt |
@XinzheGeng 数据集划分规则RSP那篇文章有写,如果我没记错的话,应该是从每类随机挑1000张组成验证集,剩下的作为训练集 |
谢谢,看到了 |
请问作者,文中使用的Million数据集在官网下载地址中,train文件只有1w张图片 1.82G,与所提到的MillionAID有百万图片不符,请问具体实验是使用多少图片做训练和测试的?
The text was updated successfully, but these errors were encountered: